MentalBench: A Benchmark for Evaluating Psychiatric Diagnostic Capability of Large Language Models Paper • 2602.12871 • Published Feb 13 • 5
Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding Paper • 2604.05015 • Published 6 days ago • 225
TeichAI/gemma-4-26B-A4B-it-Claude-Opus-Distill Image-Text-to-Text • 27B • Updated about 15 hours ago • 1.44k • 14
TeichAI/gemma-4-31B-it-Claude-Opus-Distill Image-Text-to-Text • 31B • Updated about 8 hours ago • 6.31k • 31