MME-Benchmarks

non-profit

AI & ML interests

Multimodal LLMs

Recent Activity

THUdyh authored a paper 14 days ago

Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos

THUdyh authored a paper 14 days ago

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

THUdyh authored a paper 14 days ago

S-Agent: Spatial Tool-Use Elicits Reasoning for Spatial Intelligence

Papers

Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

MME-Benchmarks's models

None public yet