Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
josefonte
's Collections
Benchmarks
Benchmarks
updated
Mar 28, 2025
collection of datasets used to train and test MLMMs (VLMs)
Upvote
-
AI4Math/MathVerse
Viewer
•
Updated
May 15, 2025
•
4.73k
•
3.36k
•
68
MMMU/MMMU
Viewer
•
Updated
20 days ago
•
11.6k
•
91.1k
•
320
MMMU/MMMU_Pro
Viewer
•
Updated
Mar 8, 2025
•
5.19k
•
14.2k
•
47
AI4Math/MathVista
Viewer
•
Updated
Feb 11, 2024
•
6.14k
•
13.1k
•
204
MathLLMs/MathVision
Viewer
•
Updated
Nov 27, 2025
•
3.34k
•
13.6k
•
129
TIGER-Lab/MEGA-Bench
Viewer
•
Updated
May 7, 2025
•
7.69k
•
312
•
23
lmms-lab/MMBench_EN
Viewer
•
Updated
Mar 8, 2024
•
11.1k
•
545
•
7
Lin-Chen/MMStar
Viewer
•
Updated
Apr 7, 2024
•
1.5k
•
16.7k
•
47
lmms-lab/MME
Viewer
•
Updated
Dec 23, 2023
•
2.37k
•
27.5k
•
28
MUIRBENCH/MUIRBENCH
Viewer
•
Updated
Jul 1, 2024
•
2.6k
•
2.54k
•
17
BLINK-Benchmark/BLINK
Viewer
•
Updated
Sep 3, 2025
•
3.81k
•
15.8k
•
39
OpenGVLab/CRPE
Viewer
•
Updated
Mar 21, 2024
•
544
•
113
•
9
ByteDance/MTVQA
Viewer
•
Updated
May 30, 2024
•
8.79k
•
207
•
42
lmms-lab/RealWorldQA
Viewer
•
Updated
Apr 13, 2024
•
765
•
7.93k
•
6
yifanzhang114/MME-RealWorld
Preview
•
Updated
Nov 14, 2024
•
541
•
21
lmms-lab/MMVet
Viewer
•
Updated
Mar 8, 2024
•
218
•
2.23k
•
4
mistralai/MM-MT-Bench
Viewer
•
Updated
Oct 10, 2024
•
92
•
1.3k
•
26
edinburgh-dawg/mmlu-redux
Viewer
•
Updated
Feb 9, 2025
•
3k
•
3.28k
•
37
TIGER-Lab/MMLU-Pro
Benchmark
•
Updated
Jan 19
•
12.1k
•
87.8k
•
439
Idavidrein/gpqa
Benchmark
•
Updated
Jan 22
•
1.25k
•
91.5k
•
372
openai/gsm8k
Benchmark
•
Updated
Dec 20, 2025
•
17.6k
•
492k
•
1.18k
openai/openai_humaneval
Viewer
•
Updated
Jan 4, 2024
•
164
•
190k
•
370
nuprl/MultiPL-E
Viewer
•
Updated
Jul 15, 2025
•
12.7k
•
33k
•
64
google/IFEval
Viewer
•
Updated
Aug 14, 2024
•
541
•
61.8k
•
134
opendatalab/OmniDocBench
Viewer
•
Updated
Sep 26, 2025
•
1.36k
•
9.09k
•
71
wulipc/CC-OCR
Viewer
•
Updated
Dec 27, 2024
•
7.06k
•
1.77k
•
5
lmms-lab/ai2d
Viewer
•
Updated
Mar 26, 2024
•
3.09k
•
9.69k
•
18
lmms-lab/textvqa
Viewer
•
Updated
Mar 8, 2024
•
45.3k
•
18.1k
•
23
lmms-lab/DocVQA
Viewer
•
Updated
Apr 18, 2024
•
16.6k
•
21.7k
•
70
HuggingFaceM4/ChartQA
Viewer
•
Updated
Mar 5, 2024
•
32.7k
•
8.22k
•
61
princeton-nlp/CharXiv
Viewer
•
Updated
Sep 27, 2024
•
2.32k
•
2.09k
•
45
AILab-CVC/SEED-Bench-2-plus
Viewer
•
Updated
Apr 27, 2024
•
555
•
142
•
5
echo840/OCRBench
Viewer
•
Updated
Dec 18, 2024
•
1k
•
19.7k
•
22
lmms-lab/OCRBench-v2
Viewer
•
Updated
Feb 9, 2025
•
10k
•
1.06k
•
12
Upvote
-
Share collection
View history
Collection guide
Browse collections