guanning-ai's Collections
Reasoning-Benchmarks
Updated 9 days ago
A collection of multiple benchmarks for evaluating large reasoning models
guanning/amc23 • Viewer • Updated May 25, 2025 • 40 • 13
guanning/math • Viewer • Updated Jun 12, 2025 • 12.5k • 20
guanning/aime24 • Viewer • Updated May 25, 2025 • 30 • 13
guanning/aime25 • Viewer • Updated May 25, 2025 • 30 • 10
guanning/gsm8k • Viewer • Updated May 25, 2025 • 8.79k • 13
guanning/olympiadbench • Viewer • Updated May 28, 2025 • 675 • 16
guanning-ai/dapo17k • Viewer • Updated Nov 9, 2025 • 17.2k • 7
guanning-ai/dapo14k • Viewer • Updated Jun 11, 2025 • 14k • 9
guanning-ai/mmlu-pro • Viewer • Updated Jul 4, 2025 • 12k • 8
guanning-ai/knowlogic-en • Viewer • Updated Jul 10, 2025 • 2.4k • 5
guanning-ai/bigmath • Viewer • Updated Jul 27, 2025 • 251k • 6
guanning-ai/COM2 • Viewer • Updated Aug 6, 2025 • 3.76k • 12
guanning-ai/beyondaime • Viewer • Updated Oct 21, 2025 • 100 • 47
guanning-ai/Polaris-53K • Viewer • Updated Dec 11, 2025 • 53.3k • 10
guanning-ai/openr1-93K • Viewer • Updated Dec 11, 2025 • 93.7k • 19
guanning-ai/gsm8k-mugglemath • Viewer • Updated 15 days ago • 157k • 14
guanning-ai/gsm8k-metamath • Viewer • Updated 12 days ago • 160k • 28
guanning-ai/gsm8k-mumath • Viewer • Updated 15 days ago • 92k • 21
guanning-ai/minervamath • Viewer • Updated 9 days ago • 272 • 10