Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
guanning-ai
's Collections
MaxRL
Reasoning-Benchmarks
Reasoning-Benchmarks
updated
22 days ago
A collection of mutiple benchmarks for large reasoning model evaluation
Upvote
-
guanning/amc23
Viewer
•
Updated
May 25, 2025
•
40
•
17
guanning/math
Viewer
•
Updated
Jun 12, 2025
•
12.5k
•
20
guanning/aime24
Viewer
•
Updated
May 25, 2025
•
30
•
5
guanning/aime25
Viewer
•
Updated
May 25, 2025
•
30
•
10
guanning/gsm8k
Viewer
•
Updated
May 25, 2025
•
8.79k
•
9
guanning/olympiadbench
Viewer
•
Updated
May 28, 2025
•
675
•
7
guanning-ai/dapo17k
Viewer
•
Updated
Nov 9, 2025
•
17.2k
•
32
guanning-ai/dapo14k
Viewer
•
Updated
Jun 11, 2025
•
14k
•
6
guanning-ai/mmlu-pro
Viewer
•
Updated
Jul 4, 2025
•
12k
•
5
guanning-ai/knowlogic-en
Viewer
•
Updated
Jul 10, 2025
•
2.4k
•
9
guanning-ai/bigmath
Viewer
•
Updated
Jul 27, 2025
•
251k
•
15
guanning-ai/COM2
Viewer
•
Updated
Aug 6, 2025
•
3.76k
•
5
guanning-ai/beyondaime
Viewer
•
Updated
Oct 21, 2025
•
100
•
44
guanning-ai/Polaris-53K
Viewer
•
Updated
Dec 11, 2025
•
53.3k
•
7
guanning-ai/openr1-93K
Viewer
•
Updated
Dec 11, 2025
•
93.7k
•
7
guanning-ai/gsm8k-mugglemath
Viewer
•
Updated
Dec 27, 2025
•
157k
•
5
guanning-ai/gsm8k-metamath
Viewer
•
Updated
Dec 30, 2025
•
160k
•
8
guanning-ai/gsm8k-mumath
Viewer
•
Updated
Dec 27, 2025
•
92k
•
5
guanning-ai/minervamath
Viewer
•
Updated
Jan 2
•
272
•
14
guanning-ai/gsm8k-platinum
Viewer
•
Updated
Jan 7
•
1.21k
•
8
guanning-ai/dapo17k_splited
Viewer
•
Updated
Mar 2
•
17.2k
•
17
guanning-ai/hmmt2025feb
Viewer
•
Updated
Mar 2
•
30
•
14
guanning-ai/sciknoweval_l3
Viewer
•
Updated
Mar 3
•
4.14k
•
34
guanning-ai/hmmt2025nov
Viewer
•
Updated
26 days ago
•
30
•
21
guanning-ai/jeebench-math
Viewer
•
Updated
26 days ago
•
236
•
20
Upvote
-
Share collection
View history
Collection guide
Browse collections