Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Evaluation datasets
community
Activity Feed
Follow
74
AI & ML interests
None defined yet.
Recent Activity
alozowski
authored
a paper
3 days ago
YourBench: Easy Custom Evaluation Sets for Everyone
SaylorTwift
new
activity
10 days ago
OpenEvals/SimpleQA:
adds_eval_yaml
SaylorTwift
updated
a dataset
10 days ago
OpenEvals/SimpleQA
View all activity
Team members
8
lighteval
's datasets
192
Sort: Recently updated
lighteval/RULER-4096-Falcon-H1-3B-Base
Viewer
•
Updated
Jun 18
•
6.5k
•
21
lighteval/RULER-4096-Lamma3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
37
lighteval/RULER-4096-Qwen2.5-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
30
lighteval/RULER-8192-Falcon-H1-3B-Base
Viewer
•
Updated
Jun 18
•
6.5k
•
73
lighteval/RULER-8192-Lamma3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
31
lighteval/RULER-8192-Qwen2.5-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
191
lighteval/RULER-8192-Qwen-3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
55
lighteval/RULER-8192-Qwen-3
Viewer
•
Updated
Jun 18
•
6.5k
•
45
lighteval/RULER-16384-Falcon-H1-3B-Base
Viewer
•
Updated
Jun 18
•
6.5k
•
80
lighteval/RULER-16384-Lamma3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
65
lighteval/RULER-16384-Qwen2.5-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
25
lighteval/RULER-16384-Qwen-3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
31
lighteval/RULER-16384-Qwen-3
Viewer
•
Updated
Jun 18
•
6.5k
•
41
lighteval/RULER-32768-Falcon-H1-3B-Base
Viewer
•
Updated
Jun 18
•
6.5k
•
42
lighteval/RULER-32768-Lamma3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
60
lighteval/RULER-32768-Qwen2.5-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
89
lighteval/RULER-32768-Qwen-3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
43
•
2
lighteval/RULER-32768-Qwen-3
Viewer
•
Updated
Jun 18
•
6.5k
•
125
lighteval/RULER-65536-Falcon-H1-3B-Base
Viewer
•
Updated
Jun 18
•
6.5k
•
263
lighteval/RULER-65536-Lamma3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
101
lighteval/RULER-65536-Qwen2.5-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
90
lighteval/RULER-65536-Qwen-3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
28
lighteval/RULER-65536-Qwen-3
Viewer
•
Updated
Jun 18
•
6.5k
•
56
lighteval/RULER-131072-Falcon-H1-3B-Base
Viewer
•
Updated
Jun 18
•
6.5k
•
47
lighteval/RULER-131072-Lamma3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
71
lighteval/RULER-131072-Qwen2.5-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
105
lighteval/RULER-131072-Qwen-3-Instruct
Viewer
•
Updated
Jun 18
•
6.5k
•
44
lighteval/RULER-131072-Qwen-3
Viewer
•
Updated
Jun 18
•
6.5k
•
49
lighteval/okapi_mmlu
Viewer
•
Updated
Mar 24
•
443k
•
228
•
1
lighteval/okapi_arc_challenge
Viewer
•
Updated
Mar 24
•
79.6k
•
152
•
1
Previous
1
...
3
4
5
6
7
Next