Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
hud
Team
company
https://www.hud.so
hud_evals
hud-evals
Activity Feed
Follow
8
AI & ML interests
AI, Evaluations, RL
Team members
5
hud-evals
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Articles
lorenss
updated
a dataset
6 months ago
hud-evals/SheetBench-50
Viewer
•
Updated
Dec 3, 2025
•
50
•
149
jdchawla29
updated
a dataset
6 months ago
hud-evals/SheetBench-50
Viewer
•
Updated
Dec 3, 2025
•
50
•
149
lorenss
updated
8 datasets
6 months ago
hud-evals/SpreadSheetBench-200
Viewer
•
Updated
Nov 23, 2025
•
200
•
85
hud-evals/SpreadSheetBench
Viewer
•
Updated
Nov 23, 2025
•
912
•
202
hud-evals/OSWorld-Gold-Mini
Viewer
•
Updated
Nov 18, 2025
•
20
•
35
hud-evals/2048-basic
Viewer
•
Updated
Nov 18, 2025
•
1
•
47
hud-evals/OSWorld-Verified
Viewer
•
Updated
Nov 18, 2025
•
369
•
113
hud-evals/Online-Mind2Web-Tiny
Viewer
•
Updated
Nov 18, 2025
•
10
•
20
hud-evals/Online-Mind2Web
Viewer
•
Updated
Nov 18, 2025
•
300
•
38
hud-evals/OSWorld-Gold
Viewer
•
Updated
Nov 18, 2025
•
294
•
127
parth220
published
2 datasets
6 months ago
hud-evals/SpreadSheetBench-200
Viewer
•
Updated
Nov 23, 2025
•
200
•
85
hud-evals/SpreadSheetBench
Viewer
•
Updated
Nov 23, 2025
•
912
•
202
lorenss
published
2 datasets
7 months ago
hud-evals/Online-Mind2Web-Tiny
Viewer
•
Updated
Nov 18, 2025
•
10
•
20
hud-evals/Online-Mind2Web
Viewer
•
Updated
Nov 18, 2025
•
300
•
38
lorenss
updated
a dataset
8 months ago
hud-evals/SheetBench-50_db_test
Viewer
•
Updated
Oct 1, 2025
•
50
•
19
lorenss
published
a dataset
8 months ago
hud-evals/SheetBench-50_db_test
Viewer
•
Updated
Oct 1, 2025
•
50
•
19
lorenss
updated
a dataset
8 months ago
hud-evals/OSWorld-Gold_db_test
Viewer
•
Updated
Sep 30, 2025
•
294
•
17
lorenss
published
a dataset
8 months ago
hud-evals/OSWorld-Gold_db_test
Viewer
•
Updated
Sep 30, 2025
•
294
•
17
lorenss
updated
a dataset
8 months ago
hud-evals/OSWorld-Verified-XLang_db_test
Viewer
•
Updated
Sep 30, 2025
•
369
•
34
lorenss
published
a dataset
8 months ago
hud-evals/OSWorld-Verified-XLang_db_test
Viewer
•
Updated
Sep 30, 2025
•
369
•
34
Load more