Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
robbyrob42
's Collections
datasets
LLM Evals
Science LLM/ML Tools
LLM Evals
updated
5 days ago
Upvote
-
cais/mmlu
Viewer
•
Updated
Mar 8, 2024
•
231k
•
542k
•
761
ZhuofengLi/web-bench
Viewer
•
Updated
Jan 19
•
3.94k
•
396
OpenResearcher/web-bench
Viewer
•
Updated
16 days ago
•
5.5k
•
3.75k
•
4
Upvote
-
Share collection
View history
Collection guide
Browse collections