Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Quantummed
's Collections
coding-benchmarking dataset
coding-benchmarking dataset
updated
Oct 11, 2025
data-sets for benchmarking LLM for software devt
Upvote
-
livebench/liveswebench
Viewer
•
Updated
Mar 31, 2025
•
53
•
18
•
1
livebench/liveswebench-patches
Viewer
•
Updated
Mar 31, 2025
•
1
•
51
livebench/reasoning
Viewer
•
Updated
Apr 7, 2025
•
200
•
4.49k
•
15
livebench/data_analysis
Viewer
•
Updated
Apr 7, 2025
•
150
•
3.75k
•
5
livebench/coding
Viewer
•
Updated
Apr 7, 2025
•
128
•
4.5k
•
7
livebench/instruction_following
Viewer
•
Updated
Apr 7, 2025
•
400
•
3.51k
•
4
livebench/math
Viewer
•
Updated
Apr 7, 2025
•
368
•
4.54k
•
1
livebench/language
Viewer
•
Updated
Apr 7, 2025
•
190
•
3.34k
livebench/model_judgment
Viewer
•
Updated
Apr 7, 2025
•
60.4k
•
187
•
1
livebench/model_answer
Viewer
•
Updated
Oct 22, 2024
•
93.7k
•
1.81k
Upvote
-
Share collection
View history
Collection guide
Browse collections