Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Quantummed
's Collections
coding-benchmarking dataset
coding-benchmarking dataset
updated
Oct 11, 2025
data-sets for benchmarking LLM for software devt
Upvote
-
livebench/liveswebench
Viewer
•
Updated
Mar 31, 2025
•
53
•
378
•
1
livebench/liveswebench-patches
Viewer
•
Updated
Mar 31, 2025
•
1
•
70
livebench/reasoning
Viewer
•
Updated
Apr 7, 2025
•
200
•
4.69k
•
18
livebench/data_analysis
Viewer
•
Updated
Apr 7, 2025
•
150
•
3k
•
6
livebench/coding
Viewer
•
Updated
Apr 7, 2025
•
128
•
10.2k
•
9
livebench/instruction_following
Viewer
•
Updated
Apr 7, 2025
•
400
•
3.25k
•
5
livebench/math
Viewer
•
Updated
Apr 7, 2025
•
368
•
5.23k
•
1
livebench/language
Viewer
•
Updated
Apr 7, 2025
•
190
•
2.88k
livebench/model_judgment
Viewer
•
Updated
Apr 7, 2025
•
60.4k
•
602
•
1
livebench/model_answer
Viewer
•
Updated
Oct 22, 2024
•
93.7k
•
118
Upvote
-
Share collection
View history
Collection guide
Browse collections