Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Quantummed 's Collections
coding-benchmarking dataset

coding-benchmarking dataset

updated Oct 11, 2025

data-sets for benchmarking LLM for software devt

Upvote
-

  • livebench/liveswebench

    Viewer • Updated Mar 31, 2025 • 53 • 378 • 1

  • livebench/liveswebench-patches

    Viewer • Updated Mar 31, 2025 • 1 • 70

  • livebench/reasoning

    Viewer • Updated Apr 7, 2025 • 200 • 4.69k • 18

  • livebench/data_analysis

    Viewer • Updated Apr 7, 2025 • 150 • 3k • 6

  • livebench/coding

    Viewer • Updated Apr 7, 2025 • 128 • 10.2k • 9

  • livebench/instruction_following

    Viewer • Updated Apr 7, 2025 • 400 • 3.25k • 5

  • livebench/math

    Viewer • Updated Apr 7, 2025 • 368 • 5.23k • 1

  • livebench/language

    Viewer • Updated Apr 7, 2025 • 190 • 2.88k

  • livebench/model_judgment

    Viewer • Updated Apr 7, 2025 • 60.4k • 602 • 1

  • livebench/model_answer

    Viewer • Updated Oct 22, 2024 • 93.7k • 118
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs