Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Quantummed 's Collections
coding-benchmarking dataset

coding-benchmarking dataset

updated Oct 11, 2025

data-sets for benchmarking LLM for software devt

Upvote
-

  • livebench/liveswebench

    Viewer • Updated Mar 31, 2025 • 53 • 18 • 1

  • livebench/liveswebench-patches

    Viewer • Updated Mar 31, 2025 • 1 • 51

  • livebench/reasoning

    Viewer • Updated Apr 7, 2025 • 200 • 4.49k • 15

  • livebench/data_analysis

    Viewer • Updated Apr 7, 2025 • 150 • 3.75k • 5

  • livebench/coding

    Viewer • Updated Apr 7, 2025 • 128 • 4.5k • 7

  • livebench/instruction_following

    Viewer • Updated Apr 7, 2025 • 400 • 3.51k • 4

  • livebench/math

    Viewer • Updated Apr 7, 2025 • 368 • 4.54k • 1

  • livebench/language

    Viewer • Updated Apr 7, 2025 • 190 • 3.34k

  • livebench/model_judgment

    Viewer • Updated Apr 7, 2025 • 60.4k • 187 • 1

  • livebench/model_answer

    Viewer • Updated Oct 22, 2024 • 93.7k • 1.81k
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs