coding-benchmarking dataset data-sets for benchmarking LLM for software devt livebench/liveswebench Viewer • Updated Mar 31, 2025 • 53 • 378 • 1 livebench/liveswebench-patches Viewer • Updated Mar 31, 2025 • 1 • 70 livebench/reasoning Viewer • Updated Apr 7, 2025 • 200 • 4.69k • 18 livebench/data_analysis Viewer • Updated Apr 7, 2025 • 150 • 3k • 6
coding-benchmarking dataset data-sets for benchmarking LLM for software devt livebench/liveswebench Viewer • Updated Mar 31, 2025 • 53 • 378 • 1 livebench/liveswebench-patches Viewer • Updated Mar 31, 2025 • 1 • 70 livebench/reasoning Viewer • Updated Apr 7, 2025 • 200 • 4.69k • 18 livebench/data_analysis Viewer • Updated Apr 7, 2025 • 150 • 3k • 6