SWE-bench-Live The datasets for benchmarking and training of LLM coding agents. SWE-bench-Live/SWE-bench-Live Viewer • Updated Sep 18, 2025 • 3.69k • 7.75k • 7 SWE-bench-Live/MultiLang Viewer • Updated May 16 • 743 • 945 SWE-bench-Live/Windows Viewer • Updated May 14 • 61 • 65
SWE-bench-Live The datasets for benchmarking and training of LLM coding agents. SWE-bench-Live/SWE-bench-Live Viewer • Updated Sep 18, 2025 • 3.69k • 7.75k • 7 SWE-bench-Live/MultiLang Viewer • Updated May 16 • 743 • 945 SWE-bench-Live/Windows Viewer • Updated May 14 • 61 • 65