mlfoundations-dev/DCAgent_dev_set_71_tasks_penfever_nl2bash-2ep_20251116_111730 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent_dev_set_71_tasks_DCAgent_freelancer-askllm-filtered-sandboxese4c04c37 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent_dev_set_71_tasks_mlfoundations-dev_nemo-prism-math-sandboxes-t1e3bfba7 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent_dev_set_71_tasks_mlfoundations-dev_inferredbugs-sandboxes-trac74e35da5 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent_dev_set_71_tasks_Qwen_Qwen3-4B-Thinking-2507_20251116_110617 Updated Nov 16, 2025 • 6
mlfoundations-dev/DCAgent_dev_set_71_tasks_penfever_GLM-4_6-codeforces-32ep-32k_20251116_111753 Updated Nov 16, 2025 • 8
mlfoundations-dev/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-termin8cb606b0 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-termina27f1ba8 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent_dev_set_71_tasks_DCAgent_code_contests_10k_OG_10k_New_Questiond3d5abd3 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent_dev_set_71_tasks_DCAgent_codeforces-gptoss120b-traces_20251116_114126 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent_dev_set_71_tasks_Qwen_Qwen3-Coder-30B-A3B-Instruct_20251116_110612 Updated Nov 16, 2025 • 8
mlfoundations-dev/DCAgent2_terminal_bench_2_penfever_GLM-4_6-codeforces-32ep-32k_20251115_174617 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent2_terminal_bench_2_penfever_nl2bash-2ep_20251115_174617 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent2_terminal_bench_2_Qwen_Qwen3-Coder-30B-A3B-Instruct_20251115_174607 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent2_terminal_bench_2_mlfoundations-dev_stackexchange-overflow-sanea09bd74 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent2_terminal_bench_2_penfever_nl2bash-verified-GLM-4_6-traces-32ea42eb0d3 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent2_terminal_bench_2_penfever_nl2bash-4ep_20251115_195753 Updated Nov 16, 2025 • 6
mlfoundations-dev/DCAgent2_terminal_bench_2_Qwen_Qwen3-4B-Thinking-2507_20251115_174607 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent2_terminal_bench_2_DCAgent_freelancer-long-instruction-filter_Qe7982902 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent2_terminal_bench_2_DCAgent_freelancer-askllm-filtered-sandboxesca505876 Updated Nov 16, 2025 • 7
mlfoundations-dev/DCAgent2_terminal_bench_2_mlfoundations-dev_inferredbugs-sandboxes-trabeb72207 Updated Nov 16, 2025 • 9
mlfoundations-dev/DCAgent2_terminal_bench_2_DCAgent_staqc-sandboxes-traces-terminus-2_Qw83cf348b Updated Nov 16, 2025 • 5
mlfoundations-dev/DCAgent2_terminal_bench_2_DCAgent_code-contests-sandboxes-traces-termif5b86b17 Updated Nov 16, 2025 • 8