DCAgent/DCAgent_dev_set_71_tasks_mlfoundations-dev_freelancer-projects-sandboxes-tracesf3b86114
• Updated • 9 • 7
DCAgent/Qwen3-8B-dev-71-tasks
• Updated • 69 • 5
DCAgent/dev-set-71-tasks-fixed-nov-7
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_staqc-sandboxes-traces-terminus-2_Qwen3-8B-Base41f41393
• Updated • 137k • 12
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_staqc-sandboxes-traces-terminus-2_Qwen3-4B-Thine5e4e3b3
• Updated • 18.3k • 5
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_cutofe0851f37
• Updated • 29.2k • 7
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_staqc-sandboxes-traces-terminus-2_Qwen3-1-7B_20452f046c
• Updated • 28k • 8
DCAgent/bench-traces-480B_g16_pp4_tp1_r4_ep8_c32
• Updated • 491 • 4
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_warmuff558f69
• Updated • 17.6k • 4
DCAgent/DCAgent_dev_set_71_tasks_mlfoundations-dev_stackexchange-overflow-sandboxes-trac6871b135
• Updated • 58.4k • 4
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_adam73c415ce
• Updated • 18.1k • 4
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_weighbfdc2d38
• Updated • 17.7k • 9
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_staqc-sandboxes-traces-terminus-2_Qwen3-4B-Instf3fddc60
• Updated • 13.2k • 4
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-long-instruction-filter_Qwen3-8B_20255e8f414
• Updated • 132k • 13
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_adam72dc9ad8
• Updated • 17.9k • 5
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_freelancer-askllm-filtered-sandboxes-traces-ter33ebca93
• Updated • 133k • 27
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_cutof5406fdd2
• Updated • 15.2k • 8
DCAgent/DCAgent_dev_set_71_tasks_mlfoundations-dev_inferredbugs-sandboxes-traces-terminu6c3c9fcf
• Updated • 70.1k • 6
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_adameb7fae7e
• Updated • 17.8k • 7
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code_contests_10k_OG_10k_New_Questions_GPT5-min02042521
• Updated • 18.5k • 6
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_adam24f1dfad
• Updated • 16.5k • 6
DCAgent/DCAgent_dev_set_71_tasks_mlfoundations-dev_nemo-prism-math-sandboxes-traces-term0bdc8136
• Updated • 15.7k • 7
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_weighdbca32d7
• Updated • 17.6k • 6
DCAgent/bench-traces-480B_g8_pp4_tp1_r2_ep8_c32
• Updated • 483 • 6
DCAgent/DCAgent_dev_set_71_tasks_DCAgent_code-contests-sandboxes-traces-terminus-2_adamc4418e27
• Updated • 17.4k • 4
DCAgent/bench-traces-480B_g4_pp4_tp1_r1_ep8_c32
• Updated • 453 • 5
DCAgent/llm-verifier-freelancer
• Updated • 10k • 9
DCAgent/llm-verifier-clean-sandboxes-eval-set
• Updated • 90 • 4
DCAgent/bench-traces-30B_g16_pp4_tp1_r4_ep8_c32
• Updated • 496 • 4
DCAgent/DCAgent_dev_set_71_tasks_Qwen_Qwen3-14B_20251107_135831
• Updated • 5.77k • 2