DCAgent2/dev_set_v2_nemosci_tasrep_a1mfc_dev1_maxeps_32b__Qwen3_32B_20260417_190127 Viewer • Updated Apr 18 • 292 • 6
DCAgent2/dev_set_v2_nemosci_tasrep_a1mfc_dev1_maxeps_swes_r2eg_32b__Qwen3_32B_20260417_185915 Viewer • Updated Apr 18 • 299 • 7
DCAgent2/swebench_verified_random_100_folders_nemosci_tasrep_a1mfc_dev1_maxeps_32b__Qwen62b42fb1 Viewer • Updated Apr 18 • 300 • 6
DCAgent2/gaia_127_g1_min_episodes_e1_gpt_long_tacc_20260417_200409 Viewer • Updated Apr 18 • 373 • 5.44k
DCAgent2/terminal_bench_2_nemosci_tasrep_a1mfc_dev1_maxeps_swes_r2eg__Qwen3_8B_20260417_172242 Viewer • Updated Apr 18 • 258 • 7
DCAgent2/aider_polyglot_g1_min_episodes_e1_gpt_long_tacc_20260416_185035-traces Viewer • Updated Apr 17 • 675 • 7
DCAgent2/bfcl_parity_g1_min_episodes_e1_gpt_long_tacc_20260417_200337-traces Viewer • Updated Apr 17 • 354 • 8
DCAgent2/bfcl_parity_g1_min_episodes_e1_gpt_long_thinking_tacc_Qwen3_32B_20260417_194404-traces Viewer • Updated Apr 17 • 354 • 11
DCAgent2/medagentbench_g1_min_episodes_e1_gpt_long_tacc_20260417_200353 Viewer • Updated Apr 17 • 897 • 6
DCAgent2/terminal_bench_2_nemosci_tasrep_a1mfc_gfistaqc_dev1_scaff_maxeps__Qwen3_8B_20268f2cfab0 Viewer • Updated Apr 17 • 255 • 6
DCAgent2/medagentbench_g1_min_episodes_e1_gpt_long_thinking_tacc_Qwen3_32B_20260417_194451 Viewer • Updated Apr 17 • 900 • 6
DCAgent2/dev_set_v2_nemosci_tasrep_a1mfc_dev1_maxeps_swes_r2eg__Qwen3_8B_20260417_172150 Viewer • Updated Apr 17 • 288 • 7
DCAgent2/swebench_verified_random_100_folders_nemosci_tasrep_a1mfc_dev1_maxeps_swes_r2egb3d507fc Viewer • Updated Apr 17 • 300 • 7
DCAgent2/swebench_verified_random_100_folders_nemosci_tasrep_a1mfc_dev1_maxeps__Qwen3_8Be67bee26 Viewer • Updated Apr 17 • 300 • 8
DCAgent2/dev_set_v2_nemosci_tasrep_a1mfc_dev1_maxeps__Qwen3_8B_20260417_172147 Viewer • Updated Apr 17 • 290 • 7
DCAgent2/dev_set_v2_nemosci_tasrep_a1mfc_gfistaqc_dev1_scaff_maxeps__Qwen3_8B_20260417_172153 Viewer • Updated Apr 17 • 288 • 7
DCAgent2/swebench_verified_random_100_folders_nemosci_tasrep_a1mfc_gfistaqc_dev1_scaff_m3d894285 Viewer • Updated Apr 17 • 300 • 7
DCAgent2/terminal_bench_2_g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc_Qwenae9f2946 Viewer • Updated Apr 17 • 263 • 6
DCAgent2/swebench_verified_random_100_folders_swesmith_glm5_awq_traces_10k_tacc_202604154a426610 Viewer • Updated Apr 17 • 124 • 5
DCAgent2/terminal_bench_2_g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc_Qwen2023418f Viewer • Updated Apr 17 • 262 • 6
DCAgent2/terminal_bench_2_g1_min_episodes_e1_gpt_long_sampled_swesmith_psu_thinking_tacce3882488 Viewer • Updated Apr 17 • 261 • 11
DCAgent2/terminal_bench_2_g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc_Qwenf01bb680 Viewer • Updated Apr 17 • 263 • 7
DCAgent2/swebench_verified_random_100_folders_g1_min_episodes_sampled_131k_20260416_221145 Viewer • Updated Apr 17 • 286 • 7
DCAgent2/terminal_bench_2_g1_min_episodes_e1_gpt_long_thinking_tacc_Qwen3_32B_20260416_233700 Viewer • Updated Apr 17 • 261 • 6
DCAgent2/dev_set_v2_g1_min_episodes_e1_gpt_long_sampled_swesmith_psu_thinking_tacc_Qwen3f140de7e Viewer • Updated Apr 17 • 296 • 15