DCAgent2/DCAgent_dev_set_v2_laion_exp_tas_top_p_0_9_traces_20260302_123249 Viewer • Updated Mar 2 • 246 • 20
DCAgent2/DCAgent_dev_set_v2_laion_GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-rea3bf0e39d Viewer • Updated Mar 2 • 263 • 18
DCAgent2/DCAgent_dev_set_v2_laion_GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-rea7d0e66f0 Viewer • Updated Mar 2 • 243 • 19
DCAgent2/DCAgent_dev_set_v2_laion_GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reaac3919c1 Viewer • Updated Mar 2 • 254 • 18
DCAgent2/dev_set_71_tasks_r2egym_nl2bash_stack_bugsseq_fixthink_stack_csharp_20260301_160104 Viewer • Updated Mar 2 • 210 • 17
DCAgent2/dev_set_71_tasks_r2egym_nl2bash_stack_bugsseq_fixthink_methods2test_v2_20260301_160105 Viewer • Updated Mar 2 • 210 • 17
DCAgent2/dev_set_71_tasks_r2egym_nl2bash_stack_bugsseq_fixthink_again_20260301_160100 Viewer • Updated Mar 2 • 210 • 22
DCAgent2/dev_set_71_tasks_r2egym_nl2bash_stack_bugsseq_pytest_v2_20260301_145952 Viewer • Updated Mar 2 • 210 • 17
DCAgent2/dev_set_71_tasks_r2egym_nl2bash_stack_bugsseq_fixthink_stack_pytest_large_202609fb421cd Viewer • Updated Mar 2 • 210 • 16
DCAgent2/terminal_bench_2_dev_set_part1_10k_glm_4_7_traces_jupiter_cleaned_20260301_164806 Viewer • Updated Mar 2 • 263 • 18
DCAgent2/dev_set_71_tasks_r2egym_nl2bash_stack_bugsseq_junit_20260301_145958 Viewer • Updated Mar 2 • 210 • 19
DCAgent2/terminal_bench_2_r2egym_nl2bash_stack_bugsseq_pytest_v2_20260301_164726 Viewer • Updated Mar 2 • 267 • 18
DCAgent2/dev_set_v2_GLM_4_7_r2egym_sandboxes_maxeps_131k_lc_20260301_221049 Viewer • Updated Mar 2 • 294 • 18
DCAgent2/DCAgent_dev_set_v2_laion_exp-gfi-staqc-askllm-filtered-10K_glm_4_7_traces_jupit457cae71 Viewer • Updated Mar 2 • 173 • 20
DCAgent2/DCAgent_dev_set_v2_laion_glm46-stackexchange-tezos-maxeps-131k_20260302_074412 Viewer • Updated Mar 2 • 178 • 18
DCAgent2/terminal_bench_2_sft_GLM_4_7_swesmith_sandboxes_with_tests_oracle_verified_120sf9369deb Viewer • Updated Mar 2 • 263 • 17
DCAgent2/DCAgent_dev_set_v2_laion_stackexchange-tezos-sandboxes_glm_4_7_traces_locetash_643bedc5 Viewer • Updated Mar 2 • 183 • 18
DCAgent2/DCAgent_dev_set_v2_laion_glm46-neulab-agenttuning-alfworld-sandboxes-maxeps-131aaba1500 Viewer • Updated Mar 2 • 188 • 19
DCAgent2/DCAgent_dev_set_v2_laion_stackexchange-tezos-sandboxes_glm_4_6_traces_together_6dd97bd0 Viewer • Updated Mar 2 • 182 • 17
DCAgent2/DCAgent_dev_set_v2_DCAgent_nl2bash-nl2bash-bugsseq_Qwen3-8B-maxEps32-accThink-d384c9f4c Viewer • Updated Mar 2 • 258 • 16
DCAgent2/DCAgent_dev_set_v2_bespokelabs_Qwen3-8B-ot_step100_20260302_074401 Viewer • Updated Mar 2 • 181 • 17
DCAgent2/terminal_bench_2_r2egym_nl2bash_stack_bugsseq_bash_withtests_20260301_164734 Viewer • Updated Mar 2 • 267 • 15
DCAgent2/DCAgent_dev_set_v2_DCAgent_nl2bash-nl2bash-bugsseq_Qwen3-8B-maxEps24-112925harb137485eb Viewer • Updated Mar 2 • 279 • 14
DCAgent2/DCAgent_dev_set_v2_SWE-bench_SWE-agent-LM-7B_20260302_074346 Viewer • Updated Mar 2 • 220 • 17
DCAgent2/DCAgent_dev_set_v2_DCAgent_nl2bashG5CP-nl2bash-bs_Q3-8B-mE32-aT-dS-120325hbr_stf72b1266 Viewer • Updated Mar 2 • 247 • 17
DCAgent2/DCAgent_dev_set_v2_laion_exp_tas_linear_history_off_traces_20260302_074414 Viewer • Updated Mar 2 • 180 • 13
DCAgent2/DCAgent_dev_set_v2_laion_GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-read27ff8c2 Viewer • Updated Mar 2 • 289 • 12
DCAgent2/DCAgent_dev_set_v2_DCAgent_nl2bash-nl2bash-bugsseq_Qwen3-8B-maxEps32-accThink-d47592b14 Viewer • Updated Mar 2 • 281 • 10
DCAgent2/DCAgent_dev_set_v2_DCAgent_nl2bash-nl2bash-bugsseq_Qwen3-8B-maxEps32-accThink-df86dffc4 Viewer • Updated Mar 2 • 279 • 12
DCAgent2/DCAgent_dev_set_v2_DCAgent_nl2bash-nl2bash-bugsseq_Qwen3-8B-maxEps32-accThink-d5361b3d4 Viewer • Updated Mar 2 • 285 • 9