DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3784fcf03 Viewer • Updated 18 days ago • 300 • 10
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nc1f36548 Viewer • Updated 18 days ago • 267 • 13
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwenb614d222 Viewer • Updated 18 days ago • 300 • 14
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_5ec1a2bb Viewer • Updated 18 days ago • 300 • 12
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nf9a08746 Viewer • Updated 18 days ago • 267 • 9
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_36c670a3c Viewer • Updated 18 days ago • 300 • 13
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_37b767779 Viewer • Updated 18 days ago • 300 • 12
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n75ef2aa7 Viewer • Updated 18 days ago • 267 • 12
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwen373bb11f Viewer • Updated 18 days ago • 300 • 10
DCAgent2/swebench_verified_random_100_folders_GLM_4_7_stackexchange_tezos_sandboxes_maxeb63e8f41 Viewer • Updated 18 days ago • 300 • 13
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n8d0a982d Viewer • Updated 18 days ago • 267 • 11
DCAgent2/swebench_verified_random_100_folders_tbench_dev_71_nl2bash_bugsseq_Qwen3_8B_8noea505880 Viewer • Updated 18 days ago • 300 • 8
DCAgent2/DCAgent2_aider_polyglot_DCAgent2_nl2bash-stack-bugsseq Viewer • Updated 18 days ago • 675 • 15
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwen87ac5385 Viewer • Updated 18 days ago • 300 • 10
DCAgent2/DCAgent_dev_set_v2_laion_glm-4_6-dclm-baseline-terminal-traces-32ep-131k Viewer • Updated 18 days ago • 296 • 14
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_4e9207c7 Viewer • Updated 18 days ago • 300 • 17
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n39450f89 Viewer • Updated 18 days ago • 267 • 14
DCAgent2/DCAgent_dev_set_v2_laion_exp_tas_frequency_penalty_0_25_traces Viewer • Updated 18 days ago • 147 • 19
DCAgent2/DCAgent_dev_set_v2_laion_exp_tas_summarize_threshold_2048_traces Viewer • Updated 18 days ago • 160 • 17
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_4dfa69bf Viewer • Updated 18 days ago • 300 • 14
DCAgent2/DCAgent_dev_set_v2_laion_Qwen3-8B_perturbed-docker-exp-taskmaster2-tasks_glm_43cedcdf9 Viewer • Updated 18 days ago • 167 • 17
DCAgent2/DCAgent_dev_set_v2_laion_dev_set_part1_10k_glm_4_7_traces_locetash_tm4x_20260304_145314 Viewer • Updated 18 days ago • 173 • 18
DCAgent2/DCAgent_dev_set_v2_laion_glm46-glaive-code-assistant-sandboxes-maxeps-131k Viewer • Updated 18 days ago • 172 • 16
DCAgent2/DCAgent_dev_set_v2_laion_r2egym-nl2bash-stack-bugsseq-fixthink-stack-pytest-large Viewer • Updated 18 days ago • 164 • 16