DCAgent2/DCAgent2_terminal_bench_2_laion_dev_set_part1_10k_glm_4_7_traces_locetash_tm4x_c7ecbc26 Viewer • Updated 13 days ago • 229 • 9
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_38fc13ff0 Viewer • Updated 13 days ago • 300 • 9
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3bc5c959d Viewer • Updated 13 days ago • 300 • 8
DCAgent2/DCAgent2_terminal_bench_2_laion_exp-gfi-staqc-embedding-mean-filtered-10K_glm_403c1ab04 Viewer • Updated 13 days ago • 267 • 9
DCAgent2/DCAgent2_terminal_bench_2_laion_dev_set_part1_10k_glm_4_7_traces_locetash_tm2x_464fbcd6 Viewer • Updated 13 days ago • 230 • 9
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n06686347 Viewer • Updated 13 days ago • 267 • 9
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n2b795cf5 Viewer • Updated 13 days ago • 267 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n84a9ea79 Viewer • Updated 13 days ago • 267 • 11
DCAgent2/swebench_verified_random_100_folders_exp_syh_r2egym_swesmith_mixed_glm_4_7_trac5e5e90ca Viewer • Updated 13 days ago • 300 • 7
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n628c4122 Viewer • Updated 13 days ago • 267 • 9
DCAgent2/DCAgent_dev_set_v2_laion_sft_GLM-4-7-swesmith-sandboxes-with_tests-oracle_verif72e0cbfb Viewer • Updated 13 days ago • 169 • 6
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_na21fa9e8 Viewer • Updated 13 days ago • 267 • 9
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nda6f89c2 Viewer • Updated 13 days ago • 267 • 9
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n24d847a9 Viewer • Updated 13 days ago • 267 • 9
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_39d1b8287 Viewer • Updated 13 days ago • 300 • 9
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_na7d55e8d Viewer • Updated 13 days ago • 267 • 9
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3c72c5d6e Viewer • Updated 13 days ago • 300 • 11
DCAgent2/DCAgent2_terminal_bench_2_laion_sft_GLM-4-7-swesmith-sandboxes-with_tests-oracl3c119c65 Viewer • Updated 13 days ago • 229 • 9
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n79a04a62 Viewer • Updated 13 days ago • 267 • 11
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nb2976578 Viewer • Updated 13 days ago • 267 • 11
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n334397c3 Viewer • Updated 13 days ago • 267 • 9
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_30958b6f3 Viewer • Updated 13 days ago • 300 • 7
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n33157e73 Viewer • Updated 13 days ago • 267 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n74a633ed Viewer • Updated 13 days ago • 267 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n315b42e5 Viewer • Updated 13 days ago • 267 • 8
DCAgent2/medagentbench_laion_exp_tas_timeout_multiplier_1_0_traces Viewer • Updated 13 days ago • 894 • 12
DCAgent2/medagentbench_DCAgent_exp_tas_max_episodes_32_traces Viewer • Updated 13 days ago • 896 • 12
DCAgent2/medagentbench_laion_perturbed-docker-exp-freelancer-tasks_glm_4_7_traces Viewer • Updated 13 days ago • 893 • 10
DCAgent2/medagentbench_laion_GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasonin0049b2df Viewer • Updated 13 days ago • 898 • 12