DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n6e321d01 Viewer • Updated 7 days ago • 267 • 7
DCAgent2/swebench_verified_random_100_folders_rl_rl_conf_24GP_base_yaml_mode_path_r2eg_n6d1a5547 Viewer • Updated 7 days ago • 300 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nc17da311 Viewer • Updated 7 days ago • 267 • 8
DCAgent2/terminal_bench_2_glm46_swegym_tasks_maxeps_131k_lc_20260308_165252 Viewer • Updated 7 days ago • 267 • 6
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n095db287 Viewer • Updated 7 days ago • 267 • 8
DCAgent2/eval__openthoughts-tb-dev__r2egym-nl2bash-stack__lambda-traces Viewer • Updated 7 days ago • 5.63k • 8
DCAgent2/terminal_bench_2_bs64_rloo_n_noct_stri_micr_auto_conv_pref_model_r2e_120_20260323746588 Viewer • Updated 7 days ago • 267 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n9f4e8694 Viewer • Updated 7 days ago • 267 • 7
DCAgent2/swebench_verified_random_100_folders_glm46_Toolscale_tasks_traces_20260308_172629 Viewer • Updated 7 days ago • 300 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_ncc4c8a29 Viewer • Updated 7 days ago • 267 • 7
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_396093dc5 Viewer • Updated 7 days ago • 300 • 7
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nd79f15ae Viewer • Updated 7 days ago • 267 • 6
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n155cf0b4 Viewer • Updated 7 days ago • 267 • 7
DCAgent2/terminal_bench_2_r2egym_nl2bash_stack_bugsseq_fixthink_stack_csharp_20260308_125216 Viewer • Updated 7 days ago • 266 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nb44ca51d Viewer • Updated 7 days ago • 267 • 7
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nff1e5748 Viewer • Updated 8 days ago • 267 • 7
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_37c07c149 Viewer • Updated 8 days ago • 300 • 7
DCAgent2/swebench_verified_random_100_folders_rl_r2egym_nl2bash_stack_bugsseq_fixthink_le5a7d45d Viewer • Updated 8 days ago • 300 • 6
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nda23bbc4 Viewer • Updated 8 days ago • 267 • 8
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_2311d395 Viewer • Updated 8 days ago • 300 • 7
DCAgent2/GLM-4.7-swe_rebench-sandboxes-maxeps-131k-jup_chunk4 Viewer • Updated 8 days ago • 1.24k • 8
DCAgent2/GLM-4.7-swe_rebench-sandboxes-maxeps-131k-jup_chunk0 Viewer • Updated 8 days ago • 1.24k • 10
DCAgent2/GLM-4.7-swe_rebench-sandboxes-maxeps-131k-jup_chunk3 Viewer • Updated 8 days ago • 1.24k • 6
DCAgent2/GLM-4.7-swe_rebench-sandboxes-maxeps-131k-jup_chunk2 Viewer • Updated 8 days ago • 1.24k • 10
DCAgent2/GLM-4.7-swe_rebench-sandboxes-maxeps-131k-jup_chunk1 Viewer • Updated 8 days ago • 1.24k • 8
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3d52e76a1 Viewer • Updated 8 days ago • 300 • 8
DCAgent2/swebench_verified_random_100_folders_rl_rl_conf_24GP_base_yaml_mode_path_r2eg_n30e029c8 Viewer • Updated 8 days ago • 300 • 8
DCAgent2/swebench_verified_random_100_folders_rl_swesmith_fixthink_pymethods2test_45_202b0c44f46 Viewer • Updated 8 days ago • 300 • 8