DCAgent2/swebench_verified_random_100_folders_bs64_rloo_n_noct_stri_micr_model_noconv_r2b6ea6ba7 Viewer • Updated 4 days ago • 300 • 9
DCAgent2/swebench_verified_random_100_folders_bs64_rloo_n_noct_stri_micr_model_r2eg_nl2_4a47d21a Viewer • Updated 4 days ago • 300 • 9
DCAgent2/swebench_verified_random_100_folders_rl_rl_config_24GPU_base_yaml_model_path_Qwc87a1a63 Viewer • Updated 4 days ago • 300 • 9
DCAgent2/swebench_verified_random_100_folders_rl_r2egym_nl2bash_stack_bugsseq_fixthink_a563fb95d Viewer • Updated 4 days ago • 300 • 11
DCAgent2/swebench_verified_random_100_folders_bs64_rloo_n_noct_stri_micr_auto_conv_pref_a2c2566a Viewer • Updated 4 days ago • 300 • 11
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_webshop_sandboxes_maxeps_32k_2026030c14e75 Viewer • Updated 4 days ago • 267 • 9
DCAgent2/terminal_bench_2_dev_set_part1_10k_glm_4_7_traces_jupiter_cleaned_20260310_165959 Viewer • Updated 4 days ago • 264 • 9
DCAgent2/swebench_verified_random_100_folders_bs64_rloo_n_noct_stri_micr_auto_tis_model_b3c926d0 Viewer • Updated 4 days ago • 300 • 9
DCAgent2/DCAgent_dev_set_v2_laion_GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reabfaeb4c0 Viewer • Updated 4 days ago • 327 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_ne790163c Viewer • Updated 4 days ago • 267 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_nc113e380 Viewer • Updated 4 days ago • 267 • 8
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n1ea92711 Viewer • Updated 4 days ago • 267 • 8
DCAgent2/terminal_bench_2_Kimi_K2T_ling_coder_sft_sandboxes_1_maxeps_32k_20260310_170002 Viewer • Updated 4 days ago • 267 • 7
DCAgent2/terminal_bench_2_Qwen3_8B_exp_tas_summarize_threshold_4096_traces_save_strategy6757ed2b Viewer • Updated 4 days ago • 267 • 5
DCAgent2/DCAgent2_terminal_bench_2_laion_exp_tas_optimal_combined_traces_tm8x_20260309_155137 Viewer • Updated 4 days ago • 231 • 7
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n748607fc Viewer • Updated 4 days ago • 267 • 7
DCAgent2/DCAgent2_terminal_bench_2_laion_exp_tas_timeout_multiplier_1_0_traces_tm4x_20265b3b0c27 Viewer • Updated 4 days ago • 231 • 5
DCAgent2/terminal_bench_2_glm46_Toolscale_tasks_traces_20260310_125924 Viewer • Updated 4 days ago • 267 • 7
DCAgent2/DCAgent_dev_set_v2_laion_exp_tas_timeout_multiplier_1_0_traces_tm8x_20260309_155136 Viewer • Updated 4 days ago • 174 • 6
DCAgent2/terminal_bench_2_GLM_4_7_stackexchange_tezos_sandboxes_maxeps_131k_20260310_053513 Viewer • Updated 4 days ago • 266 • 8
DCAgent2/terminal_bench_2_exp_swd_r2egym_wo_docker_glm_4_7_traces_20260310_125925 Viewer • Updated 4 days ago • 267 • 8
DCAgent2/DCAgent2_terminal_bench_2_laion_exp_tas_timeout_multiplier_1_0_traces_tm8x_20268d9e647c Viewer • Updated 4 days ago • 231 • 7
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n1ba2235d Viewer • Updated 4 days ago • 267 • 7
DCAgent2/terminal_bench_2_rl_rl_conf_24GP_base_yaml_mode_path_r2eg_nl2b_stac_bugs_fixt_t5f893ad4 Viewer • Updated 4 days ago • 267 • 5
DCAgent2/DCAgent2_terminal_bench_2_laion_exp_tas_optimal_combined_traces_tm4x_20260309_155137 Viewer • Updated 4 days ago • 231 • 8
DCAgent2/DCAgent2_aider_polyglot_laion_GLM-4.6-stackexchange-overflow-sandboxes-32eps-6591d732c6 Viewer • Updated 4 days ago • 675 • 6
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n5c86bef8 Viewer • Updated 4 days ago • 267 • 7
DCAgent2/DCAgent2_aider_polyglot_laion_GLM-4.6-stackexchange-overflow-sandboxes-32eps-65f6ca3db4 Viewer • Updated 4 days ago • 675 • 8
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3fe051100 Viewer • Updated 4 days ago • 300 • 7
DCAgent2/DCAgent2_aider_polyglot_laion_syh-r2eg-askl-glm_4-7_trac_jupi_-gfi-swes-rand-fiee3aff23 Viewer • Updated 4 days ago • 675 • 10