DCAgent2/swebench_verified_random_100_folders_rl_rl_conf_24GP_base_yaml_mode_path_exp_ta7ad5624e Viewer • Updated about 1 hour ago • 300
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n903c0ee0 Viewer • Updated about 2 hours ago • 267
DCAgent2/swebench_verified_random_100_folders_swesmith_sandboxes_with_tests_gpt_5_mini_p710cae67 Viewer • Updated about 5 hours ago • 300
DCAgent2/terminal_bench_2_glm46_Toolscale_tasks_traces_20260311_174322 Viewer • Updated about 5 hours ago • 267
DCAgent2/swebench_verified_random_100_folders_rl_r2egym_nl2bash_stack_bugsseq_fixthink_ac6ef45e4 Viewer • Updated about 5 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_1e58fbf5 Viewer • Updated about 6 hours ago • 300
DCAgent2/terminal_bench_2_rl_r2egym_nl2bash_stack_bugsseq_fixthink_again_lr1e_5_postmort1bdb5755 Viewer • Updated about 6 hours ago • 267
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_ceabc985 Viewer • Updated about 7 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_nl2bash_nl2bash_bugsseq_Qwen3_8B_maxEps24_114efe33 Viewer • Updated about 7 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3c2e552a9 Viewer • Updated about 8 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwena0e0c3f6 Viewer • Updated about 9 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwenb1c78a15 Viewer • Updated about 10 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwen6a3f6328 Viewer • Updated about 10 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_rl_r2egym_nl2bash_stack_bugsseq_fixthink_a7e78a3c7 Viewer • Updated about 14 hours ago • 300
DCAgent2/terminal_bench_2_exp_psu_stackoverflow_10K_glm_4_7_traces_20260311_170344 Viewer • Updated about 15 hours ago • 267
DCAgent2/swebench_verified_random_100_folders_Kimi_K2T_neulab_agenttuning_webshop_sandbod07c3d59 Viewer • Updated about 16 hours ago • 300
DCAgent2/swebench_verified_random_100_folders_Kimi_K2T_neulab_agenttuning_kg_sandboxes_me5f27cd1 Viewer • Updated about 16 hours ago • 300
DCAgent2/terminal_bench_2_exp_psu_stackoverflow_316_glm_4_7_traces_20260311_170339 Viewer • Updated about 16 hours ago • 267
DCAgent2/medagentbench_laion_r2egym-nl2bash-stack-bugsseq Viewer • Updated about 17 hours ago • 900 • 9
DCAgent2/medagentbench_laion_rl_tp4s64_8x_minimal_instructions Viewer • Updated about 18 hours ago • 900
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_1_0_traces_20260311_010108 Viewer • Updated about 22 hours ago • 267
DCAgent2/terminal_bench_2_exp_tas_timeout_multiplier_0_25_traces_20260311_010106 Viewer • Updated about 24 hours ago • 267
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_mind2web_sandboxes_maxeps_32k_2026a0df17cd Viewer • Updated 1 day ago • 267
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_mind2web_sandboxes_maxeps_32k_2026c560513a Viewer • Updated 1 day ago • 267
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_32029ed94 Viewer • Updated 1 day ago • 300
DCAgent2/terminal_bench_2_Kimi_K2T_neulab_agenttuning_kg_sandboxes_maxeps_32k_20260310_170004 Viewer • Updated 1 day ago • 267
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_35be6aa00 Viewer • Updated 1 day ago • 300
DCAgent2/swebench_verified_random_100_folders_GLM_4_6_stackexchange_overflow_sandboxes_3044fabdf Viewer • Updated 1 day ago • 300