DCAgent2/DCAgent_dev_set_v2_laion_Qwen3-8B_exp_tas_temp_0.25_traces_save-strategy_steps Viewer • Updated 19 days ago • 298 • 18
DCAgent2/DCAgent_dev_set_v2_DCAgent2_nl2bash-swesmith-undr7030 Viewer • Updated 19 days ago • 307 • 12
DCAgent2/DCAgent_dev_set_v2_laion_GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_2733413b Viewer • Updated 19 days ago • 296 • 29
DCAgent2/DCAgent_dev_set_v2_DCAgent2_nl2bash-stack-bugs-undr503020 Viewer • Updated 19 days ago • 302 • 14
DCAgent2/DCAgent_dev_set_v2_laion_rl_tp4s64_8x_nemotron-junit Viewer • Updated 19 days ago • 300 • 15
DCAgent2/DCAgent_dev_set_v2_laion_kimi-k2-r2egym_sandboxes-maxeps-32k Viewer • Updated 19 days ago • 300 • 18
DCAgent2/DCAgent_dev_set_v2_DCAgent_r2egymGPT5CodexPassed-nl2bash-bugsseq_Qwen3-8B-maxEpa10e967e Viewer • Updated 19 days ago • 300 • 14
DCAgent2/DCAgent2_bfcl-parity_laion_glm46-neulab-agenttuning-alfworld-sandboxes-maxeps-131k Viewer • Updated 19 days ago • 11
DCAgent2/DCAgent2_bfcl-parity_laion_glm-4_6-stackexchange-tezos-32ep-131k Viewer • Updated 19 days ago • 13
DCAgent2/terminal_bench_2_GLM_4_6_stackexchange_overflow_sandboxes_32eps_65k_reasoning_n78b5af2a Viewer • Updated 19 days ago • 267 • 12
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwendf4fcb5f Viewer • Updated 19 days ago • 300 • 13
DCAgent2/DCAgent2_bfcl-parity_laion_r2egym-nl2bash-stack-bugsseq-cpp Viewer • Updated 19 days ago • 10
DCAgent2/DCAgent2_bfcl-parity_laion_r2egym-nl2bash-stack-bugsseq-fixthink-exercism-python Viewer • Updated 19 days ago • 13
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwen7de61de3 Viewer • Updated 19 days ago • 300 • 12
DCAgent2/swebench_verified_random_100_folders_r2egymGPT5CodexPassed_nl2bash_bugsseq_Qwen1433ccc0 Viewer • Updated 19 days ago • 300 • 11
DCAgent2/DCAgent_dev_set_v2_laion_exp-syh-tezos-stackoverflow-mixed_glm_4_7_traces_jupitabc03382 Viewer • Updated 19 days ago • 290 • 16
DCAgent2/DCAgent_dev_set_v2_mlfoundations-dev_code-contests-sandboxes-traces-terminus-2 Viewer • Updated 19 days ago • 299 • 11
DCAgent2/medagentbench_laion_r2egym-nl2bash-stack-bugsseq-junit Viewer • Updated 19 days ago • 900 • 15
DCAgent2/swebench_verified_random_100_folders_rl_v1_tp4s64_8x_nemotron_junit_20260305_155303 Viewer • Updated 19 days ago • 300 • 13
DCAgent2/financeagent_terminal_laion_exp-syh-tezos-askllm-constrained_glm_4_7_traces_jupiter Viewer • Updated 19 days ago • 150 • 17
DCAgent2/DCAgent_dev_set_v2_laion_r2egym-nl2bash-stack-bugsseq-bash-withtests Viewer • Updated 19 days ago • 300 • 12
DCAgent2/DCAgent_dev_set_v2_laion_exp_tas_summarize_threshold_16384_traces Viewer • Updated 19 days ago • 299 • 17
DCAgent2/gaia_127_laion_exp-syh-tezos-askllm-constrained_glm_4_7_traces_jupiter Viewer • Updated 19 days ago • 364 • 1.29k