DCAgent/eval-GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k_16coc1aa4095 Viewer • Updated Feb 23 • 320 • 16
DCAgent/eval-GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k-fixtf8f83e5e Viewer • Updated Feb 23 • 301 • 14
DCAgent/eval-r2egym-nl2bash-stack-bugsseq-fixthink_16concurrency_eval_ctx32k_swebench-vefa3c5eee Viewer • Updated Feb 22 • 126 • 18
DCAgent/eval-r2egym-nl2bash-stack-bugsseq_16concurrency_eval_ctx32k_swebench-verified-ra64e5b9f4 Viewer • Updated Feb 22 • 2.07k • 19
DCAgent/eval-r2egym-nl2bash-stack-bugsseq_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated Feb 22 • 149 • 16
DCAgent/exp-gfi-staqc-embedding-mean-filtered-10K_glm_4.7_traces_jupiter Viewer • Updated Feb 22 • 9.21k • 13
DCAgent/exp-gfi-swesmith-short-response-filtered-10K_glm_4.7_traces_jupiter Viewer • Updated Feb 22 • 8.83k • 19
DCAgent/eval-r2egym-nl2bash-stack-bugsseq-fixthink_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated Feb 21 • 769 • 14