DCAgent/exp-syh-tezos-askllm-hardened_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 9.57k • 17
DCAgent/exp-syh-tezos-askllm-constrained_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 8.51k • 13
DCAgent/exp-syh-r2egym-swesmith-mixed_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 10k • 15
DCAgent/exp-syh-r2egym-askllm-hardened_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 9.83k • 14
DCAgent/exp-syh-r2egym-askllm-constrained_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 8.58k • 26
DCAgent/exp-gfi-swesmith-short-response-filtered-10K_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 8.58k • 18
DCAgent/exp-gfi-swesmith-random-filtered-10K_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 10k • 16
DCAgent/exp-gfi-staqc-embedding-mean-filtered-10K_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 9.21k • 16
DCAgent/exp-gfi-staqc-askllm-filtered-10K_glm_4.7_traces_jupiter_cleaned Viewer • Updated 21 days ago • 9.86k • 26
DCAgent/stackexchange-tezos-sandboxes_glm_4.7_traces_openhands Viewer • Updated 22 days ago • 9.68k • 25
DCAgent/eval-GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k_16co923617cf Viewer • Updated 22 days ago • 582 • 16
DCAgent/eval-GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k-fixt72b77c06 Viewer • Updated 22 days ago • 555 • 16
DCAgent/eval-GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k_16coc1aa4095 Viewer • Updated 22 days ago • 320 • 16
DCAgent/eval-GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k-fixtf8f83e5e Viewer • Updated 22 days ago • 301 • 14
DCAgent/exp-syh-r2egym-askllm-hardened_glm_4.7_traces_jupiter Viewer • Updated 22 days ago • 9.83k • 16
DCAgent/eval-r2egym-nl2bash-stack-bugsseq-fixthink_16concurrency_eval_ctx32k_swebench-vefa3c5eee Viewer • Updated 22 days ago • 126 • 10
DCAgent/eval-r2egym-nl2bash-stack-bugsseq_16concurrency_eval_ctx32k_swebench-verified-ra64e5b9f4 Viewer • Updated 22 days ago • 2.07k • 19
DCAgent/eval-r2egym-nl2bash-stack-bugsseq_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated 22 days ago • 149 • 18
DCAgent/exp-syh-tezos-askllm-constrained_glm_4.7_traces_jupiter Viewer • Updated 22 days ago • 8.51k • 17
DCAgent/exp-gfi-staqc-embedding-mean-filtered-10K_glm_4.7_traces_jupiter Viewer • Updated 23 days ago • 9.21k • 17
DCAgent/exp-gfi-swesmith-short-response-filtered-10K_glm_4.7_traces_jupiter Viewer • Updated 23 days ago • 8.83k • 19
DCAgent/eval-r2egym-nl2bash-stack-bugsseq-fixthink_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated 23 days ago • 769 • 15