DCAgent/eval-Kimi-Dev-72B_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated Feb 12 • 178 • 4
DCAgent/eval-SERA-8B_16concurrency_swe_agent_eval_c_OpenThoughts-TB-dev Viewer • Updated Feb 11 • 34 • 6
DCAgent/eval-SERA-8B_16concurrency_swe_agent_eval_c_terminal-bench-2.0 Viewer • Updated Feb 11 • 195 • 7
DCAgent/eval-Kimi-Dev-72B_16concurrency_eval_ctx32k_terminal-bench-2.0 Viewer • Updated Feb 11 • 241 • 6
DCAgent/eval-Kimi-Dev-72B_16concurrency_eval_ctx32k_swebench-verified-random-100-folders Viewer • Updated Feb 11 • 3
DCAgent/eval-SERA-8B_16concurrency_eval_ctx32k_swebench-verified-random-100-folders Viewer • Updated Feb 6 • 247 • 3
DCAgent/perturbed-docker-exp-magicoder-tasks-2_glm_4.7_traces_locetash Viewer • Updated Feb 6 • 7.87k • 2
DCAgent/exp-gfi-staqc-short-response-filtered-10K_glm_4.7_traces_locetash Viewer • Updated Feb 6 • 9.95k • 2
DCAgent/eval-SERA-32B_16concurrency_eval_ctx32k_swebench-verified-random-100-folders Viewer • Updated Feb 6 • 742 • 2
DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_16concurrency_openhands_eval_c_terminal-bench-2.0 Viewer • Updated Feb 6 • 266 • 4
DCAgent/eval-Qwen3-Coder-30B-A3B-Instruct_16concurrency_openhands_eval_c_OpenThoughts-TB-dev Viewer • Updated Feb 5 • 200 • 7
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epocf8c274b2 Viewer • Updated Feb 5 • 295 • 2
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc9b2dd9fa Viewer • Updated Feb 5 • 265 • 4
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-986b6fefd Viewer • Updated Feb 5 • 552 • 2