DCAgent/eval-glm46-swegym-tasks-maxeps-131k_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated Feb 5 • 817 • 2
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoc7bd19372 Viewer • Updated Feb 4 • 541 • 1
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-93b7ec80c Viewer • Updated Feb 4 • 390 • 3
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_terminal-bench-2.0 Viewer • Updated Feb 4 • 261 • 5
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-99a1741f7 Viewer • Updated Feb 4 • 415 • 4
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_swebench-vera9b71b18 Viewer • Updated Feb 4 • 300 • 3
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_OpenThoue429c793 Viewer • Updated Feb 4 • 202 • 6
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-956774c43 Viewer • Updated Feb 3 • 435 • 3
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_terminal67fe5eed Viewer • Updated Feb 3 • 265 • 6
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_OpenThoughts-TB-dev Viewer • Updated Feb 3 • 203 • 2
DCAgent/eval-SA-SWE-32B_16concurrency_openhands_eval_c_swebench-verified-random-100-folders Viewer • Updated Feb 1 • 282 • 1
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-9f9d0f79f Viewer • Updated Jan 31 • 448 • 4
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-9fd9119c4 Viewer • Updated Jan 30 • 177 • 2
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-95bee1cc5 Viewer • Updated Jan 30 • 184 • 2
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-9537755c2 Viewer • Updated Jan 30 • 93 • 7
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_swebench09c38651 Viewer • Updated Jan 30 • 3
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoca04b4811 Viewer • Updated Jan 30 • 2
DCAgent/eval-SWE-Swiss-32B_16concurrency_eval_ctx32k_swebench-verified-random-100-folders Viewer • Updated Jan 29 • 180 • 5
DCAgent/eval-SWE-Swiss-32B_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated Jan 29 • 186 • 5
DCAgent/eval-SWE-Swiss-32B_16concurrency_eval_ctx32k_terminal-bench-2.0 Viewer • Updated Jan 28 • 216 • 7