DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-93b7ec80c Viewer • Updated Feb 4 • 390 • 19
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_terminal-bench-2.0 Viewer • Updated Feb 4 • 261 • 19
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-99a1741f7 Viewer • Updated Feb 4 • 415 • 14
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_swebench-vera9b71b18 Viewer • Updated Feb 4 • 300 • 11
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_OpenThoue429c793 Viewer • Updated Feb 4 • 202 • 12
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-956774c43 Viewer • Updated Feb 3 • 435 • 14
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_terminal67fe5eed Viewer • Updated Feb 3 • 265 • 14
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_eval_ctx131k_OpenThoughts-TB-dev Viewer • Updated Feb 3 • 203 • 11
DCAgent/eval-SA-SWE-32B_16concurrency_openhands_eval_c_swebench-verified-random-100-folders Viewer • Updated Feb 1 • 282 • 17
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-9f9d0f79f Viewer • Updated Jan 31 • 448 • 13
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-9fd9119c4 Viewer • Updated Jan 30 • 177 • 12
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-95bee1cc5 Viewer • Updated Jan 30 • 184 • 13
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-9537755c2 Viewer • Updated Jan 30 • 93 • 16
DCAgent/eval-NVIDIA-Nemotron-3-Nano-30B-A3B-BF16_16concurrency_openhands_eval_c_swebench09c38651 Viewer • Updated Jan 30 • 8
DCAgent/eval-GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_num-train-epoca04b4811 Viewer • Updated Jan 30 • 18
DCAgent/eval-SWE-Swiss-32B_16concurrency_eval_ctx32k_swebench-verified-random-100-folders Viewer • Updated Jan 29 • 180 • 11
DCAgent/eval-SWE-Swiss-32B_16concurrency_eval_ctx32k_OpenThoughts-TB-dev Viewer • Updated Jan 29 • 186 • 12
DCAgent/eval-SWE-Swiss-32B_16concurrency_eval_ctx32k_terminal-bench-2.0 Viewer • Updated Jan 28 • 216 • 22