On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published Feb 24 • 101
AlienKevin/nemotron-terminal-8b-25pct-eval-terminal-bench-lite-concurrency-25 Updated 12 days ago • 192
AlienKevin/nemotron-terminal-8b-5pct-rand-skill-based-eval-terminal-bench-lite-concurrency-25 Viewer • Updated 14 days ago • 1.4k • 2.06k
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-32b-eval Updated 15 days ago • 198
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-required-workflow-eval Updated 15 days ago • 110
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-eval Updated 15 days ago • 97
AlienKevin/sweb-verified-rand-100-mini-swe-v2.2.7-regex-parser-qwen3-8b-view-before-edit-system-prompt-eval Updated 15 days ago • 107