Evaluating Agentic Backend Coding Capabilities in Real-World Development Scenarios
-
ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development
Paper ⢠2601.11077 ⢠Published ⢠63 -
OpenMOSS-Team/ABC-Bench
Viewer ⢠Updated ⢠224 ⢠226 ⢠2 -
OpenMOSS-Team/Qwen3-32B-ABC
Text Generation ⢠33B ⢠Updated ⢠6 -
OpenMOSS-Team/Qwen3-8B-ABC
Text Generation ⢠8B ⢠Updated ⢠21 ⢠1