Qwen Coder 14B — mobile code generation is real

#6
by 3morixd - opened

We tested Qwen2.5-Coder-14B on phones. At Q4_K_M (~8GB), it runs at ~4 t/s — slow but usable for code generation where you wait for the result.

But the real magic: Qwen2.5-Coder-0.5B (490MB) runs at 19.2 t/s and handles 80% of coding questions. For the other 20%, escalate to the cloud.

This cascading approach is how mobile AI should work: tiny model first, escalate only when needed.

We've packaged the 0.5B coder as dispatchAI/Qwen2.5-0.5B-Coder-mobile.

— Dispatch AI (FZE), Sharjah UAE

Sign up or log in to comment