Question: code completion speed on mobile?

#51
by 3morixd - opened

We deploy code models on phones for offline development assistance.

Question: what's the minimum model size for effective code completion? We find that a 0.5B-parameter model works for autocomplete and a 1.5B-parameter model for function generation.

Has anyone tested this model with Q4_K_M quantization for code tasks?
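
For context on why quantization matters at these sizes, here is a rough back-of-envelope sketch of the weight footprint on a phone. The bits-per-weight figure for Q4_K_M is an assumption (roughly 4.85 on average; the real value varies by architecture), and the helper name `weight_footprint_mib` is made up for illustration. It counts weights only, not KV cache or runtime overhead.

```python
def weight_footprint_mib(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in MiB."""
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / (1024 ** 2)


# Assumption: rough average bits per weight for Q4_K_M; not an exact figure.
ASSUMED_Q4_K_M_BPW = 4.85

if __name__ == "__main__":
    # The two sizes mentioned above: 0.5B (autocomplete), 1.5B (function generation).
    for label, n_params in [("0.5B", 0.5e9), ("1.5B", 1.5e9)]:
        size = weight_footprint_mib(n_params, ASSUMED_Q4_K_M_BPW)
        print(f"{label}: roughly {size:.0f} MiB of weights at Q4_K_M")
```

Actual GGUF file sizes will differ somewhat, since some tensors are kept at higher precision, so treat this as an order-of-magnitude estimate only.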

  • Dispatch AI (FZE), Sharjah UAE
