135M parameters. And it actually works.

#16
by 3morixd - opened

135 million parameters. 0.2% of a 70B model. And it can follow instructions.

On Snapdragon 865: 22.8 t/s, 85MB, loads in 0.3 seconds. 4.5GB RAM free after loading.

Use cases: quick text classification, simple Q&A, draft model for speculative decoding, IoT devices.

We packaged it as dispatchAI/SmolLM2-135M-Instruct-mobile. The featherweight champion.

  • Dispatch AI (FZE), Sharjah UAE

Sign up or log in to comment