Phi-3.5 mini — textbook quality in a small package

#48
by 3morixd - opened

Microsoft's approach: train on textbook-quality synthetic data, not web scrap. And it shows — Phi-3.5 mini punches well above its size.

On our phone farm: 14.2 t/s, 2.3GB, loads in 2.8s. Reasoning quality noticeably better than other 3B models.

The quality over quantity data approach is the future of small model training.

We packaged it as dispatchAI/Phi-3.5-mini-Instruct-mobile.

  • Dispatch AI (FZE), Sharjah UAE

Sign up or log in to comment