Guys, please make a 30B-A3B-style MoE model

#2
by Narutoouz - opened

That would give better throughput on MacBooks and many NVIDIA cards. Thanks for open-sourcing this. Please make MoE models.

Seconding this, but with Nemotron-3-Nano + long-context tasks
