Guys, please make a 30B-A3B-style MoE model
#2, opened by Narutoouz
That would give better throughput on MacBooks and many NVIDIA cards. Thanks for open-sourcing this. Please make MoE models.
Seconding this, but with Nemotron-3-Nano + long-context tasks