No mention of this model having 3B active parameters

#10
by shahidchdry - opened

I think you guys should mention in README.md that this model is a 35B MoE with 3B active parameters. This is a great advantage that currently goes unmentioned, and users might otherwise assume that this is a 35B dense model.
