What quantization is the base model?

#8
by spanspek - opened

The model weights look too small to be FP16 or FP8, what quantization is this model?

I looked in the README.md and tech blog but couldn't see it...

It says INT4 on the model card

IMG_1561

spanspek changed discussion status to closed

Sign up or log in to comment