very sensitive to quantization

#17
by J22 - opened

I have tested with f16 and q8_0. q8_0 looks worse.

I have tested with f16 and q8_0. q8_0 looks worse.

How much worse? Is it far worse, or is it very marginal?

Sign up or log in to comment