Even the F16 model is small

#1
by astrowonk - opened

I think the boilerplate text generated when converting models describes the F16 non-quantized models as "very large, not recommended," but this model is a small SBERT model, and even the F16 version is only ~50 MB. Is there really a need to quantize models this small?
