Instructions to use microsoft/VibeVoice-1.5B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use microsoft/VibeVoice-1.5B with Transformers:
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-to-speech", model="microsoft/VibeVoice-1.5B")
```

```python
# Load model directly
from transformers import AutoModelForSeq2SeqLM

model = AutoModelForSeq2SeqLM.from_pretrained("microsoft/VibeVoice-1.5B", dtype="auto")
```
- Notebooks
- Google Colab
- Kaggle
gguf / Quants (#3), opened by PsiPi
Someone had to say it
I get it's tiny.
I'd guess some testing would be needed to see how much quality is retained at the Q4 or Q5 level.
It would be particularly interesting to see whether quantization affects the "vibe" of the voice, and whether the compression significantly impacts the frame rate, which seems to be a key feature of how this model excels.
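As a rough illustration of what Q4-level quantization does to weight precision, here is a minimal sketch in plain NumPy. It uses simple block-wise absmax quantization to 4-bit signed levels, loosely modeled on GGUF-style Q4 schemes but not the actual ggml format, and measures the reconstruction error on random weights. Real quality testing would of course compare generated audio, not weight RMSE.

```python
import numpy as np

def quantize_q4_blockwise(x, block_size=32):
    """Simplified block-wise 4-bit absmax quantization.

    Loosely modeled on GGUF-style Q4 schemes (not the exact ggml
    bit layout): each block stores one float scale plus 4-bit ints.
    """
    x = x.reshape(-1, block_size)
    # 4-bit signed range used here: -7..7, so scale maps the block max to 7
    scale = np.abs(x).max(axis=1, keepdims=True) / 7.0
    scale[scale == 0] = 1.0  # avoid division by zero for all-zero blocks
    q = np.clip(np.round(x / scale), -7, 7).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # Reconstruct approximate float weights from ints and per-block scales
    return (q.astype(np.float32) * scale).reshape(-1)

rng = np.random.default_rng(0)
w = rng.normal(size=1024).astype(np.float32)  # stand-in for model weights
q, s = quantize_q4_blockwise(w)
w_hat = dequantize(q, s)
rmse = float(np.sqrt(np.mean((w - w_hat) ** 2)))
print(f"RMSE after simulated Q4: {rmse:.4f}")
```

The point of the sketch is the trade-off the thread is discussing: a coarser grid (Q4 vs Q5 vs Q8) means larger per-block quantization steps and therefore larger reconstruction error, which may or may not be audible in the generated speech.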
Yeah, the thought was a Q8 to start, and with the advent of the incoming 0.5 (and perhaps a 7?), quants may be less (or more) important.
In some cases, simply being in the correct format "wrapper" also lets people use their preferred tooling.