Hi, any pointers on how to quantize this model to int8 weight-only precision?

by tanvij - opened Mar 31, 2025


I'm looking into quantizing this model to int8 precision and I'm wondering if I should manually quantize the weights or use an automated technique like AWQ or bitsandbytes. Any recommendations on which method works best for this model? Thanks!
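For reference, "manually quantizing the weights" for int8 weight-only usually means something like symmetric per-output-channel rounding, with activations left in floating point. Here is a minimal sketch of that idea in plain Python. It assumes a weight matrix given as a list of rows, one row per output channel. The function names are hypothetical and not taken from any library, and this is not what AWQ or bitsandbytes do internally.

```python
def quantize_row(row):
    """Symmetric int8 quantization of one output channel (a list of floats)."""
    max_abs = max(abs(w) for w in row)
    # Assumption: an all-zero row gets scale 1.0 to avoid dividing by zero.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest integer and clamp to the symmetric range [-127, 127].
    q = [max(-127, min(127, round(w / scale))) for w in row]
    return q, scale


def dequantize_row(q, scale):
    """Recover approximate float weights from int8 values and the row scale."""
    return [v * scale for v in q]


def quantize_weight(weight):
    """Quantize each row independently; returns (int8 rows, per-row scales)."""
    pairs = [quantize_row(row) for row in weight]
    return [q for q, _ in pairs], [s for _, s in pairs]


# Toy weight matrix for illustration only (hypothetical values).
weight = [[0.5, -1.0, 0.25], [0.0, 0.0, 0.0], [2.0, 1.0, -2.0]]
q_weight, scales = quantize_weight(weight)
```

With this scheme, each reconstructed weight differs from the original by at most half of its row's scale. In practice the same logic would be applied to each linear layer's weight tensor, and whether it preserves accuracy for this particular model would need to be checked empirically.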
