Does desklib support quantization?

by sexyOG - opened Mar 9, 2025

I would like to run desklib locally on a GPU with 8 GB of memory. Since FP16 is not supported, is it okay to use quantization, for example through ONNX Runtime? What about bitsandbytes?

desklib

Owner Mar 21, 2025

You can try it and see how it affects accuracy. You can also use CPU-based inference if you do not need to process a lot of data quickly.
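For anyone who wants to "try it and see", here is a rough sketch of what measuring the effect could look like. It does not load desklib. It uses a toy linear scorer with random weights as a stand-in, and every name in it (`quantize_int8`, `dequantize`, `linear_score`) is hypothetical. It applies plain symmetric int8 quantization, which is only one of several schemes that ONNX Runtime or bitsandbytes might use, and checks how often the quantized weights give the same yes/no decision as the originals.

```python
import random

def quantize_int8(weights):
    """Symmetric per-tensor int8 quantization: returns (int values, scale)."""
    max_abs = max(abs(w) for w in weights) or 1.0  # avoid a zero scale
    scale = max_abs / 127.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Map int8 values back to approximate float weights."""
    return [v * scale for v in q]

def linear_score(weights, features):
    """Toy stand-in for a classifier head: a plain dot product."""
    return sum(w * x for w, x in zip(weights, features))

# Assumption: random weights and inputs, purely for illustration.
random.seed(0)
weights = [random.uniform(-1, 1) for _ in range(256)]
samples = [[random.uniform(-1, 1) for _ in range(256)] for _ in range(200)]

q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

# Compare binary decisions (score > 0) before and after quantization.
agree = sum(
    (linear_score(weights, x) > 0) == (linear_score(restored, x) > 0)
    for x in samples
)
agreement = agree / len(samples)
max_weight_error = max(abs(a - b) for a, b in zip(weights, restored))

print(f"decision agreement: {agreement:.3f}")
print(f"max weight error: {max_weight_error:.5f} (scale {scale:.5f})")
```

With the real model, the same idea applies: run a labeled validation set through both the original and the quantized model and compare the predictions. Agreement on a toy like this says nothing about how desklib itself behaves after quantization.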

desklib changed discussion status to closed Mar 21, 2025
