Model request?
#1
by pathosethoslogos - opened
Just wondering if you take model requests
If you do, please do a 2-bit quant of this! :)
Which kind of 2-bit quant are you looking for?
This quantized model uses turboquant-vllm for weight quantization. As far as I know, it only supports TurboQuant at 3 bits and 4 bits right now.
If you are on Apple Silicon, you can try MiniMax-M2.7-JANGTQ and MiniMax-M2.7-JANG_2L. They are roughly 2-bit quants, but they only support MLX so far. I've recently been working on a CUDA port, but it's not ready yet.
AutoRound if possible, if not then AWQ please! 😊