Is it possible to release a version with low bit quantization?
#11
by
lan0004
- opened
The model works really well with OpenClaw, so a low-bit quantized version would be especially useful for people who want to run it locally.
Are you asking for a 2-bit or even a 1.58-bit version?
For example, the current INT4 quantization does not fit on a machine with an AI MAX 395: the model weights plus context space and system overhead currently exceed 128 GB.
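For a rough sense of what a lower bit width would buy, here is a minimal back-of-the-envelope sketch. The parameter count and the overhead allowance below are placeholders, not the real numbers for this model; the actual KV-cache cost also depends on the architecture and context length.

```python
# Rough memory-footprint estimate for weight-only quantization.
# NOTE: PARAMS_B and OVERHEAD_GIB are placeholder assumptions --
# substitute the real parameter count and a measured KV-cache/runtime
# overhead for this model before drawing conclusions.

def weights_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GiB for a given average bit width."""
    bytes_total = params_billion * 1e9 * bits_per_weight / 8
    return bytes_total / 2**30

PARAMS_B = 300        # placeholder parameter count (billions)
OVERHEAD_GIB = 16     # placeholder allowance for context + system overhead

for bits in (4.5, 2.5, 1.58):   # roughly INT4-class, 2-bit-class, ternary
    total = weights_gib(PARAMS_B, bits) + OVERHEAD_GIB
    verdict = "fits" if total <= 128 else "does not fit"
    print(f"{bits:>5} bpw: ~{total:6.1f} GiB total -> {verdict} in 128 GiB")
```

The point of the sketch is just the scaling: halving the average bits per weight roughly halves the weight footprint, which is what would bring a model of this size under a 128 GB unified-memory budget.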