mkleinegger's picture
Update README.md
4086e01 verified

This is the official MatGPTQ checkpoint of Qwen/Qwen3-8B-Base, produced as described in the "MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization" paper.

This model can be run via vLLM. Checkout our integration at IST-DASLab/MatGPTQ