Qwen3-8B-MatGPTQ / README.md
mkleinegger's picture
Update README.md
a37388f verified
This is the official MatGPTQ checkpoint of `Qwen/Qwen3-8B`, produced as described in the [**"MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization"**](https://arxiv.org/abs/2602.03537) paper.
This model can be run via vLLM. Checkout our integration at [IST-DASLab/MatGPTQ](https://github.com/IST-DASLab/MatGPTQ)