This is the official MatGPTQ checkpoint of `microsoft/Phi-3-medium-128k-instruct`, produced as described in the [**"MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization"**](https://arxiv.org/abs/2602.03537) paper.
This model can be run with vLLM. Check out our integration at [IST-DASLab/MatGPTQ](https://github.com/IST-DASLab/MatGPTQ).