ISTA-DASLab
/

Llama-3.1-8B-Instruct-MatGPTQ

8-bit precision

Model card Files Files and versions

Llama-3.1-8B-Instruct-MatGPTQ / README.md

mkleinegger's picture

Update README.md

5b3c0d3 verified 7 days ago

|

history blame contribute delete

347 Bytes

This is the official MatGPTQ checkpoint of meta-llama/Llama-3.1-8B-Instruct, produced as described in the "MatGPTQ: Accurate and Efficient Post-Training Matryoshka Quantization" paper.

This model can be run via vLLM. Checkout our integration at IST-DASLab/MatGPTQ