GGUF importance matrix (imatrix) quants for https://huggingface.co/abacusai/Smaug-Mixtral-v0.1
The importance matrix was computed over ~50K tokens (105 batches of 512 tokens, i.e. 53,760 tokens) using a general-purpose imatrix calibration dataset.
The imatrix is also applied to the K-quants.

NOTE: The new IQ3_M/IQ3_S (and updated Q3_K_XS) quants have been added, as well as IQ2_S/IQ2_M (requires commit a33e6a0d).

Layers	Context	Template
32	32768	<s>[INST] {prompt} [/INST] {response}
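
As a minimal sketch of how the template above can be filled in, the Python helper below formats a single turn. The names `TEMPLATE` and `format_turn` are hypothetical and exist only for this illustration. Some runtimes prepend the `<s>` token on their own, in which case it should be dropped from the string to avoid duplicating it; check how your runtime behaves before relying on this.

```python
# Prompt template copied from the table above.
TEMPLATE = "<s>[INST] {prompt} [/INST] {response}"


def format_turn(prompt: str, response: str = "") -> str:
    """Fill the template for one turn.

    With an empty response, the result ends right after [/INST],
    which is the point where the model is expected to continue.
    """
    text = TEMPLATE.format(prompt=prompt.strip(), response=response.strip())
    # Drop the trailing space left behind when no response is given.
    return text.rstrip()


if __name__ == "__main__":
    print(format_turn("Summarize GGUF in one sentence."))
```

Leaving `response` empty produces a prompt ready for generation; passing a response produces a completed turn, which can be useful for building few-shot examples.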

Format: GGUF
Model size: 47B params
Architecture: llama

Hardware compatibility (available quantizations): 2-bit, 3-bit, 4-bit, 5-bit, 8-bit