- GGUF importance matrix (imatrix) quants for https://huggingface.co/abacusai/Smaug-Mixtral-v0.1
- The importance matrix was trained for ~50K tokens (105 batches of 512 tokens) using a general purpose imatrix calibration dataset.
- The imatrix is being used on the K-quants as well.
NOTE: The new IQ3_M/IQ3_S (and updated Q3_K_XS) quants have been added, as well as IQ2_S/IQ2_M (requires commit a33e6a0d).
| Layers | Context | Template |
|---|---|---|
32 |
32768 |
<s>[INST] {prompt} [/INST] |
- Downloads last month
- 150
Hardware compatibility
Log In to add your hardware
2-bit
3-bit
4-bit
5-bit
8-bit
