These are GGUF quantized versions of mistralai/Mixtral-8x7B-Instruct-v0.1.
The importance matrix was trained for 100K tokens (200 batches of 512 tokens) using wiki.train.raw.
Some model files above 50GB are split into smaller files. To concatenate them, use the cat command (on Windows, use PowerShell): cat foo-Q6_K.gguf.* > foo-Q6_K.gguf
- What quant do I need? See https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
- Quant requests? Just open a discussion in the community tabs.
- Downloads last month
- 1,179
Hardware compatibility
Log In
to view the estimation
2-bit
3-bit
4-bit
5-bit
6-bit
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support