dmx-mistral-7b-m7
DMX M=7 compressed version of mistralai/Mistral-7B-Instruct-v0.3.
Stats
- Source: mistralai/Mistral-7B-Instruct-v0.3 (FP16)
- Format: DMX BFP M=7 (7 mantissa bits, block floating point)
- File size: 6.79 GB (53% smaller than FP16)
- Quality: Within GPU variance of FP16 (BF16-equivalent precision)
Usage
pip install dmx-compress dmx-runtime
from dmx_runtime import from_dmx_compressed
model = from_dmx_compressed(
"model.dmx",
model_id="mistralai/Mistral-7B-Instruct-v0.3"
)
Compressed with dmx-compress.
- Downloads last month
- 5
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for Senat1/dmx-mistral-7b-m7
Base model
mistralai/Mistral-7B-v0.3 Finetuned
mistralai/Mistral-7B-Instruct-v0.3