dmx-mistral-7b-m7

DMX M=7 compressed version of mistralai/Mistral-7B-Instruct-v0.3.

Stats

  • Source: mistralai/Mistral-7B-Instruct-v0.3 (FP16)
  • Format: DMX BFP M=7 (7 mantissa bits, block floating point)
  • File size: 6.79 GB (53% smaller than FP16)
  • Quality: Output differences from FP16 fall within normal GPU run-to-run variance (7 mantissa bits, the same as BF16)
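
To make "block floating point with 7 mantissa bits" concrete, here is a minimal sketch of generic BFP quantization. It is not the DMX implementation: this card does not state the block size, exponent width, or rounding mode DMX uses, so the block size of 16, round-to-nearest, and all function names below are illustrative assumptions.

```python
import math

# Generic block-floating-point sketch. NOT the actual DMX format:
# block size, rounding mode, and all names here are assumptions.

def bfp_quantize_block(values, mantissa_bits=7):
    """Quantize one block to a shared exponent plus signed integer mantissas."""
    max_abs = max(abs(v) for v in values)
    if max_abs == 0.0:
        return 0, [0] * len(values)
    # frexp returns (m, e) with max_abs = m * 2**e and 0.5 <= m < 1,
    # so every value in the block has magnitude below 2**e.
    _, shared_exp = math.frexp(max_abs)
    scale = 2.0 ** (shared_exp - mantissa_bits)
    limit = 2 ** mantissa_bits - 1  # largest magnitude that fits in M bits
    mantissas = [max(-limit, min(limit, round(v / scale))) for v in values]
    return shared_exp, mantissas

def bfp_dequantize_block(shared_exp, mantissas, mantissa_bits=7):
    """Reconstruct approximate floats from a shared exponent and mantissas."""
    scale = 2.0 ** (shared_exp - mantissa_bits)
    return [m * scale for m in mantissas]

def bfp_roundtrip(values, block_size=16, mantissa_bits=7):
    """Quantize and dequantize a flat list block by block."""
    out = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        exp, mants = bfp_quantize_block(block, mantissa_bits)
        out.extend(bfp_dequantize_block(exp, mants, mantissa_bits))
    return out

if __name__ == "__main__":
    weights = [0.75, -0.5, 0.1, 0.003]
    exp, mants = bfp_quantize_block(weights)
    print(exp, mants)                        # 0 [96, -64, 13, 0]
    print(bfp_dequantize_block(exp, mants))  # [0.75, -0.5, 0.1015625, 0.0]
```

Because the exponent is shared across a block, values much smaller than the block's largest magnitude lose precision (0.003 rounds to 0 above), which is why the block size matters for quality in any BFP scheme.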

Usage

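# Install the compressor and runtime packages (shell)
pip install dmx-compress dmx-runtime

# Load the compressed checkpoint (Python)
from dmx_runtime import from_dmx_compressed

model = from_dmx_compressed(
    "model.dmx",  # path to the DMX file from this repository
    model_id="mistralai/Mistral-7B-Instruct-v0.3",  # source model ID
)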

Compressed with dmx-compress.
