NeoRoth/voxtral-3b-quantized

voxtral-mini-3b-2507


NeoRoth committed on Aug 26, 2025

Commit a31ef7f · verified · 1 Parent(s): 8882701

Add model card with checksums

Files changed (1)

README.md +43 -0

README.md ADDED

	@@ -0,0 +1,43 @@

+---
+tags:
+  - voxtral
+  - quantized
+  - mlx
+  - gguf
+library_name: mlx
+---
+# Voxtral 3B — Quantized (MLX + GGUF)
+This repository collects quantized variants of the Voxtral 3B model for MLX (Apple Silicon) and GGUF (llama.cpp).
+## Variants
+- MLX Q4: directory `mlx-q4/`
+- MLX Q8: directory `mlx-q8/`
+- GGUF: directory `gguf/`
+## Integrity (SHA256)
+- MLX Q4 `model-00001-of-00001.safetensors`:
+  - `eec98aef078b3db2c226943d38558d814b10ec387dc5359d333eeed4be5298d2`
+- MLX Q8 `model-00001-of-00001.safetensors`:
+  - `37999e4a9dda52a0aedb593636be6c12e69dd8b8457f15ce48134f88b1ccebd3`
+- GGUF `ggml-model-Q4_K_S.gguf`:
+  - `c9221f05d388848ef117566fb50e835c111f055a6de399e559ec51ba59e7f286`
+- GGUF `mmproj-model.gguf`:
+  - `c25bbc0ce7a8f32665302f6c7db4d215e811180cac1e3b056affe8b6b1057b05`
+## Quick start
+- MLX (Python):
+```python
+from mlx_lm import load
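+# Example: load the MLX Q4 quantized weights.
+# Note: the Q4 weights live under `mlx-q4/`, while this call points at the repo root,
+# so it may be necessary to pass a local path to that directory instead.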
+model, tokenizer = load("NeoRoth/voxtral-3b-quantized")
+```
+- GGUF (llama.cpp):
+```bash
+# Recent llama.cpp builds name this binary `llama-cli` rather than `main`.
+./main -m gguf/ggml-model-Q4_K_S.gguf -p "Bonjour"
+```
+## Notes
+- These files are quantizations derived from the Voxtral 3B model. Comply with the license of the original model.
+- Open an issue if you spot a problem (missing weights, incorrect checksum, etc.).
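
The SHA256 values listed in the model card's integrity section can be checked against a local download. The following is a minimal sketch, not part of the commit: the helper names `sha256_of` and `verify` are hypothetical, and the relative paths assume that each file sits in the directory named in the variants list (in particular, that `mmproj-model.gguf` lives under `gguf/`).

```python
import hashlib
from pathlib import Path

# Expected digests copied from the README's SHA256 list.
# ASSUMPTION: relative paths follow the directories named in the variants list.
EXPECTED = {
    "mlx-q4/model-00001-of-00001.safetensors":
        "eec98aef078b3db2c226943d38558d814b10ec387dc5359d333eeed4be5298d2",
    "mlx-q8/model-00001-of-00001.safetensors":
        "37999e4a9dda52a0aedb593636be6c12e69dd8b8457f15ce48134f88b1ccebd3",
    "gguf/ggml-model-Q4_K_S.gguf":
        "c9221f05d388848ef117566fb50e835c111f055a6de399e559ec51ba59e7f286",
    "gguf/mmproj-model.gguf":
        "c25bbc0ce7a8f32665302f6c7db4d215e811180cac1e3b056affe8b6b1057b05",
}


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in fixed-size chunks so large weight files never sit fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(root, expected=EXPECTED):
    """Return {relative_path: status}, where status is 'ok', 'mismatch', or 'missing'."""
    results = {}
    for rel, want in expected.items():
        path = Path(root) / rel
        if not path.is_file():
            results[rel] = "missing"
        elif sha256_of(path) == want:
            results[rel] = "ok"
        else:
            results[rel] = "mismatch"
    return results
```

Calling `verify("path/to/local/copy")` returns one status per listed file; any `mismatch` or `missing` entry is the kind of problem the notes ask to be reported as an issue.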