majentik/Voxtral-Mini-3B-2507-RotorQuant-MLX-2bit

@@ -4,24 +4,30 @@ library_name: mlx
 license: apache-2.0
 pipeline_tag: automatic-speech-recognition
 tags:
-- voxtral
-- audio
-- speech
-- speech-recognition
-- transcription
-- translation
-- mlx
-- rotorquant
-- quantization
-- 2-bit
-language:
-- en
 ---
 # Voxtral-Mini-3B-2507-RotorQuant-MLX-2bit
 2-bit MLX weight-quantized build of [`mistralai/Voxtral-Mini-3B-2507`](https://huggingface.co/mistralai/Voxtral-Mini-3B-2507) with a RotorQuant KV-cache profile. Ultra-compact, best-available 2-bit stability for streaming audio on Apple Silicon.
 ## Overview
 - **Base:** `mistralai/Voxtral-Mini-3B-2507` — 3B speech-understanding model

 license: apache-2.0
 pipeline_tag: automatic-speech-recognition
 tags:
+  - voxtral
+  - audio
+  - speech
+  - speech-recognition
+  - transcription
+  - translation
+  - mlx
+  - rotorquant
+  - quantization
+  - 2-bit
 ---
 # Voxtral-Mini-3B-2507-RotorQuant-MLX-2bit
 2-bit MLX weight-quantized build of [`mistralai/Voxtral-Mini-3B-2507`](https://huggingface.co/mistralai/Voxtral-Mini-3B-2507) with a RotorQuant KV-cache profile. Ultra-compact, best-available 2-bit stability for streaming audio on Apple Silicon.
+## Hardware compatibility
+| Device | Approx. memory used (unified) | Recommendation |
+| --- | --- | --- |
+| Apple M4 Max 128 GB | ~1.3 GB | recommended — headroom for long context |
+| Apple M3 Max 64 GB | ~1.3 GB | comfortable |
+| Apple M2 Max 32 GB | ~1.2 GB | fits |
 ## Overview
 - **Base:** `mistralai/Voxtral-Mini-3B-2507` — 3B speech-understanding model