NeoRoth
/

voxtral-4b-realtime-2602-mlx

Automatic Speech Recognition

mixed-precision

Model card Files Files and versions

NeoRoth commited on Mar 10

Commit

14de29c

·

verified ·

1 Parent(s): 33a7ce1

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +59 -0

README.md ADDED Viewed

	@@ -0,0 +1,59 @@

+---
+license: apache-2.0
+base_model: mistralai/Voxtral-Mini-4B-Realtime-2602
+tags:
+  - mlx
+  - safetensors
+  - voxtral
+  - realtime
+  - mxfp4
+  - mixed-precision
+  - apple-silicon
+  - speech-to-text
+language:
+  - fr
+  - en
+  - de
+  - es
+  - it
+  - pt
+  - nl
+  - pl
+  - ru
+  - uk
+  - ja
+  - ko
+  - zh
+  - ar
+  - hi
+library_name: mlx
+pipeline_tag: automatic-speech-recognition
+---
+# Voxtral Mini 4B Realtime 2602 — MLX
+MLX quantized weights for [Voxtral-Mini-4B-Realtime-2602](https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602) on Apple Silicon.
+## Architecture
+Voxtral Realtime differs from the standard Voxtral Mini 3B. It uses **additive fusion** (audio embeddings + token embeddings) instead of sequence concatenation, and a **causal transformer encoder** with sliding window attention (window=750) instead of a bidirectional Whisper-style encoder. This makes it suitable for streaming / low-latency inference.
+## Variants
+| Folder | Quantization | Description |
+|--------|-------------|-------------|
+| `mlx-mxfp4-mixed/` | **MXFP4 mixed precision** | Text decoder: MXFP4 4-bit (group_size=32). Audio encoder/projector: 8-bit affine (group_size=64). Embeddings: full precision. |
+## License
+This model is distributed under the **Apache 2.0** license, following the upstream [Voxtral-Mini-4B-Realtime-2602](https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602) license.
+## Requirements
+- Apple Silicon (M1+, M5+ recommended for native MXFP4)
+- MLX >= 0.30.0
+- Python 3.11+
+## Source
+Converted from [mistralai/Voxtral-Mini-4B-Realtime-2602](https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602) using [oriloq-mlx](https://github.com/oriloq-s/oriloq-mlx).