ellamind/Voxtral-Mini-4B-Realtime-8bit-mlx

Automatic Speech Recognition

🇪🇺 Region: EU


jphme committed on Feb 20

Commit 50e9028 (verified) · 1 Parent(s): 49cde31

Update README.md

update description

Files changed (1)

README.md +2 -36


@@ -17,6 +17,8 @@ pipeline_tag: automatic-speech-recognition
 This is an 8-bit quantized [MLX](https://github.com/ml-explore/mlx) version of [Voxtral Mini 4B Realtime](https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602) by Mistral AI, converted using [voxmlx](https://github.com/awni/voxmlx).
+This version was created for use with [Supervoxtral](https://github.com/jphme/supervoxtral), enabling blazingly-fast realtime transcription on MacOS.
 ## Model Details
 - **Base model:** [mistralai/Voxtral-Mini-4B-Realtime-2602](https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602)
@@ -29,42 +31,6 @@ This is an 8-bit quantized [MLX](https://github.com/ml-explore/mlx) version of [
 Voxtral Mini is a speech-to-text model that supports 13+ languages with sub-500ms latency. This version has been quantized to 8-bit precision for efficient inference on Apple Silicon using the MLX framework.
-## Usage
-Install voxmlx:
-```bash
-pip install voxmlx
-```
-Transcribe from a file:
-```python
-import voxmlx
-model = voxmlx.load("ellamind/Voxtral-Mini-4B-Realtime-8bit-mlx")
-text = voxmlx.transcribe(model, "audio.wav")
-print(text)
-```
-Transcribe from microphone:
-```bash
-voxmlx-transcribe --model ellamind/Voxtral-Mini-4B-Realtime-8bit-mlx
-```
-## Conversion
-This model was converted using [voxmlx](https://github.com/awni/voxmlx):
-```bash
-pip install voxmlx
-voxmlx-convert \
-  --hf-path mistralai/Voxtral-Mini-4B-Realtime-2602 \
-  --mlx-path voxtral-mlx-8bit \
-  --quantize \
-  --bits 8
-```
 ## Credits