| | --- |
| | title: Voxtral Realtime 4B |
| | emoji: ๐๏ธ |
| | colorFrom: yellow |
| | colorTo: red |
| | sdk: static |
| | pinned: false |
| | license: apache-2.0 |
| | short_description: Speech-to-Text in the browser with transformers.js + WebGPU |
| | language: |
| | - en |
| | - fr |
| | - es |
| | - de |
| | - ru |
| | - zh |
| | - ja |
| | - it |
| | - pt |
| | - nl |
| | - ar |
| | - hi |
| | - ko |
| | --- |
| | |
| | # Voxtral Realtime 4B โ Live Speech-to-Text |
| |
|
| | Real-time speech transcription running entirely in your browser using [Voxtral-Mini-4B-Realtime](https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-2602) via [transformers.js](https://github.com/huggingface/transformers.js) + WebGPU. |
| |
|
| | - Click the mic to start listening |
| | - VAD automatically detects speech segments |
| | - Words appear as the model generates them |
| | - All processing happens locally โ no server needed |
| |
|