Automatic Speech Recognition
Transformers
Safetensors
voxtral_realtime
fp8
quantized
vllm
mistral
compressed-tensors
llm-compressor
Instructions to use ghecko78/Voxtral-Mini-4B-Realtime-2602-FP8-Dynamic with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use ghecko78/Voxtral-Mini-4B-Realtime-2602-FP8-Dynamic with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="ghecko78/Voxtral-Mini-4B-Realtime-2602-FP8-Dynamic")# Load model directly from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq processor = AutoProcessor.from_pretrained("ghecko78/Voxtral-Mini-4B-Realtime-2602-FP8-Dynamic") model = AutoModelForSpeechSeq2Seq.from_pretrained("ghecko78/Voxtral-Mini-4B-Realtime-2602-FP8-Dynamic") - Notebooks
- Google Colab
- Kaggle
Error Loading using vLLM
3
#1 opened 14 days ago
by
suleimanelkhoury