Automatic Speech Recognition
Transformers
PyTorch
JAX
Safetensors
whisper
audio
hf-asr-leaderboard
Eval Results
Instructions to use openai/whisper-large-v3 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use openai/whisper-large-v3 with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="openai/whisper-large-v3")# Load model directly from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq processor = AutoProcessor.from_pretrained("openai/whisper-large-v3") model = AutoModelForSpeechSeq2Seq.from_pretrained("openai/whisper-large-v3") - Inference
- Notebooks
- Google Colab
- Kaggle
Auto speech/languages detection in Real Time streaming?
#89
by sabys - opened
Hi, i am looking to get some help for fellow users on this model. I am looking for Auto speech/languages detection feature in Real Time streaming mode. Has anyone used this model to use it this way? if yes could you help with some information on how to , documentations.
I am looking to use in similar way as Google Speech to Text. The model supports but is really expensive.
https://cloud.google.com/speech-to-text/docs/transcription-model