How to use openai/whisper-base with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("automatic-speech-recognition", model="openai/whisper-base")

    # Load model directly
    from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq

    processor = AutoProcessor.from_pretrained("openai/whisper-base")
    model = AutoModelForSpeechSeq2Seq.from_pretrained("openai/whisper-base")
Help regarding malformed files?
#34
by DribDrab - opened
Uploading WAV or WebM files here in the demo works fine, but uploading the same files through the Inference API reports them as malformed.
Is there any way to change this or troubleshoot the issue? I've already tried fixing my files, but it seems to be something on the API side.
Help would be greatly appreciated!
I've attached a WAV file that works here in the demo but fails when submitted through the API.
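One way to narrow this down is to inspect the exact bytes the client is about to send and confirm they still start with a valid container header. This is a minimal sketch, not a diagnosis: it assumes the bytes might be altered before they reach the API, for example by base64-encoding or by wrapping them in a form upload. The helper names `sniff_audio_container` and `make_test_wav` are hypothetical and only for illustration; the magic-byte checks follow the RIFF/WAVE and EBML (WebM/Matroska) header layouts.

```python
import base64
import io
import wave


def sniff_audio_container(data: bytes) -> str:
    """Guess the container type from the leading magic bytes (hypothetical helper)."""
    # WAV files start with "RIFF", a 4-byte size field, then "WAVE".
    if len(data) >= 12 and data[:4] == b"RIFF" and data[8:12] == b"WAVE":
        return "wav"
    # WebM and Matroska files start with the EBML header ID 0x1A45DFA3.
    if data[:4] == b"\x1a\x45\xdf\xa3":
        return "webm-or-matroska"
    return "unknown"


def make_test_wav(seconds: float = 0.1, rate: int = 16000) -> bytes:
    """Build a short silent 16-bit mono WAV in memory (illustration only)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(rate)
        w.writeframes(b"\x00\x00" * int(seconds * rate))
    return buf.getvalue()


payload = make_test_wav()

# Raw file bytes keep the RIFF/WAVE header intact.
print(sniff_audio_container(payload))                    # wav

# The same audio after base64-encoding no longer looks like a WAV file.
print(sniff_audio_container(base64.b64encode(payload)))  # unknown
```

If the real request body comes back as `unknown` while the file on disk is recognized, the client is changing the data in transit and the file itself is probably fine. If both are recognized, the difference more likely lies in how the API endpoint decodes the audio.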