Automatic Speech Recognition
Transformers
PyTorch
TensorFlow
Safetensors
English
speech_to_text
audio
Instructions to use facebook/s2t-medium-librispeech-asr with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use facebook/s2t-medium-librispeech-asr with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="facebook/s2t-medium-librispeech-asr")# Load model directly from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq processor = AutoProcessor.from_pretrained("facebook/s2t-medium-librispeech-asr") model = AutoModelForSpeechSeq2Seq.from_pretrained("facebook/s2t-medium-librispeech-asr") - Notebooks
- Google Colab
- Kaggle
Fix examples: input_ids -> input_features
#1
by sanchit-gandhi - opened
Model expects args of input_features not input_ids: https://github.com/huggingface/transformers/blob/fc95386ea12fc11942cc7f2a4f99ef9602d774ef/src/transformers/models/speech_to_text/modeling_speech_to_text.py#L1298
Related: https://github.com/huggingface/transformers/issues/20575
cc @patrickvonplaten @anton-l
Thanks!
patrickvonplaten changed pull request status to merged