Automatic Speech Recognition
ESPnet
multilingual
audio
speech-translation
language-identification
Eval Results
How to use espnet/owsm_ctc_v4_1B with ESPnet:
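    import soundfile
    from espnet2.bin.asr_inference import Speech2Text

    # Download the pretrained model from the Hugging Face Hub
    model = Speech2Text.from_pretrained("espnet/owsm_ctc_v4_1B")

    # Read the waveform and its sampling rate
    speech, rate = soundfile.read("speech.wav")

    # Take the text of the best hypothesis
    text, *_ = model(speech)[0]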
Add pipeline tag and link to paper
#1
opened by nielsr (HF Staff)
This PR improves the model card by:
- Adding pipeline_tag: automatic-speech-recognition so that it shows up in the model list at https://huggingface.co/models?pipeline_tag=automatic-speech-recognition.
- Linking to the paper on Hugging Face: https://huggingface.co/papers/2506.00338
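For illustration, the first change amounts to adding one key to the YAML front matter at the top of the model card (README.md). The following is a minimal sketch, not the code behind this PR: the helper name add_pipeline_tag and the sample card text are hypothetical, and it edits the front matter as plain text rather than parsing YAML.

```python
def add_pipeline_tag(card_text: str, tag: str) -> str:
    """Return card_text with `pipeline_tag: <tag>` set in its YAML front matter.

    Hypothetical helper for illustration only; it treats the front matter
    as plain lines and does not validate the YAML.
    """
    new_line = f"pipeline_tag: {tag}"
    lines = card_text.split("\n")

    if lines and lines[0].strip() == "---":
        # Look for the closing '---' of an existing front matter block.
        for i in range(1, len(lines)):
            if lines[i].strip() == "---":
                header = lines[1:i]
                # Drop any previous pipeline_tag so the key appears once.
                kept = [l for l in header if not l.startswith("pipeline_tag:")]
                return "\n".join(["---", *kept, new_line, *lines[i:]])

    # No complete front matter found: create one above the card body.
    return "\n".join(["---", new_line, "---", card_text])


# Sample card text (assumed, not the real model card).
card = "---\nlanguage: multilingual\n---\n# OWSM-CTC"
updated = add_pipeline_tag(card, "automatic-speech-recognition")
print(updated)
```

With the key in place, the Hub can list the model under the automatic-speech-recognition filter linked above.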
Fhrozen changed pull request status to merged