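How to use espnet/owsm_v3.1_ebf with ESPnet:
```python
import soundfile
from espnet2.bin.asr_inference import Speech2Text

# Download and load the pretrained model from the Hub
model = Speech2Text.from_pretrained("espnet/owsm_v3.1_ebf")

# Read the audio and keep the text of the top hypothesis
speech, rate = soundfile.read("speech.wav")
text, *_ = model(speech)[0]
```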
Update README.md
README.md CHANGED
@@ -62,7 +62,7 @@ Specifically, it supports the following speech-to-text tasks:
 @inproceedings{owsm-v4,
 title={{OWSM} v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning},
 author={Yifan Peng and Shakeel Muhammad and Yui Sudo and William Chen and Jinchuan Tian and Chyi-Jiunn Lin and Shinji Watanabe},
-booktitle={Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH)
+booktitle={Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH)},
 year={2025},
 }
 ```