Update README.md
Browse files
README.md
CHANGED
|
@@ -14,7 +14,7 @@ license: cc-by-4.0
|
|
| 14 |
|
| 15 |
[OWSM](https://arxiv.org/abs/2309.13876) is an Open Whisper-style Speech Model from [CMU WAVLab](https://www.wavlab.org/). It reproduces Whisper-style training using publicly available data and an open-source toolkit [ESPnet](https://github.com/espnet/espnet).
|
| 16 |
|
| 17 |
-
Our demo is available [here](https://huggingface.co/spaces/pyf98/OWSM_v3_demo).
|
| 18 |
|
| 19 |
OWSM v3 has 889M parameters and is trained on 180k hours of public speech data. It supports various speech-to-text tasks:
|
| 20 |
- Speech recognition
|
|
|
|
| 14 |
|
| 15 |
[OWSM](https://arxiv.org/abs/2309.13876) is an Open Whisper-style Speech Model from [CMU WAVLab](https://www.wavlab.org/). It reproduces Whisper-style training using publicly available data and an open-source toolkit [ESPnet](https://github.com/espnet/espnet).
|
| 16 |
|
| 17 |
+
Our demo is available [here](https://huggingface.co/spaces/pyf98/OWSM_v3_demo). The [project page](https://www.wavlab.org/activities/2024/owsm/) contains various resources.
|
| 18 |
|
| 19 |
OWSM v3 has 889M parameters and is trained on 180k hours of public speech data. It supports various speech-to-text tasks:
|
| 20 |
- Speech recognition
|