espnet
/

owsm_v3

@@ -14,7 +14,7 @@ license: cc-by-4.0
 [OWSM](https://arxiv.org/abs/2309.13876) is an Open Whisper-style Speech Model from [CMU WAVLab](https://www.wavlab.org/). It reproduces Whisper-style training using publicly available data and an open-source toolkit [ESPnet](https://github.com/espnet/espnet).
-Our demo is available [here](https://huggingface.co/spaces/pyf98/OWSM_v3_demo).
 OWSM v3 has 889M parameters and is trained on 180k hours of public speech data. It supports various speech-to-text tasks:
 - Speech recognition

 [OWSM](https://arxiv.org/abs/2309.13876) is an Open Whisper-style Speech Model from [CMU WAVLab](https://www.wavlab.org/). It reproduces Whisper-style training using publicly available data and an open-source toolkit [ESPnet](https://github.com/espnet/espnet).
+Our demo is available [here](https://huggingface.co/spaces/pyf98/OWSM_v3_demo). The [project page](https://www.wavlab.org/activities/2024/owsm/) contains various resources.
 OWSM v3 has 889M parameters and is trained on 180k hours of public speech data. It supports various speech-to-text tasks:
 - Speech recognition