Update README.md
Browse files
README.md
CHANGED
|
@@ -27,7 +27,7 @@ The Gradio demo is [here](https://huggingface.co/spaces/pyf98/OWSM_v3_demo).
|
|
| 27 |
Additionally, OWSM v4 applies 8 times subsampling (instead of 4 times in OWSM v3.1) to the log Mel features, leading to a final resolution of 80 ms in the encoder.
|
| 28 |
When running inference, we recommend setting `maxlenratio=1.0` (default) instead of smaller values.
|
| 29 |
|
| 30 |
-
This repo contains a
|
| 31 |
It is trained on 320k hours of public speech data.
|
| 32 |
The newly curated data are publicly released: https://huggingface.co/datasets/espnet/yodas_owsmv4
|
| 33 |
|
|
|
|
| 27 |
Additionally, OWSM v4 applies 8 times subsampling (instead of 4 times in OWSM v3.1) to the log Mel features, leading to a final resolution of 80 ms in the encoder.
|
| 28 |
When running inference, we recommend setting `maxlenratio=1.0` (default) instead of smaller values.
|
| 29 |
|
| 30 |
+
This repo contains a small-sized model with 370M parameters, developed by [Yifan Peng](https://pyf98.github.io/) (CMU).
|
| 31 |
It is trained on 320k hours of public speech data.
|
| 32 |
The newly curated data are publicly released: https://huggingface.co/datasets/espnet/yodas_owsmv4
|
| 33 |
|