Mention 75k and 0.7.0 phonemizer
Browse files
README.md
CHANGED
|
@@ -143,8 +143,8 @@ base_model:
|
|
| 143 |
|
| 144 |
# Whisper IPA
|
| 145 |
|
| 146 |
-
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Fine-tuned on
|
| 147 |
-
of labelled synthetic IPA data (generated using the goruut 0.
|
| 148 |
to generalise to many languages, datasets and domains **without** the need for fine-tuning.
|
| 149 |
|
| 150 |
Whisper was proposed in the paper [Robust Speech Recognition via Large-Scale Weak Supervision](https://arxiv.org/abs/2212.04356)
|
|
|
|
| 143 |
|
| 144 |
# Whisper IPA
|
| 145 |
|
| 146 |
+
Whisper is a pre-trained model for automatic speech recognition (ASR) and speech translation. Fine-tuned on 75000 wavs
|
| 147 |
+
of labelled synthetic IPA data (generated using the goruut 0.7.0 phonemizer), Whisper models demonstrate a strong ability
|
| 148 |
to generalise to many languages, datasets and domains **without** the need for fine-tuning.
|
| 149 |
|
| 150 |
Whisper was proposed in the paper [Robust Speech Recognition via Large-Scale Weak Supervision](https://arxiv.org/abs/2212.04356)
|