Update README.md
Browse files
README.md
CHANGED
|
@@ -218,40 +218,77 @@ Stage 1 employed low threshold values (0 to 0.30 BLEU depending on dataset), whe
|
|
| 218 |
| NST | 250 | 250 |
|
| 219 |
| **Total** | **56,514** | **8,533** |
|
| 220 |
|
| 221 |
-
|
|
|
|
| 222 |
|
| 223 |
### Evaluation
|
| 224 |
|
| 225 |
|
| 226 |
-
#### WER
|
| 227 |
| Model size | | FLEURS | CommonVoice | NST |
|
| 228 |
|------------|---------|--------|-------------|------|
|
| 229 |
| [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **KBLab** | **13.2** | **12.9** | **11.2** |
|
| 230 |
| | OpenAI | 59.2 | 67.8 | 85.2 |
|
| 231 |
| [base](https://huggingface.co/KBLab/kb-whisper-base) | **KBLab** | **9.1** | **8.7** | **7.8** |
|
| 232 |
| | OpenAI | 39.6 | 52.1 | 53.4 |
|
| 233 |
-
| [small](https://huggingface.co/KBLab/kb-whisper-small)
|
| 234 |
| | OpenAI | 20.6 | 26.4 | 26.4 |
|
| 235 |
-
| [medium](https://huggingface.co/KBLab/kb-whisper-medium)
|
| 236 |
| | OpenAI | 12.1 | 15.8 | 17.1 |
|
| 237 |
-
| [large-v3](https://huggingface.co/KBLab/kb-whisper-large)
|
| 238 |
| | OpenAI | 7.8 | 9.5 | 11.3 |
|
| 239 |
|
|
|
|
| 240 |
|
| 241 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 242 |
| Model size | | FLEURS | CommonVoice | NST |
|
| 243 |
|------------|---------|--------|-------------|------|
|
| 244 |
-
| tiny | KBLab | **76.6** | **73.7** | **74.3** |
|
| 245 |
| | OpenAI | 26.9 | 21.1 | 24.0 |
|
| 246 |
-
| base | KBLab | **83.2** | **79.9** | **78.3** |
|
| 247 |
| | OpenAI | 41.1 | 32.5 | 36.9 |
|
| 248 |
-
| small
|
| 249 |
| | OpenAI | 64.0 | 56.5 | 58.2 |
|
| 250 |
-
| medium
|
| 251 |
| | OpenAI | 77.1 | 70.1 | 68.9 |
|
| 252 |
-
| large-v3
|
| 253 |
| | OpenAI | 84.9 | 79.1 | 75.1 |
|
| 254 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 255 |
### Acknowledgements
|
| 256 |
|
| 257 |
We acknowledge the EuroHPC Joint Undertaking for awarding this project access to the EuroHPC supercomputer LEONARDO, hosted by CINECA (Italy) and the LEONARDO consortium through an EuroHPC AI and Data-Intensive Applications Access call.
|
|
|
|
| 218 |
| NST | 250 | 250 |
|
| 219 |
| **Total** | **56,514** | **8,533** |
|
| 220 |
|
| 221 |
+
|
| 222 |
+
The default when loading our models through Hugging Face is **Stage 2**. We have however also uploaded continued pretraining checkpoints and tagged them. You can load these other checkpoints by specifying the `revision` in `.from_pretrained()`. The pretrained checkpoints tag can for example be found here: [`pretrained-checkpoint`](https://huggingface.co/KBLab/kb-whisper-large/tree/pretrained-checkpoint). The Stage 2 default model tag is named `standard`. We supply two different stage 2 checkpoints -- one with a more condensed style of transcribing -- under the name `subtitle`, and one more verbose called `strict`.
|
| 223 |
|
| 224 |
### Evaluation
|
| 225 |
|
| 226 |
|
| 227 |
+
#### WER compared to OpenAI
|
| 228 |
| Model size | | FLEURS | CommonVoice | NST |
|
| 229 |
|------------|---------|--------|-------------|------|
|
| 230 |
| [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **KBLab** | **13.2** | **12.9** | **11.2** |
|
| 231 |
| | OpenAI | 59.2 | 67.8 | 85.2 |
|
| 232 |
| [base](https://huggingface.co/KBLab/kb-whisper-base) | **KBLab** | **9.1** | **8.7** | **7.8** |
|
| 233 |
| | OpenAI | 39.6 | 52.1 | 53.4 |
|
| 234 |
+
| [small](https://huggingface.co/KBLab/kb-whisper-small) | **KBLab** | **7.3** | **6.4** | **6.6** |
|
| 235 |
| | OpenAI | 20.6 | 26.4 | 26.4 |
|
| 236 |
+
| [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **KBLab** | **6.6** | **5.4** | **5.8** |
|
| 237 |
| | OpenAI | 12.1 | 15.8 | 17.1 |
|
| 238 |
+
| [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **KBLab** | **5.4** | **4.1** | **5.2** |
|
| 239 |
| | OpenAI | 7.8 | 9.5 | 11.3 |
|
| 240 |
|
| 241 |
+
#### WER for different KBLab stage2 versions
|
| 242 |
|
| 243 |
+
| Model size | | FLEURS | CommonVoice | NST |
|
| 244 |
+
|------------|---------|--------|-------------|------|
|
| 245 |
+
| [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **standard** | **13.2** | **12.9** | **11.2** |
|
| 246 |
+
| | strict | 14.1 | 13.4 | 11.0 |
|
| 247 |
+
| | subtitle | 13.3 | 12.9 | 11.4 |
|
| 248 |
+
| [base](https://huggingface.co/KBLab/kb-whisper-base) | **standard** | **9.1** | **8.7** | **7.8** |
|
| 249 |
+
| | strict | 10.4 | 9.6 | 8.4 |
|
| 250 |
+
| | subtitle | 9.1 | 8.7 | 7.9 |
|
| 251 |
+
| [small](https://huggingface.co/KBLab/kb-whisper-small) | **standard** | **7.3** | **6.4** | **6.6** |
|
| 252 |
+
| | strict | 8.2 | 7.0 | 6.7 |
|
| 253 |
+
| | subtitle | 7.3 | 6.4 | 6.6 |
|
| 254 |
+
| [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **standard** | **6.6** | **5.4** | **5.8** |
|
| 255 |
+
| | strict | 6.8 | 5.4 | 6.0 |
|
| 256 |
+
| [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **standard** | **5.4** | **4.1** | **5.2** |
|
| 257 |
+
| | strict | 5.3 | 4.0 | 5.1 |
|
| 258 |
+
|
| 259 |
+
#### BLEU Score compared to OpenAI
|
| 260 |
| Model size | | FLEURS | CommonVoice | NST |
|
| 261 |
|------------|---------|--------|-------------|------|
|
| 262 |
+
| [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **KBLab** | **76.6** | **73.7** | **74.3** |
|
| 263 |
| | OpenAI | 26.9 | 21.1 | 24.0 |
|
| 264 |
+
| [base](https://huggingface.co/KBLab/kb-whisper-base) | **KBLab** | **83.2** | **79.9** | **78.3** |
|
| 265 |
| | OpenAI | 41.1 | 32.5 | 36.9 |
|
| 266 |
+
| [small](https://huggingface.co/KBLab/kb-whisper-small) | **KBLab** | **86.6** | **83.5** | **79.6** |
|
| 267 |
| | OpenAI | 64.0 | 56.5 | 58.2 |
|
| 268 |
+
| [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **KBLab** | **87.6** | **85.0** | **80.2** |
|
| 269 |
| | OpenAI | 77.1 | 70.1 | 68.9 |
|
| 270 |
+
| [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **KBLab** | **89.8** | **87.2** | **81.1** |
|
| 271 |
| | OpenAI | 84.9 | 79.1 | 75.1 |
|
| 272 |
|
| 273 |
+
#### BLEU Score for different KBLab stage2 versions
|
| 274 |
+
| Model size | | FLEURS | CommonVoice | NST |
|
| 275 |
+
|------------|---------|--------|-------------|------|
|
| 276 |
+
| [tiny](https://huggingface.co/KBLab/kb-whisper-tiny) | **standard** | **76.6** | **73.7** | **74.3** |
|
| 277 |
+
| | strict | 75.3 | 72.9 | 74.6 |
|
| 278 |
+
| | subtitle | 76.6 | 73.7 | 74.1 |
|
| 279 |
+
| [base](https://huggingface.co/KBLab/kb-whisper-base) | **standard** | **83.2** | **79.9** | **78.3** |
|
| 280 |
+
| | strict | 81.0 | 78.4 | 77.5 |
|
| 281 |
+
| | subtitle | 83.2 | 79.8 | 78.2 |
|
| 282 |
+
| [small](https://huggingface.co/KBLab/kb-whisper-small) | **standard** | **86.6** | **83.5** | **79.6** |
|
| 283 |
+
| | strict | 84.9 | 82.4 | 79.3 |
|
| 284 |
+
| | subtitle | 86.6 | 83.5 | 79.6 |
|
| 285 |
+
| [medium](https://huggingface.co/KBLab/kb-whisper-medium) | **standard** | **87.6** | **85.0** | **80.2** |
|
| 286 |
+
| | strict | 87.3 | 84.9 | 80.1 |
|
| 287 |
+
| [large-v3](https://huggingface.co/KBLab/kb-whisper-large) | **standard** | **89.8** | **87.2** | **81.1** |
|
| 288 |
+
| | strict | 90.0 | 87.4 | 81.2 |
|
| 289 |
+
|
| 290 |
+
|
| 291 |
+
|
| 292 |
### Acknowledgements
|
| 293 |
|
| 294 |
We acknowledge the EuroHPC Joint Undertaking for awarding this project access to the EuroHPC supercomputer LEONARDO, hosted by CINECA (Italy) and the LEONARDO consortium through an EuroHPC AI and Data-Intensive Applications Access call.
|