procit006/STT_TTS_MozillaAndSTC_VoiceTextData_August27
Viewer • Updated • 86.9k • 5
This model is a fine-tuned version of openai/whisper-small on the Common Voice + STC Aug 27 + Speechgen dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 0.0892 | 0.2047 | 500 | 0.0897 | 6.6898 |
| 0.048 | 0.4093 | 1000 | 0.0480 | 3.4328 |
| 0.0379 | 0.6140 | 1500 | 0.0339 | 2.2033 |
| 0.0299 | 0.8187 | 2000 | 0.0269 | 2.2453 |
| 0.0074 | 1.0233 | 2500 | 0.0216 | 1.4842 |
| 0.0055 | 1.2280 | 3000 | 0.0202 | 1.3962 |
Base model
openai/whisper-small