google/fleurs
Viewer • Updated • 768k • 57.7k • 402
This model is a fine-tuned version of openai/whisper-medium on the google/fleurs dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
|---|---|---|---|---|---|
| 0.107 | 0.2 | 1000 | 0.6957 | 43.6179 | 14.1782 |
| 0.0471 | 0.4 | 2000 | 0.7596 | 39.8086 | 12.9539 |
| 0.0273 | 0.6 | 3000 | 0.8375 | 40.2070 | 12.8450 |
| 0.0077 | 1.163 | 4000 | 0.8775 | 39.8814 | 12.8099 |
| 0.0149 | 1.363 | 5000 | 0.8789 | 39.4004 | 12.6049 |
Please cite the model using the following BibTeX entry:
@misc{deepdml/whisper-medium-ig-mix-norm,
title={Fine-tuned Whisper medium ASR model for speech recognition in Lingala},
author={Jimenez, David},
howpublished={\url{https://huggingface.co/deepdml/whisper-medium-ig-mix-norm}},
year={2025}
}
Base model
openai/whisper-medium