deepdml
/

whisper-tiny-af-mix-norm

@@ -6,8 +6,10 @@ base_model: openai/whisper-tiny
 tags:
 - generated_from_trainer
 datasets:
 - voice-biomarkers/openslr-32-hq-SA-languages-Afrikaans
 - google/fleurs
 metrics:
 - wer
 model-index:
@@ -18,15 +20,16 @@ model-index:
       type: automatic-speech-recognition
     dataset:
       name: Common Voice 17.0
-      type: voice-biomarkers/openslr-32-hq-SA-languages-Afrikaans
       config: af_za
       split: test
       args: af_za
     metrics:
     - name: Wer
       type: wer
-      value: 52.17316017316017
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
@@ -34,9 +37,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3668
-- Wer: 52.1732
-- Cer: 20.9395
 ## Model description
@@ -66,28 +69,28 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Wer     | Cer     |
-|:-------------:|:-------:|:----:|:---------------:|:-------:|:-------:|
-| 1.3199        | 0.05    | 100  | 1.4770          | 59.2208 | 24.5022 |
-| 0.6393        | 1.0315  | 200  | 1.2510          | 51.2554 | 20.9454 |
-| 0.3811        | 2.013   | 300  | 1.2197          | 49.4545 | 20.1155 |
-| 0.261         | 2.063   | 400  | 1.2089          | 48.3290 | 19.2036 |
-| 0.186         | 3.0445  | 500  | 1.2141          | 48.0693 | 19.8575 |
-| 0.1459        | 4.026   | 600  | 1.2341          | 49.8701 | 20.2621 |
-| 0.0963        | 5.0075  | 700  | 1.2517          | 48.4675 | 19.5437 |
-| 0.0809        | 5.0575  | 800  | 1.2674          | 51.0823 | 21.0715 |
-| 0.0536        | 6.039   | 900  | 1.2812          | 48.2597 | 19.5408 |
-| 0.0432        | 7.0205  | 1000 | 1.3003          | 48.5022 | 19.4910 |
-| 0.0379        | 8.002   | 1100 | 1.3117          | 51.6190 | 21.2298 |
-| 0.0333        | 8.052   | 1200 | 1.3314          | 52.3463 | 21.7078 |
-| 0.0247        | 9.0335  | 1300 | 1.3389          | 52.0    | 21.4644 |
-| 0.0201        | 10.015  | 1400 | 1.3484          | 51.4113 | 22.1769 |
-| 0.0194        | 10.065  | 1500 | 1.3469          | 51.8442 | 21.0685 |
-| 0.0191        | 11.0465 | 1600 | 1.3536          | 52.4502 | 21.3471 |
-| 0.0179        | 12.028  | 1700 | 1.3611          | 51.7229 | 21.1272 |
-| 0.0155        | 13.0095 | 1800 | 1.3637          | 52.4329 | 20.9512 |
-| 0.0159        | 13.0595 | 1900 | 1.3651          | 52.0346 | 20.9483 |
-| 0.0152        | 14.041  | 2000 | 1.3668          | 52.1732 | 20.9395 |
 ### Framework versions
@@ -96,16 +99,3 @@ The following hyperparameters were used during training:
 - Pytorch 2.3.0+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1
-## Citation
-Please cite the model using the following BibTeX entry:
-```bibtex
-@misc{deepdml/whisper-tiny-af-mix-norm,
-      title={Fine-tuned Whisper tiny ASR model for speech recognition in Afrikaans},
-      author={Jimenez, David},
-      howpublished={\url{https://huggingface.co/deepdml/whisper-tiny-af-mix-norm}},
-      year={2026}
-    }
-```

 tags:
 - generated_from_trainer
 datasets:
+- andreoosthuizen/afrikaans-30s
 - voice-biomarkers/openslr-32-hq-SA-languages-Afrikaans
 - google/fleurs
+- dsfsi-anv/multilingual-nchlt-dataset
 metrics:
 - wer
 model-index:
       type: automatic-speech-recognition
     dataset:
       name: Common Voice 17.0
+      type: andreoosthuizen/afrikaans-30s
       config: af_za
       split: test
       args: af_za
     metrics:
     - name: Wer
       type: wer
+      value: 44.935064935064936
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1668
+- Wer: 44.9351
+- Cer: 18.2741
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     | Cer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| 1.3743        | 0.05  | 100  | 1.5130          | 63.5498 | 26.0798 |
+| 0.7811        | 0.1   | 200  | 1.2491          | 51.7403 | 19.9455 |
+| 0.5477        | 0.15  | 300  | 1.1820          | 48.1732 | 18.7168 |
+| 0.4289        | 0.2   | 400  | 1.1518          | 49.6277 | 19.5203 |
+| 0.3573        | 0.25  | 500  | 1.1410          | 48.6234 | 19.9044 |
+| 0.2835        | 0.3   | 600  | 1.1289          | 47.3074 | 19.3649 |
+| 0.2602        | 0.35  | 700  | 1.1318          | 45.7835 | 19.3150 |
+| 0.217         | 0.4   | 800  | 1.1297          | 46.8398 | 19.1361 |
+| 0.2007        | 0.45  | 900  | 1.1358          | 47.4286 | 20.3296 |
+| 0.1798        | 0.5   | 1000 | 1.1383          | 47.4459 | 20.5906 |
+| 0.1548        | 0.55  | 1100 | 1.1497          | 49.3853 | 21.7723 |
+| 0.1384        | 0.6   | 1200 | 1.1525          | 48.5022 | 20.2827 |
+| 0.1325        | 0.65  | 1300 | 1.1574          | 48.8831 | 20.3120 |
+| 0.1259        | 0.7   | 1400 | 1.1625          | 45.4372 | 18.7637 |
+| 0.125         | 0.75  | 1500 | 1.1606          | 44.7100 | 18.1128 |
+| 0.1083        | 0.8   | 1600 | 1.1609          | 48.0519 | 20.1507 |
+| 0.1169        | 0.85  | 1700 | 1.1660          | 47.6017 | 19.6112 |
+| 0.1008        | 0.9   | 1800 | 1.1644          | 47.8095 | 19.9924 |
+| 0.1016        | 0.95  | 1900 | 1.1658          | 44.7273 | 18.1626 |
+| 0.0983        | 1.0   | 2000 | 1.1668          | 44.9351 | 18.2741 |
 ### Framework versions
 - Pytorch 2.3.0+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1