End of training
README.md
CHANGED
|
@@ -6,8 +6,10 @@ base_model: openai/whisper-tiny
|
|
| 6 |
tags:
|
| 7 |
- generated_from_trainer
|
| 8 |
datasets:
|
|
|
|
| 9 |
- voice-biomarkers/openslr-32-hq-SA-languages-Afrikaans
|
| 10 |
- google/fleurs
|
|
|
|
| 11 |
metrics:
|
| 12 |
- wer
|
| 13 |
model-index:
|
|
@@ -18,15 +20,16 @@ model-index:
|
|
| 18 |
type: automatic-speech-recognition
|
| 19 |
dataset:
|
| 20 |
name: Common Voice 17.0
|
| 21 |
-
type:
|
| 22 |
config: af_za
|
| 23 |
split: test
|
| 24 |
args: af_za
|
| 25 |
metrics:
|
| 26 |
- name: Wer
|
| 27 |
type: wer
|
| 28 |
-
value:
|
| 29 |
---
|
|
|
|
| 30 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
| 31 |
should probably proofread and complete it, then remove this comment. -->
|
| 32 |
|
|
@@ -34,9 +37,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
| 34 |
|
| 35 |
This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
|
| 36 |
It achieves the following results on the evaluation set:
|
| 37 |
-
- Loss: 1.
|
| 38 |
-
- Wer:
|
| 39 |
-
- Cer:
|
| 40 |
|
| 41 |
## Model description
|
| 42 |
|
|
@@ -66,28 +69,28 @@ The following hyperparameters were used during training:
|
|
| 66 |
|
| 67 |
### Training results
|
| 68 |
|
| 69 |
-
| Training Loss | Epoch
|
| 70 |
-
|:-------------:|:-----
|
| 71 |
-
| 1.
|
| 72 |
-
| 0.
|
| 73 |
-
| 0.
|
| 74 |
-
| 0.
|
| 75 |
-
| 0.
|
| 76 |
-
| 0.
|
| 77 |
-
| 0.
|
| 78 |
-
| 0.
|
| 79 |
-
| 0.
|
| 80 |
-
| 0.
|
| 81 |
-
| 0.
|
| 82 |
-
| 0.
|
| 83 |
-
| 0.
|
| 84 |
-
| 0.
|
| 85 |
-
| 0.
|
| 86 |
-
| 0.
|
| 87 |
-
| 0.
|
| 88 |
-
| 0.
|
| 89 |
-
| 0.
|
| 90 |
-
| 0.
|
| 91 |
|
| 92 |
|
| 93 |
### Framework versions
|
|
@@ -96,16 +99,3 @@ The following hyperparameters were used during training:
|
|
| 96 |
- Pytorch 2.3.0+cu121
|
| 97 |
- Datasets 2.19.1
|
| 98 |
- Tokenizers 0.19.1
|
| 99 |
-
-## Citation
-
-Please cite the model using the following BibTeX entry:
-
-```bibtex
-@misc{deepdml/whisper-tiny-af-mix-norm,
-title={Fine-tuned Whisper tiny ASR model for speech recognition in Afrikaans},
-author={Jimenez, David},
-howpublished={\url{https://huggingface.co/deepdml/whisper-tiny-af-mix-norm}},
-year={2026}
-}
-```
|
|
|
|
| 6 |
tags:
|
| 7 |
- generated_from_trainer
|
| 8 |
datasets:
|
| 9 |
+
- andreoosthuizen/afrikaans-30s
|
| 10 |
- voice-biomarkers/openslr-32-hq-SA-languages-Afrikaans
|
| 11 |
- google/fleurs
|
| 12 |
+
- dsfsi-anv/multilingual-nchlt-dataset
|
| 13 |
metrics:
|
| 14 |
- wer
|
| 15 |
model-index:
|
|
|
|
| 20 |
type: automatic-speech-recognition
|
| 21 |
dataset:
|
| 22 |
name: Common Voice 17.0
|
| 23 |
+
type: andreoosthuizen/afrikaans-30s
|
| 24 |
config: af_za
|
| 25 |
split: test
|
| 26 |
args: af_za
|
| 27 |
metrics:
|
| 28 |
- name: Wer
|
| 29 |
type: wer
|
| 30 |
+
value: 44.935064935064936
|
| 31 |
---
|
| 32 |
+
|
| 33 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
| 34 |
should probably proofread and complete it, then remove this comment. -->
|
| 35 |
|
|
|
|
| 37 |
|
| 38 |
This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
|
| 39 |
It achieves the following results on the evaluation set:
|
| 40 |
+- Loss: 1.1668
+- Wer: 44.9351
+- Cer: 18.2741
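For readers unfamiliar with the two metrics, WER and CER are conventionally an edit distance divided by the reference length, expressed here as a percentage. The following is a minimal pure-Python sketch of that definition, not the evaluation code behind the numbers above: the function names are illustrative, no text normalization is applied, and counting spaces as characters in CER is an assumption that differs between metric libraries.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (words or characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(references, hypotheses, unit="word"):
    """Corpus-level error rate in percent; unit is "word" (WER) or "char" (CER)."""
    errors = 0
    total = 0
    for ref, hyp in zip(references, hypotheses):
        ref_units = ref.split() if unit == "word" else list(ref)
        hyp_units = hyp.split() if unit == "word" else list(hyp)
        errors += edit_distance(ref_units, hyp_units)
        total += len(ref_units)
    return 100.0 * errors / total


# Toy strings, made up for illustration only.
refs = ["a b c d"]
hyps = ["a x c d"]
print(error_rate(refs, hyps, unit="word"))  # one substitution out of four words
print(error_rate(refs, hyps, unit="char"))
```

Errors are summed over the whole corpus before dividing, so long utterances weigh more than short ones; averaging per-utterance rates would give a different number.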
|
| 43 |
|
| 44 |
## Model description
|
| 45 |
|
|
|
|
| 69 |
|
| 70 |
### Training results
|
| 71 |
|
| 72 |
+| Training Loss | Epoch | Step | Validation Loss | Wer     | Cer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| 1.3743        | 0.05  | 100  | 1.5130          | 63.5498 | 26.0798 |
+| 0.7811        | 0.1   | 200  | 1.2491          | 51.7403 | 19.9455 |
+| 0.5477        | 0.15  | 300  | 1.1820          | 48.1732 | 18.7168 |
+| 0.4289        | 0.2   | 400  | 1.1518          | 49.6277 | 19.5203 |
+| 0.3573        | 0.25  | 500  | 1.1410          | 48.6234 | 19.9044 |
+| 0.2835        | 0.3   | 600  | 1.1289          | 47.3074 | 19.3649 |
+| 0.2602        | 0.35  | 700  | 1.1318          | 45.7835 | 19.3150 |
+| 0.217         | 0.4   | 800  | 1.1297          | 46.8398 | 19.1361 |
+| 0.2007        | 0.45  | 900  | 1.1358          | 47.4286 | 20.3296 |
+| 0.1798        | 0.5   | 1000 | 1.1383          | 47.4459 | 20.5906 |
+| 0.1548        | 0.55  | 1100 | 1.1497          | 49.3853 | 21.7723 |
+| 0.1384        | 0.6   | 1200 | 1.1525          | 48.5022 | 20.2827 |
+| 0.1325        | 0.65  | 1300 | 1.1574          | 48.8831 | 20.3120 |
+| 0.1259        | 0.7   | 1400 | 1.1625          | 45.4372 | 18.7637 |
+| 0.125         | 0.75  | 1500 | 1.1606          | 44.7100 | 18.1128 |
+| 0.1083        | 0.8   | 1600 | 1.1609          | 48.0519 | 20.1507 |
+| 0.1169        | 0.85  | 1700 | 1.1660          | 47.6017 | 19.6112 |
+| 0.1008        | 0.9   | 1800 | 1.1644          | 47.8095 | 19.9924 |
+| 0.1016        | 0.95  | 1900 | 1.1658          | 44.7273 | 18.1626 |
+| 0.0983        | 1.0   | 2000 | 1.1668          | 44.9351 | 18.2741 |
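The training results show that the last evaluation step is not the best one on every metric. A short sketch that scans the logged rows and reports the best step per metric; the tuples are copied from the table, and the variable names are illustrative.

```python
# (step, validation_loss, wer, cer), copied from the training results table.
rows = [
    (100, 1.5130, 63.5498, 26.0798),
    (200, 1.2491, 51.7403, 19.9455),
    (300, 1.1820, 48.1732, 18.7168),
    (400, 1.1518, 49.6277, 19.5203),
    (500, 1.1410, 48.6234, 19.9044),
    (600, 1.1289, 47.3074, 19.3649),
    (700, 1.1318, 45.7835, 19.3150),
    (800, 1.1297, 46.8398, 19.1361),
    (900, 1.1358, 47.4286, 20.3296),
    (1000, 1.1383, 47.4459, 20.5906),
    (1100, 1.1497, 49.3853, 21.7723),
    (1200, 1.1525, 48.5022, 20.2827),
    (1300, 1.1574, 48.8831, 20.3120),
    (1400, 1.1625, 45.4372, 18.7637),
    (1500, 1.1606, 44.7100, 18.1128),
    (1600, 1.1609, 48.0519, 20.1507),
    (1700, 1.1660, 47.6017, 19.6112),
    (1800, 1.1644, 47.8095, 19.9924),
    (1900, 1.1658, 44.7273, 18.1626),
    (2000, 1.1668, 44.9351, 18.2741),
]

best_loss = min(rows, key=lambda r: r[1])  # lowest validation loss
best_wer = min(rows, key=lambda r: r[2])   # lowest word error rate
best_cer = min(rows, key=lambda r: r[3])   # lowest character error rate
final = rows[-1]

print(f"lowest validation loss: step {best_loss[0]} ({best_loss[1]:.4f})")
print(f"lowest WER: step {best_wer[0]} ({best_wer[2]:.4f})")
print(f"lowest CER: step {best_cer[0]} ({best_cer[3]:.4f})")
print(f"final step {final[0]}: WER {final[2]:.4f}, CER {final[3]:.4f}")
```

The headline metrics in the card match the step 2000 row. Validation loss bottoms out much earlier than WER and CER, which is why the choice of selection metric matters. With the Hugging Face `Trainer`, one way to keep the lowest-WER checkpoint is `load_best_model_at_end=True` together with `metric_for_best_model="wer"` and `greater_is_better=False`; whether this run used those settings is not stated in the card.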
|
| 94 |
|
| 95 |
|
| 96 |
### Framework versions
|
|
|
|
| 99 |
- Pytorch 2.3.0+cu121
|
| 100 |
- Datasets 2.19.1
|
| 101 |
- Tokenizers 0.19.1
|