deepdml
/

whisper-tiny-ar-mix-norm

@@ -7,12 +7,13 @@ base_model: openai/whisper-tiny
 tags:
 - generated_from_trainer
 datasets:
-- deepdml/Tunisian_MSA
-- fixie-ai/common_voice_17_0
 - pain/MASC
 - google/fleurs
-- UBC-NLP/Casablanca
 - ymoslem/MediaSpeech
 metrics:
 - wer
 model-index:
@@ -23,12 +24,13 @@ model-index:
       type: automatic-speech-recognition
     dataset:
       name: Common Voice 17.0
-      type: deepdml/Tunisian_MSA
     metrics:
     - name: Wer
       type: wer
-      value: 51.49189328143075
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
@@ -36,9 +38,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5877
-- Wer: 51.4919
-- Cer: 17.9569
 ## Model description
@@ -70,24 +72,24 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step  | Validation Loss | Wer     | Cer     |
 |:-------------:|:------:|:-----:|:---------------:|:-------:|:-------:|
-| 1.009         | 0.0556 | 1000  | 0.8091          | 68.4159 | 26.2902 |
-| 0.7599        | 0.1111 | 2000  | 0.7226          | 62.5420 | 23.2768 |
-| 0.562         | 0.1667 | 3000  | 0.6874          | 59.7455 | 21.5321 |
-| 0.4697        | 0.2222 | 4000  | 0.6690          | 57.8891 | 20.7745 |
-| 0.3596        | 0.2778 | 5000  | 0.6601          | 58.0562 | 21.0386 |
-| 0.3156        | 0.3333 | 6000  | 0.6483          | 56.2366 | 20.0099 |
-| 0.2326        | 0.3889 | 7000  | 0.6359          | 55.6215 | 19.8258 |
-| 0.193         | 0.4444 | 8000  | 0.6243          | 55.0118 | 19.7842 |
-| 0.1844        | 0.5    | 9000  | 0.6174          | 55.0100 | 19.8557 |
-| 0.1742        | 0.5556 | 10000 | 0.6142          | 53.9744 | 19.0535 |
-| 0.1408        | 0.6111 | 11000 | 0.6009          | 53.3721 | 19.1199 |
-| 0.1175        | 0.6667 | 12000 | 0.6040          | 52.5679 | 18.4452 |
-| 0.1441        | 0.7222 | 13000 | 0.6043          | 53.1298 | 18.8023 |
-| 0.1245        | 0.7778 | 14000 | 0.5880          | 52.0519 | 18.2682 |
-| 0.1189        | 0.8333 | 15000 | 0.5865          | 51.5103 | 18.1251 |
-| 0.1065        | 0.8889 | 16000 | 0.5863          | 51.3964 | 17.9292 |
-| 0.1266        | 0.9444 | 17000 | 0.5839          | 51.2275 | 17.9617 |
-| 0.1145        | 1.0    | 18000 | 0.5877          | 51.4919 | 17.9569 |
 ### Framework versions
@@ -96,16 +98,3 @@ The following hyperparameters were used during training:
 - Pytorch 2.5.1+cu121
 - Datasets 3.6.0
 - Tokenizers 0.21.0
-## Citation
-Please cite the model using the following BibTeX entry:
-```bibtex
-@misc{deepdml/whisper-tiny-ar-mix-norm,
-      title={Fine-tuned Whisper tiny ASR model for speech recognition in Arabic},
-      author={Jimenez, David},
-      howpublished={\url{https://huggingface.co/deepdml/whisper-tiny-ar-mix-norm}},
-      year={2026}
-    }
-```

 tags:
 - generated_from_trainer
 datasets:
 - pain/MASC
 - google/fleurs
+- deepdml/Tunisian_MSA
+- deepdml/mtedx
 - ymoslem/MediaSpeech
+- UBC-NLP/Casablanca
+- fixie-ai/common_voice_17_0
 metrics:
 - wer
 model-index:
       type: automatic-speech-recognition
     dataset:
       name: Common Voice 17.0
+      type: pain/MASC
     metrics:
     - name: Wer
       type: wer
+      value: 52.36224086961312
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5962
+- Wer: 52.3622
+- Cer: 18.7245
 ## Model description
 | Training Loss | Epoch  | Step  | Validation Loss | Wer     | Cer     |
 |:-------------:|:------:|:-----:|:---------------:|:-------:|:-------:|
+| 1.1234        | 0.0556 | 1000  | 0.8106          | 67.0994 | 25.5658 |
+| 0.8957        | 0.1111 | 2000  | 0.7234          | 62.5310 | 23.3911 |
+| 0.7114        | 0.1667 | 3000  | 0.6871          | 59.7675 | 21.8722 |
+| 0.6346        | 0.2222 | 4000  | 0.6637          | 58.1976 | 20.7336 |
+| 0.4961        | 0.2778 | 5000  | 0.6545          | 57.7404 | 20.7048 |
+| 0.4354        | 0.3333 | 6000  | 0.6473          | 56.7948 | 20.2061 |
+| 0.3924        | 0.3889 | 7000  | 0.6325          | 55.8400 | 20.0139 |
+| 0.3466        | 0.4444 | 8000  | 0.6274          | 55.4176 | 20.1441 |
+| 0.2979        | 0.5    | 9000  | 0.6206          | 54.6997 | 19.6005 |
+| 0.3099        | 0.5556 | 10000 | 0.6150          | 54.0166 | 19.3231 |
+| 0.2681        | 0.6111 | 11000 | 0.6120          | 53.5980 | 19.1106 |
+| 0.2383        | 0.6667 | 12000 | 0.6113          | 53.5576 | 19.4238 |
+| 0.2582        | 0.7222 | 13000 | 0.6060          | 52.7515 | 18.7573 |
+| 0.1543        | 0.7778 | 14000 | 0.6018          | 52.6175 | 18.4895 |
+| 0.2356        | 0.8333 | 15000 | 0.6023          | 52.9902 | 18.9782 |
+| 0.2031        | 0.8889 | 16000 | 0.5984          | 52.5165 | 18.8550 |
+| 0.2437        | 0.9444 | 17000 | 0.5951          | 52.4926 | 18.7514 |
+| 0.2269        | 1.0    | 18000 | 0.5962          | 52.3622 | 18.7245 |
 ### Framework versions
 - Pytorch 2.5.1+cu121
 - Datasets 3.6.0
 - Tokenizers 0.21.0