deepdml/whisper-tiny-ta-mix-norm

@@ -6,10 +6,11 @@ base_model: openai/whisper-tiny
 tags:
 - generated_from_trainer
 datasets:
 - google/fleurs
-- deepdml/iisc-mile-tamil-asr
 - fixie-ai/common_voice_17_0
-- deepdml/microsoft-speech-corpus-indian
 metrics:
 - wer
 model-index:
@@ -20,12 +21,13 @@ model-index:
       type: automatic-speech-recognition
     dataset:
       name: Common Voice 17.0
-      type: google/fleurs
     metrics:
     - name: Wer
       type: wer
-      value: 51.23597531913797
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
@@ -33,9 +35,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2638
-- Wer: 51.2360
-- Cer: 11.6333
 ## Model description
@@ -65,16 +67,16 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Wer     | Cer     |
-|:-------------:|:------:|:----:|:---------------:|:-------:|:-------:|
-| 0.2297        | 0.125  | 1000 | 0.3368          | 61.1882 | 15.1244 |
-| 0.151         | 0.25   | 2000 | 0.3126          | 57.7102 | 13.8047 |
-| 0.1576        | 0.375  | 3000 | 0.2900          | 55.1687 | 13.0734 |
-| 0.0818        | 0.5    | 4000 | 0.2829          | 53.7672 | 12.5394 |
-| 0.1168        | 0.625  | 5000 | 0.2713          | 53.2661 | 12.2708 |
-| 0.0752        | 0.75   | 6000 | 0.2706          | 52.0746 | 11.7892 |
-| 0.1097        | 0.875  | 7000 | 0.2647          | 51.7925 | 11.8749 |
-| 0.0836        | 1.0961 | 8000 | 0.2638          | 51.2360 | 11.6333 |
 ### Framework versions
@@ -83,16 +85,3 @@ The following hyperparameters were used during training:
 - Pytorch 2.3.0+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1
-## Citation
-Please cite the model using the following BibTeX entry:
-```bibtex
-@misc{deepdml/whisper-tiny-ta-mix-norm,
-      title={Fine-tuned Whisper tiny ASR model for speech recognition in Tamil},
-      author={Jimenez, David},
-      howpublished={\url{https://huggingface.co/deepdml/whisper-tiny-ta-mix-norm}},
-      year={2026}
-    }
-```

 tags:
 - generated_from_trainer
 datasets:
+- deepdml/microsoft-speech-corpus-indian
 - google/fleurs
 - fixie-ai/common_voice_17_0
+- ai4bharat/Kathbath
+- deepdml/iisc-mile-tamil-asr
 metrics:
 - wer
 model-index:
       type: automatic-speech-recognition
     dataset:
       name: Common Voice 17.0
+      type: deepdml/microsoft-speech-corpus-indian
     metrics:
     - name: Wer
       type: wer
+      value: 50.94614264919942
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice 17.0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2583
+- Wer: 50.9461
+- Cer: 11.6470
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     | Cer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| 0.2315        | 0.125 | 1000 | 0.3384          | 62.1530 | 15.6705 |
+| 0.1623        | 0.25  | 2000 | 0.2973          | 56.7273 | 13.4770 |
+| 0.1716        | 0.375 | 3000 | 0.2856          | 55.2872 | 13.0946 |
+| 0.1572        | 0.5   | 4000 | 0.2676          | 52.6516 | 12.2552 |
+| 0.1475        | 0.625 | 5000 | 0.2650          | 52.0655 | 12.0396 |
+| 0.1656        | 0.75  | 6000 | 0.2610          | 51.5322 | 11.9197 |
+| 0.1048        | 0.875 | 7000 | 0.2561          | 50.7993 | 11.4955 |
+| 0.1166        | 1.0   | 8000 | 0.2583          | 50.9461 | 11.6470 |
 ### Framework versions
 - Pytorch 2.3.0+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1
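
The added results table can be checked mechanically. The following is a minimal sketch that is not part of the commit: it parses the new rows and picks the step with the lowest WER. The helper name `parse_results` is hypothetical. In the table as given, step 7000 (WER 50.7993) scores slightly better than the final step 8000 (WER 50.9461) that the card reports.

```python
# Rows copied verbatim from the added "Training results" table in this diff.
TABLE = """
| Training Loss | Epoch | Step | Validation Loss | Wer     | Cer     |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
| 0.2315        | 0.125 | 1000 | 0.3384          | 62.1530 | 15.6705 |
| 0.1623        | 0.25  | 2000 | 0.2973          | 56.7273 | 13.4770 |
| 0.1716        | 0.375 | 3000 | 0.2856          | 55.2872 | 13.0946 |
| 0.1572        | 0.5   | 4000 | 0.2676          | 52.6516 | 12.2552 |
| 0.1475        | 0.625 | 5000 | 0.2650          | 52.0655 | 12.0396 |
| 0.1656        | 0.75  | 6000 | 0.2610          | 51.5322 | 11.9197 |
| 0.1048        | 0.875 | 7000 | 0.2561          | 50.7993 | 11.4955 |
| 0.1166        | 1.0   | 8000 | 0.2583          | 50.9461 | 11.6470 |
"""


def parse_results(table: str) -> list[dict]:
    """Hypothetical helper: turn a Markdown table into a list of float-valued rows."""
    lines = [ln.strip() for ln in table.strip().splitlines()]
    header = [cell.strip() for cell in lines[0].strip("|").split("|")]
    rows = []
    for ln in lines[2:]:  # skip the header row and the alignment row
        cells = [cell.strip() for cell in ln.strip("|").split("|")]
        rows.append({key: float(value) for key, value in zip(header, cells)})
    return rows


rows = parse_results(TABLE)
best = min(rows, key=lambda r: r["Wer"])  # checkpoint with the lowest WER
final = rows[-1]                          # checkpoint the card reports

print(f"best step:  {int(best['Step'])}, WER {best['Wer']:.4f}")
print(f"final step: {int(final['Step'])}, WER {final['Wer']:.4f}")
```

Swapping `"Wer"` for `"Cer"` or `"Validation Loss"` in the `min` call ranks the checkpoints by the other columns.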

model.safetensors

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:42fbdd6bd9b1d8b875600d238924636b7c44f2d1b290fef0e565ce482bbdb5e2
 size 151061672

 version https://git-lfs.github.com/spec/v1
+oid sha256:79f57b39a899c264ade9bb215625da3d93e4ebeda72fc7ff68d1b3a76f7065ee
 size 151061672
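
The two pointer files above differ only in their `oid`, so the weights changed while the file size stayed at 151061672 bytes. As a hedged sketch (not something this commit ships), a downloaded file could be checked against the new pointer as follows. The function names `parse_pointer` and `matches_pointer` are hypothetical, and the demo uses an in-memory payload rather than the real weights.

```python
import hashlib
import io


def parse_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file ("key value" per line) into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(stream, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Hash a binary stream in chunks and compare size and digest to the pointer."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        hasher.update(chunk)
        total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# New pointer, copied from the added side of this diff.
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:79f57b39a899c264ade9bb215625da3d93e4ebeda72fc7ff68d1b3a76f7065ee
size 151061672
"""
pointer = parse_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])

# Demo on a small in-memory payload (an assumption standing in for the real file).
payload = b"example weights"
demo_pointer = {
    "algo": "sha256",
    "digest": hashlib.sha256(payload).hexdigest(),
    "size": len(payload),
}
print(matches_pointer(io.BytesIO(payload), demo_pointer))
```

For a real check, open the downloaded `model.safetensors` in binary mode and pass the file object together with `pointer` to `matches_pointer`.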