Leotrim/whisper-small-dv

@@ -1,40 +1,42 @@
 ---
 license: apache-2.0
-base_model: openai/whisper-tiny
 tags:
 - generated_from_trainer
 datasets:
-- PolyAI/minds14
 metrics:
 - wer
 model-index:
-- name: whisper-small-dv
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: PolyAI/minds14
-      type: PolyAI/minds14
-      config: en-US
-      split: train
-      args: en-US
     metrics:
     - name: Wer
       type: wer
-      value: 35.6091030789826
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# whisper-small-dv
-This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the PolyAI/minds14 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7160
-- Wer Ortho: 36.2369
-- Wer: 35.6091
 ## Model description
@@ -62,15 +64,15 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
 - lr_scheduler_warmup_steps: 50
-- training_steps: 200
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Wer Ortho | Wer     |
-|:-------------:|:-------:|:----:|:---------------:|:---------:|:-------:|
-| 0.2296        | 7.1429  | 100  | 0.5760          | 34.1463   | 33.7349 |
-| 0.0048        | 14.2857 | 200  | 0.7160          | 36.2369   | 35.6091 |
 ### Framework versions

 ---
+language:
+- dv
 license: apache-2.0
+base_model: openai/whisper-small
 tags:
 - generated_from_trainer
 datasets:
+- mozilla-foundation/common_voice_13_0
 metrics:
 - wer
 model-index:
+- name: Whisper-Small-Dv-fine-tuned
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
+      name: Common Voice 13
+      type: mozilla-foundation/common_voice_13_0
+      config: dv
+      split: test
+      args: dv
     metrics:
     - name: Wer
       type: wer
+      value: 12.868171227874953
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# Whisper-Small-Dv-fine-tuned
+This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 13 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1666
+- Wer Ortho: 60.2479
+- Wer: 12.8682
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
 - lr_scheduler_warmup_steps: 50
+- training_steps: 500
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Wer Ortho | Wer     |
+|:-------------:|:------:|:----:|:---------------:|:---------:|:-------:|
+| 0.1858        | 1.6313 | 250  | 0.2034          | 69.6915   | 15.8101 |
+| 0.0747        | 3.2626 | 500  | 0.1666          | 60.2479   | 12.8682 |
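
The table reports two word error rates per checkpoint. The card does not say how they were computed, so the following is only a minimal sketch of the usual definition: word-level edit distance divided by the number of reference words, expressed in percent. It assumes that "Wer Ortho" scores the raw text and "Wer" scores text after normalization. `simple_normalize` is a hypothetical stand-in for whatever normalizer the training script used, and the example sentences are made up for illustration.

```python
import string


def edit_distance(ref_words, hyp_words):
    """Levenshtein distance between two word lists (substitute/insert/delete cost 1)."""
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def word_error_rate(references, predictions):
    """Corpus-level WER in percent: total word edits / total reference words."""
    total_edits = 0
    total_words = 0
    for ref, hyp in zip(references, predictions):
        ref_words, hyp_words = ref.split(), hyp.split()
        total_edits += edit_distance(ref_words, hyp_words)
        total_words += len(ref_words)
    return 100.0 * total_edits / total_words


def simple_normalize(text):
    """Hypothetical normalizer (assumption): lowercase and drop ASCII punctuation."""
    return text.lower().translate(str.maketrans("", "", string.punctuation)).strip()


# Made-up example sentences, not taken from Common Voice.
references = ["Hello, world!", "How are you?"]
predictions = ["hello world", "how are you"]

wer_ortho = word_error_rate(references, predictions)
wer_normalized = word_error_rate(
    [simple_normalize(r) for r in references],
    [simple_normalize(p) for p in predictions],
)
```

In this toy case the predictions differ from the references only in casing and punctuation, so the orthographic score is high while the normalized score is zero. A similar effect may explain part of the gap between the Wer Ortho and Wer columns above, though the card itself does not confirm this.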
 ### Framework versions

generation_config.json CHANGED

@@ -1,27 +1,43 @@
 {
   "alignment_heads": [
     [
-      2,
-      2
     ],
     [
-      3,
       0
     ],
     [
-      3,
-      2
     ],
     [
-      3,
-      3
     ],
     [
-      3,
-      4
     ],
     [
-      3,
       5
     ]
   ],

 {
   "alignment_heads": [
     [
+      5,
+      3
+    ],
+    [
+      5,
+      9
     ],
     [
+      8,
       0
     ],
     [
+      8,
+      4
     ],
     [
+      8,
+      7
     ],
     [
+      8,
+      8
+    ],
+    [
+      9,
+      0
+    ],
+    [
+      9,
+      7
+    ],
+    [
+      9,
+      9
     ],
     [
+      10,
       5
     ]
   ],
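
The `alignment_heads` change is consistent with the base model switching from whisper-tiny to whisper-small. As a rough sanity check, the sketch below reads each entry as a `[decoder_layer, attention_head]` pair and verifies that it fits inside the model's decoder. The layer and head counts are assumptions taken from the public Whisper configurations (4 layers and 6 heads for tiny, 12 layers and 12 heads for small), not from this diff, and `out_of_range` is a hypothetical helper.

```python
# Assumed (decoder_layers, decoder_attention_heads) per base model.
ASSUMED_DECODER_SHAPES = {
    "openai/whisper-tiny": (4, 6),
    "openai/whisper-small": (12, 12),
}

# Pairs as they appear on the removed and added sides of the diff.
OLD_HEADS = [[2, 2], [3, 0], [3, 2], [3, 3], [3, 4], [3, 5]]
NEW_HEADS = [[5, 3], [5, 9], [8, 0], [8, 4], [8, 7],
             [8, 8], [9, 0], [9, 7], [9, 9], [10, 5]]


def out_of_range(heads, num_layers, num_heads):
    """Return the [layer, head] pairs that do not exist in a decoder of this size."""
    return [
        pair for pair in heads
        if not (0 <= pair[0] < num_layers and 0 <= pair[1] < num_heads)
    ]


tiny_layers, tiny_heads = ASSUMED_DECODER_SHAPES["openai/whisper-tiny"]
small_layers, small_heads = ASSUMED_DECODER_SHAPES["openai/whisper-small"]

old_bad = out_of_range(OLD_HEADS, tiny_layers, tiny_heads)
new_bad = out_of_range(NEW_HEADS, small_layers, small_heads)
# The new pairs would not fit the old, smaller decoder.
new_on_tiny = out_of_range(NEW_HEADS, tiny_layers, tiny_heads)
```

Under these assumptions both lists are valid for their own base model, while every new pair falls outside the whisper-tiny decoder, which is why the generation config had to change along with `base_model`.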