mozilla-ai/whisper-small-sv

@@ -23,7 +23,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 20.14691254112786
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,9 +33,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_17_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3111
 - Model Preparation Time: 0.0044
-- Wer: 20.1469
 ## Model description
@@ -55,44 +55,31 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 258
-- eval_batch_size: 64
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 50
-- training_steps: 1250
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch   | Step | Validation Loss | Model Preparation Time | Wer     |
-|:-------------:|:-------:|:----:|:---------------:|:----------------------:|:-------:|
-| 0.6188        | 0.9804  | 50   | 0.3424          | 0.0044                 | 23.1107 |
-| 0.2417        | 1.9608  | 100  | 0.3089          | 0.0044                 | 21.0881 |
-| 0.1456        | 2.9412  | 150  | 0.3038          | 0.0044                 | 20.7412 |
-| 0.0928        | 3.9216  | 200  | 0.3111          | 0.0044                 | 20.1469 |
-| 0.0554        | 4.9020  | 250  | 0.3231          | 0.0044                 | 20.3739 |
-| 0.035         | 5.8824  | 300  | 0.3367          | 0.0044                 | 20.8432 |
-| 0.0215        | 6.8627  | 350  | 0.3619          | 0.0044                 | 20.8458 |
-| 0.0143        | 7.8431  | 400  | 0.3768          | 0.0044                 | 20.9299 |
-| 0.0101        | 8.8235  | 450  | 0.3880          | 0.0044                 | 20.8509 |
-| 0.0084        | 9.8039  | 500  | 0.3960          | 0.0044                 | 20.8993 |
-| 0.0072        | 10.7843 | 550  | 0.3999          | 0.0044                 | 21.0014 |
-| 0.0059        | 11.7647 | 600  | 0.4069          | 0.0044                 | 20.8942 |
-| 0.0053        | 12.7451 | 650  | 0.4130          | 0.0044                 | 20.9656 |
-| 0.0047        | 13.7255 | 700  | 0.4177          | 0.0044                 | 20.9963 |
-| 0.0043        | 14.7059 | 750  | 0.4208          | 0.0044                 | 20.9478 |
-| 0.004         | 15.6863 | 800  | 0.4241          | 0.0044                 | 21.0371 |
-| 0.0037        | 16.6667 | 850  | 0.4265          | 0.0044                 | 21.0600 |
-| 0.0035        | 17.6471 | 900  | 0.4298          | 0.0044                 | 21.1034 |
-| 0.0034        | 18.6275 | 950  | 0.4317          | 0.0044                 | 21.0983 |
-| 0.0032        | 19.6078 | 1000 | 0.4334          | 0.0044                 | 21.1416 |
-| 0.0031        | 20.5882 | 1050 | 0.4351          | 0.0044                 | 21.1518 |
-| 0.003         | 21.5686 | 1100 | 0.4361          | 0.0044                 | 21.1748 |
-| 0.0029        | 22.5490 | 1150 | 0.4368          | 0.0044                 | 21.1620 |
-| 0.0029        | 23.5294 | 1200 | 0.4374          | 0.0044                 | 21.1799 |
-| 0.0029        | 24.5098 | 1250 | 0.4377          | 0.0044                 | 21.1722 |
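
The removed summary lines (Loss 0.3111, Wer 20.1469) can be checked against the removed table. The sketch below copies the (step, validation loss, WER) columns from the rows above and looks up the minimum-WER, minimum-loss, and final rows. The name `old_run` is made up for this illustration. That the reported numbers match the lowest-WER row is consistent with best-checkpoint selection by WER, but the diff does not state how the checkpoint was chosen, so treat that as an assumption.

```python
# (step, validation loss, WER) copied from the removed training-results table.
old_run = [
    (50, 0.3424, 23.1107), (100, 0.3089, 21.0881), (150, 0.3038, 20.7412),
    (200, 0.3111, 20.1469), (250, 0.3231, 20.3739), (300, 0.3367, 20.8432),
    (350, 0.3619, 20.8458), (400, 0.3768, 20.9299), (450, 0.3880, 20.8509),
    (500, 0.3960, 20.8993), (550, 0.3999, 21.0014), (600, 0.4069, 20.8942),
    (650, 0.4130, 20.9656), (700, 0.4177, 20.9963), (750, 0.4208, 20.9478),
    (800, 0.4241, 21.0371), (850, 0.4265, 21.0600), (900, 0.4298, 21.1034),
    (950, 0.4317, 21.0983), (1000, 0.4334, 21.1416), (1050, 0.4351, 21.1518),
    (1100, 0.4361, 21.1748), (1150, 0.4368, 21.1620), (1200, 0.4374, 21.1799),
    (1250, 0.4377, 21.1722),
]

best_by_wer = min(old_run, key=lambda row: row[2])   # lowest word error rate
best_by_loss = min(old_run, key=lambda row: row[1])  # lowest validation loss
final_row = old_run[-1]                              # last evaluation of the run

print("best WER row: ", best_by_wer)
print("best loss row:", best_by_loss)
print("final row:    ", final_row)
```

In this table the lowest WER (step 200) and the lowest validation loss (step 150) fall on different rows, and both come well before the final step, after which validation loss rises steadily while training loss keeps shrinking.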
 ### Framework versions

     metrics:
     - name: Wer
       type: wer
+      value: 20.88912694161757
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_17_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3054
 - Model Preparation Time: 0.0044
+- Wer: 20.8891
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 64
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 50
+- training_steps: 300
 - mixed_precision_training: Native AMP
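
To make the scheduler settings concrete, here is a minimal sketch of a linear schedule with warmup, using the values listed above (learning rate 1e-05, 50 warmup steps, 300 training steps). It assumes `lr_scheduler_type: linear` means a linear ramp from zero during warmup followed by a linear decay to zero at the last step. The function name `linear_lr` is hypothetical, not part of any library.

```python
def linear_lr(step, base_lr=1e-5, warmup_steps=50, total_steps=300):
    """Learning rate at a given optimizer step under linear warmup and decay."""
    if step < warmup_steps:
        # Warmup: ramp linearly from 0 up to base_lr.
        return base_lr * step / max(1, warmup_steps)
    # Decay: fall linearly from base_lr at the end of warmup to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


for step in (0, 25, 50, 175, 300):
    print(step, linear_lr(step))

# Same step under the previous 1250-step setting, for comparison.
print(300, linear_lr(300, total_steps=1250))
```

Under these assumptions, shortening `training_steps` from 1250 to 300 also compresses the decay: the 300-step run reaches a zero learning rate at step 300, where the 1250-step run would still be at roughly 79% of the peak.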
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Model Preparation Time | Wer     |
+|:-------------:|:------:|:----:|:---------------:|:----------------------:|:-------:|
+| 0.9087        | 0.1232 | 25   | 0.5954          | 0.0044                 | 25.6485 |
+| 0.3769        | 0.2463 | 50   | 0.3614          | 0.0044                 | 23.9728 |
+| 0.3282        | 0.3695 | 75   | 0.3457          | 0.0044                 | 23.8478 |
+| 0.3236        | 0.4926 | 100  | 0.3340          | 0.0044                 | 22.8939 |
+| 0.3075        | 0.6158 | 125  | 0.3260          | 0.0044                 | 22.5853 |
+| 0.2922        | 0.7389 | 150  | 0.3186          | 0.0044                 | 21.8711 |
+| 0.287         | 0.8621 | 175  | 0.3140          | 0.0044                 | 21.6670 |
+| 0.2845        | 0.9852 | 200  | 0.3093          | 0.0044                 | 21.4196 |
+| 0.195         | 1.1084 | 225  | 0.3080          | 0.0044                 | 21.3610 |
+| 0.1679        | 1.2315 | 250  | 0.3079          | 0.0044                 | 21.1059 |
+| 0.1726        | 1.3547 | 275  | 0.3060          | 0.0044                 | 21.0243 |
+| 0.165         | 1.4778 | 300  | 0.3054          | 0.0044                 | 20.8891 |
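
The Epoch and Step columns allow a rough cross-check of the two batch sizes. The sketch below divides step by epoch to get steps per epoch, then multiplies by `train_batch_size` for an upper bound on the number of training examples. It assumes one optimizer step consumes exactly `train_batch_size` examples (no gradient accumulation, single device), which the diff does not confirm. The helper name is made up for this illustration.

```python
def steps_per_epoch(step, epoch):
    """Optimizer steps in one pass over the training set, from one table row."""
    return step / epoch


# Last row of each table: (step, epoch), plus the listed train_batch_size.
new_steps = steps_per_epoch(300, 1.4778)    # added table, batch size 64
old_steps = steps_per_epoch(1250, 24.5098)  # removed table, batch size 258

# The last batch of an epoch may be partial, so these are upper bounds.
new_upper = round(new_steps) * 64
old_upper = round(old_steps) * 258

print(f"new run: ~{new_steps:.1f} steps/epoch, at most {new_upper} examples")
print(f"old run: ~{old_steps:.1f} steps/epoch, at most {old_upper} examples")
```

Both estimates land near 13,000 examples, so the two runs appear to use a training set of the same size. The new run evaluates every 25 steps and stops after about 1.5 epochs, while the old run evaluated every 50 steps over about 24.5 epochs.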
 ### Framework versions

runs/Mar06_11-07-10_gpu-pod/events.out.tfevents.1741266944.gpu-pod.61799.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8c68925fcd35a0c1e19c08ecdbb0a4695280768b061c28d3325c028af49b3477
+size 472
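
The three added lines are a Git LFS pointer, not the TensorBoard event file itself: the repository stores only the spec version, a SHA-256 object ID, and the byte size of the real file. A minimal parsing sketch follows. `parse_lfs_pointer` is a hypothetical helper written for this illustration, not part of Git LFS.

```python
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8c68925fcd35a0c1e19c08ecdbb0a4695280768b061c28d3325c028af49b3477
size 472
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a pointer file into a small dict."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line.strip())
    hash_algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),  # size in bytes of the real file
    }


pointer = parse_lfs_pointer(POINTER)
print(pointer["hash_algo"], pointer["size"])
```

A size of 472 bytes is small for an event log, which suggests this file holds little more than a header or a single evaluation record, though the diff alone cannot confirm its contents.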