mouseyy/result_data-1

Automatic Speech Recognition

Generated from Trainer

Eval Results (legacy)

mouseyy committed on Mar 17, 2025

Commit ba8c43f (verified) · 1 Parent(s): d3f5dab

Model save

Files changed (1)

README.md +16 -10

@@ -23,7 +23,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 1.0
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,9 +33,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_17_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 17.1658
-- Wer: 1.0
-- Cer: 1.2793
 ## Model description
@@ -54,7 +54,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3.793766869035321e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -64,15 +64,21 @@ The following hyperparameters were used during training:
 - total_eval_batch_size: 32
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 222
-- training_steps: 10
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss | Wer | Cer    |
-|:-------------:|:------:|:----:|:---------------:|:---:|:------:|
-| No log        | 0.0055 | 6    | 17.2325         | 1.0 | 1.3514 |
 ### Framework versions

     metrics:
     - name: Wer
       type: wer
+      value: 0.3637135578828191
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_17_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2233
+- Wer: 0.3637
+- Cer: 0.1700
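Note on the metrics: the card does not say how Wer and Cer were computed, so the following is only a minimal sketch of the usual definitions (edit distance divided by reference length, over words for WER and characters for CER). The function names `edit_distance`, `wer`, and `cer` are illustrative, not taken from this repository. It also shows why the previous revision could report a Cer above 1.0 (1.2793): insertions count as errors, so a hypothesis much longer than the reference pushes the ratio past 1.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))  # distance from empty ref to each hyp prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to empty hyp
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference token
                curr[j - 1] + 1,     # insertion of a hypothesis token
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate: word-level edits / number of reference words."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate: character-level edits / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(list(reference), list(hypothesis)) / len(reference)
```

Under these definitions a Wer of 0.3637 corresponds to roughly 36 word-level edits per 100 reference words. Real evaluation pipelines typically normalize text (casing, punctuation) first, which this sketch leaves out.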
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 6.532628754904162e-05
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - total_eval_batch_size: 32
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 206
+- num_epochs: 7.0
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Wer    | Cer    |
+|:-------------:|:------:|:----:|:---------------:|:------:|:------:|
+| 0.6324        | 0.9099 | 1000 | 0.5004          | 0.6083 | 0.2381 |
+| 0.3497        | 1.8198 | 2000 | 0.3087          | 0.4650 | 0.1965 |
+| 0.2642        | 2.7298 | 3000 | 0.2636          | 0.4249 | 0.1841 |
+| 0.2328        | 3.6397 | 4000 | 0.2431          | 0.3960 | 0.1789 |
+| 0.1933        | 4.5496 | 5000 | 0.2289          | 0.3773 | 0.1732 |
+| 0.1783        | 5.4595 | 6000 | 0.2300          | 0.3728 | 0.1711 |
+| 0.1617        | 6.3694 | 7000 | 0.2233          | 0.3637 | 0.1700 |
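Note on the schedule: the new hyperparameters (linear scheduler, 206 warmup steps, 7 epochs) together with the step and epoch columns above are enough to estimate the learning rate at each logged step. The sketch below is an approximation under stated assumptions: steps per epoch is inferred from the first table row (step 1000 at epoch 0.9099), the total step count is that figure times `num_epochs`, and the schedule is the standard linear warmup followed by linear decay to zero. The constant and function names are illustrative.

```python
PEAK_LR = 6.532628754904162e-05  # learning_rate from the card
WARMUP_STEPS = 206               # lr_scheduler_warmup_steps from the card
NUM_EPOCHS = 7.0                 # num_epochs from the card

# Assumption: inferred from the table row "epoch 0.9099 at step 1000".
STEPS_PER_EPOCH = round(1000 / 0.9099)
# Assumption: the run lasts exactly NUM_EPOCHS * STEPS_PER_EPOCH optimizer steps.
TOTAL_STEPS = round(NUM_EPOCHS * STEPS_PER_EPOCH)


def linear_lr(step, peak_lr=PEAK_LR, warmup=WARMUP_STEPS, total=TOTAL_STEPS):
    """Learning rate at `step` for linear warmup then linear decay to zero."""
    if step < warmup:
        return peak_lr * step / max(1, warmup)
    return peak_lr * max(0.0, (total - step) / max(1, total - warmup))


if __name__ == "__main__":
    # Estimated learning rate at each evaluation step in the table.
    for step in range(1000, 8000, 1000):
        print(f"step {step}: lr ~ {linear_lr(step):.3e}")
```

If the estimate holds, the last logged evaluation (step 7000) happens close to the end of the decay, which is consistent with the validation loss flattening out between steps 5000 and 7000.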
 ### Framework versions