adrianSauer
/

wav2vec2-wer-extension

@@ -1,4 +1,5 @@
 ---
 language:
 - gn
 license: apache-2.0
@@ -24,7 +25,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 45.839210155148095
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -34,8 +35,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3327
-- Wer: 45.8392
 ## Model description
@@ -54,7 +55,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
 - train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
@@ -62,24 +63,25 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
-- lr_scheduler_warmup_steps: 50
-- training_steps: 500
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Wer     |
 |:-------------:|:------:|:----:|:---------------:|:-------:|
-| 1.397         | 0.0991 | 100  | 0.3675          | 48.4062 |
-| 1.0613        | 0.1982 | 200  | 0.3604          | 50.5219 |
-| 1.0365        | 0.2973 | 300  | 0.3500          | 48.8575 |
-| 0.9822        | 0.3964 | 400  | 0.3454          | 47.5599 |
-| 0.9197        | 0.4955 | 500  | 0.3327          | 45.8392 |
 ### Framework versions
-- Transformers 4.44.0
 - Pytorch 2.3.1+cu121
 - Datasets 2.21.0
 - Tokenizers 0.19.1

 ---
+library_name: transformers
 language:
 - gn
 license: apache-2.0
     metrics:
     - name: Wer
       type: wer
+      value: 39.84010659560293
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2438
+- Wer: 39.8401
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 5e-05
 - train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
 - total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
+- lr_scheduler_warmup_steps: 3000
+- training_steps: 3000
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Wer     |
 |:-------------:|:------:|:----:|:---------------:|:-------:|
+| 1.2579        | 0.4955 | 500  | 0.3710          | 53.4310 |
+| 0.919         | 0.9911 | 1000 | 0.3295          | 49.9001 |
+| 0.746         | 1.4866 | 1500 | 0.2902          | 45.1033 |
+| 0.6767        | 1.9822 | 2000 | 0.2674          | 43.3711 |
+| 0.574         | 2.4777 | 2500 | 0.2677          | 42.5716 |
+| 0.5485        | 2.9732 | 3000 | 0.2438          | 39.8401 |
 ### Framework versions
+- Transformers 4.44.1
 - Pytorch 2.3.1+cu121
 - Datasets 2.21.0
 - Tokenizers 0.19.1