adrianSauer
/

wav2vec2-wer

@@ -19,12 +19,12 @@ model-index:
       name: Common Voice 16
       type: mozilla-foundation/common_voice_16_1
       config: gn
-      split: test
       args: gn
     metrics:
     - name: Wer
       type: wer
-      value: 43.723554301833566
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -34,8 +34,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3202
-- Wer: 43.7236
 ## Model description
@@ -55,9 +55,11 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
 - lr_scheduler_warmup_steps: 50
@@ -68,16 +70,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Wer     |
 |:-------------:|:------:|:----:|:---------------:|:-------:|
-| 0.4174        | 1.0101 | 100  | 0.3535          | 47.6728 |
-| 0.3411        | 2.0202 | 200  | 0.3387          | 46.3188 |
-| 0.2905        | 3.0303 | 300  | 0.3278          | 45.7546 |
-| 0.2591        | 4.0404 | 400  | 0.3214          | 44.6544 |
-| 0.251         | 5.0505 | 500  | 0.3202          | 43.7236 |
 ### Framework versions
-- Transformers 4.40.1
-- Pytorch 2.2.1+cu121
-- Datasets 2.19.0
 - Tokenizers 0.19.1

       name: Common Voice 16
       type: mozilla-foundation/common_voice_16_1
       config: gn
+      split: None
       args: gn
     metrics:
     - name: Wer
       type: wer
+      value: 49.766822118587605
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [glob-asr/wav2vec2-large-xls-r-300m-guarani-small](https://huggingface.co/glob-asr/wav2vec2-large-xls-r-300m-guarani-small) on the Common Voice 16 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3513
+- Wer: 49.7668
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant_with_warmup
 - lr_scheduler_warmup_steps: 50
 | Training Loss | Epoch  | Step | Validation Loss | Wer     |
 |:-------------:|:------:|:----:|:---------------:|:-------:|
+| 0.4171        | 1.0152 | 100  | 0.3798          | 55.2965 |
+| 0.3376        | 2.0305 | 200  | 0.3628          | 53.8974 |
+| 0.294         | 3.0457 | 300  | 0.3528          | 52.4983 |
+| 0.2632        | 4.0609 | 400  | 0.3484          | 49.7668 |
+| 0.2459        | 5.0761 | 500  | 0.3513          | 49.7668 |
 ### Framework versions
+- Transformers 4.44.0
+- Pytorch 2.3.1+cu121
+- Datasets 2.21.0
 - Tokenizers 0.19.1