Commit 90fe690 · Parent: 38cfbcc
update model card README.md
README.md CHANGED

@@ -1,8 +1,7 @@
 ---
+license: cc-by-nc-4.0
 tags:
 - generated_from_trainer
-datasets:
-- common_voice
 model-index:
 - name: test_bug2
   results: []
@@ -13,7 +12,10 @@ should probably proofread and complete it, then remove this comment. -->
 
 # test_bug2
 
-This model is a fine-tuned version of [
+This model is a fine-tuned version of [nguyenvulebinh/wav2vec2-base-vietnamese-250h](https://huggingface.co/nguyenvulebinh/wav2vec2-base-vietnamese-250h) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.3549
+- Wer: 0.2334
 
 ## Model description
 
@@ -33,24 +35,44 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
-- train_batch_size:
+- train_batch_size: 4
 - eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps:
-- total_train_batch_size:
+- gradient_accumulation_steps: 4
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs:
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 
 ### Training results
 
+| Training Loss | Epoch | Step | Validation Loss | Wer    |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.6139        | 0.27  | 50   | 0.4498          | 0.2591 |
+| 0.3695        | 0.53  | 100  | 0.3626          | 0.2381 |
+| 0.3065        | 0.8   | 150  | 0.3484          | 0.2281 |
+| 0.268         | 1.07  | 200  | 0.3606          | 0.2488 |
+| 0.282         | 1.34  | 250  | 0.3440          | 0.2409 |
+| 0.2688        | 1.6   | 300  | 0.3707          | 0.2459 |
+| 0.2683        | 1.87  | 350  | 0.3736          | 0.2474 |
+| 0.2599        | 2.14  | 400  | 0.4010          | 0.2664 |
+| 0.2683        | 2.41  | 450  | 0.3890          | 0.2627 |
+| 0.2623        | 2.67  | 500  | 0.4109          | 0.2790 |
+| 0.2633        | 2.94  | 550  | 0.4251          | 0.2800 |
+| 0.2431        | 3.21  | 600  | 0.4424          | 0.2941 |
+| 0.2263        | 3.48  | 650  | 0.4179          | 0.2677 |
+| 0.2268        | 3.74  | 700  | 0.4049          | 0.2715 |
+| 0.1965        | 4.01  | 750  | 0.3953          | 0.2599 |
+| 0.1851        | 4.28  | 800  | 0.3549          | 0.2467 |
+| 0.1724        | 4.54  | 850  | 0.3586          | 0.2450 |
+| 0.1587        | 4.81  | 900  | 0.3549          | 0.2334 |
 
 
 ### Framework versions
 
-- Transformers 4.
-- Pytorch 1.
+- Transformers 4.16.0
+- Pytorch 1.13.1+cu116
 - Datasets 1.18.3
 - Tokenizers 0.12.1
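The updated card describes a Vietnamese ASR checkpoint fine-tuned from nguyenvulebinh/wav2vec2-base-vietnamese-250h. As a minimal inference sketch, assuming the checkpoint exposes a standard Wav2Vec2 processor and CTC head (the repo id `your-username/test_bug2` is a placeholder, not the real location):

```python
# Minimal inference sketch for a checkpoint like the one documented above.
# Assumptions: standard Wav2Vec2 processor + CTC head on the hub repo, and
# "your-username/test_bug2" as a placeholder repo id.
import torch
import librosa
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

processor = Wav2Vec2Processor.from_pretrained("your-username/test_bug2")
model = Wav2Vec2ForCTC.from_pretrained("your-username/test_bug2")

# wav2vec2 models expect 16 kHz mono input
speech, _ = librosa.load("sample.wav", sr=16_000, mono=True)
inputs = processor(speech, sampling_rate=16_000, return_tensors="pt")

with torch.no_grad():
    logits = model(inputs.input_values).logits

# Greedy CTC decoding; an external language model could lower WER further
predicted_ids = torch.argmax(logits, dim=-1)
print(processor.batch_decode(predicted_ids)[0])
```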
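The hyperparameter list in the diff maps almost one-to-one onto `transformers.TrainingArguments`. A sketch of the corresponding configuration, where the output directory is an assumption, Adam with betas=(0.9, 0.999) and epsilon=1e-08 is the Trainer default, and `fp16=True` stands in for "Native AMP" mixed precision:

```python
# Sketch of a TrainingArguments config matching the card's hyperparameters.
# output_dir is an assumption; the optimizer line in the card is the default.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="test_bug2",
    learning_rate=3e-4,
    per_device_train_batch_size=4,
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=4,  # effective train batch size: 4 * 4 = 16
    num_train_epochs=5,
    lr_scheduler_type="linear",
    warmup_steps=500,
    seed=42,
    fp16=True,  # mixed_precision_training: Native AMP
)
```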
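The Wer column is word error rate: the word-level edit distance between prediction and reference divided by the number of reference words. With Datasets 1.18.3 (listed in the framework versions) it can be computed with the bundled `wer` metric; the sentences below are illustrative only, not from the evaluation set:

```python
# Word error rate as reported in the results table, computed with the "wer"
# metric shipped with datasets 1.18. Example strings are made up.
from datasets import load_metric

wer_metric = load_metric("wer")
score = wer_metric.compute(
    predictions=["xin chào các bạn"],
    references=["xin chào tất cả các bạn"],
)
print(score)  # 2 deleted words / 6 reference words = 0.333...
```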