mouseyy commited on
Commit
fc8ea62
·
verified ·
1 Parent(s): 7c2585f

Model save

Browse files
Files changed (1) hide show
  1. README.md +14 -7
README.md CHANGED
@@ -23,7 +23,7 @@ model-index:
23
  metrics:
24
  - name: Wer
25
  type: wer
26
- value: 0.36626096801585056
27
  ---
28
 
29
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,9 +33,9 @@ should probably proofread and complete it, then remove this comment. -->
33
 
34
  This model is a fine-tuned version of [mouseyy/result_data-1](https://huggingface.co/mouseyy/result_data-1) on the common_voice_17_0 dataset.
35
  It achieves the following results on the evaluation set:
36
- - Loss: 0.2237
37
- - Wer: 0.3663
38
- - Cer: 0.1697
39
 
40
  ## Model description
41
 
@@ -54,7 +54,7 @@ More information needed
54
  ### Training hyperparameters
55
 
56
  The following hyperparameters were used during training:
57
- - learning_rate: 2.1500224030495643e-05
58
  - train_batch_size: 16
59
  - eval_batch_size: 16
60
  - seed: 42
@@ -64,12 +64,19 @@ The following hyperparameters were used during training:
64
  - total_eval_batch_size: 32
65
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
66
  - lr_scheduler_type: linear
67
- - lr_scheduler_warmup_steps: 282
68
- - training_steps: 10
69
  - mixed_precision_training: Native AMP
70
 
71
  ### Training results
72
 
 
 
 
 
 
 
 
73
 
74
 
75
  ### Framework versions
 
23
  metrics:
24
  - name: Wer
25
  type: wer
26
+ value: 0.3560713274837249
27
  ---
28
 
29
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
33
 
34
  This model is a fine-tuned version of [mouseyy/result_data-1](https://huggingface.co/mouseyy/result_data-1) on the common_voice_17_0 dataset.
35
  It achieves the following results on the evaluation set:
36
+ - Loss: 0.2354
37
+ - Wer: 0.3561
38
+ - Cer: 0.1687
39
 
40
  ## Model description
41
 
 
54
  ### Training hyperparameters
55
 
56
  The following hyperparameters were used during training:
57
+ - learning_rate: 1.7029909432213465e-05
58
  - train_batch_size: 16
59
  - eval_batch_size: 16
60
  - seed: 42
 
64
  - total_eval_batch_size: 32
65
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
66
  - lr_scheduler_type: linear
67
+ - lr_scheduler_warmup_steps: 95
68
+ - num_epochs: 5.0
69
  - mixed_precision_training: Native AMP
70
 
71
  ### Training results
72
 
73
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
74
+ |:-------------:|:------:|:----:|:---------------:|:------:|:------:|
75
+ | 0.2145 | 0.9099 | 1000 | 0.2450 | 0.3677 | 0.1717 |
76
+ | 0.2083 | 1.8198 | 2000 | 0.2324 | 0.3657 | 0.1708 |
77
+ | 0.1853 | 2.7298 | 3000 | 0.2309 | 0.3583 | 0.1682 |
78
+ | 0.1872 | 3.6397 | 4000 | 0.2347 | 0.3558 | 0.1689 |
79
+ | 0.17 | 4.5496 | 5000 | 0.2354 | 0.3561 | 0.1687 |
80
 
81
 
82
  ### Framework versions