mouseyy committed
Commit 25edfe0 · verified
1 Parent(s): 43239f3

Model save

Files changed (1)
  1. README.md +14 -7
README.md CHANGED
@@ -23,7 +23,7 @@ model-index:
  metrics:
  - name: Wer
  type: wer
- value: 0.36597792244551375
+ value: 0.3492782337956411
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,9 +33,9 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [mouseyy/result_data-1](https://huggingface.co/mouseyy/result_data-1) on the common_voice_17_0 dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.2241
- - Wer: 0.3660
- - Cer: 0.1696
+ - Loss: 0.2371
+ - Wer: 0.3493
+ - Cer: 0.1667
 
  ## Model description
 
@@ -54,7 +54,7 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - learning_rate: 5.0181182371213955e-05
+ - learning_rate: 1.3065956368514577e-05
  - train_batch_size: 16
  - eval_batch_size: 16
  - seed: 42
@@ -64,12 +64,19 @@ The following hyperparameters were used during training:
  - total_eval_batch_size: 32
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
  - lr_scheduler_type: linear
- - lr_scheduler_warmup_steps: 96
- - training_steps: 10
+ - lr_scheduler_warmup_steps: 184
+ - num_epochs: 5.0
  - mixed_precision_training: Native AMP
 
  ### Training results
 
+ | Training Loss | Epoch | Step | Validation Loss | Wer | Cer |
+ |:-------------:|:------:|:----:|:---------------:|:------:|:------:|
+ | 0.1695 | 0.9099 | 1000 | 0.2457 | 0.3725 | 0.1714 |
+ | 0.1502 | 1.8198 | 2000 | 0.2344 | 0.3640 | 0.1704 |
+ | 0.1287 | 2.7298 | 3000 | 0.2321 | 0.3617 | 0.1691 |
+ | 0.1248 | 3.6397 | 4000 | 0.2387 | 0.3561 | 0.1665 |
+ | 0.1127 | 4.5496 | 5000 | 0.2371 | 0.3493 | 0.1667 |
 
 
  ### Framework versions
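
Reviewer note: the new README lists `lr_scheduler_type: linear` with `lr_scheduler_warmup_steps: 184` and `learning_rate: 1.3065956368514577e-05`. The sketch below shows the shape such a schedule usually takes: a linear ramp from zero to the base rate during warmup, then a linear decay to zero. It is an illustration, not the Trainer's own code. The function name `linear_schedule_lr` is hypothetical, and `TOTAL_STEPS` is an assumption: the commit does not state it, so it is estimated from the results table (step 1000 falls at epoch 0.9099, which gives about 1099 steps per epoch, or about 5495 steps over 5 epochs).

```python
def linear_schedule_lr(step, base_lr, warmup_steps, total_steps):
    """Learning rate at `step`: linear warmup to base_lr, then linear decay to 0."""
    if step < warmup_steps:
        # Warmup phase: scale the base rate by the fraction of warmup completed.
        return base_lr * step / max(1, warmup_steps)
    # Decay phase: scale by the fraction of post-warmup steps still remaining.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


BASE_LR = 1.3065956368514577e-05  # learning_rate from the new README
WARMUP_STEPS = 184                # lr_scheduler_warmup_steps from the new README
TOTAL_STEPS = 5495                # ASSUMPTION: estimated from the table, not stated in the commit

if __name__ == "__main__":
    # Print the rate at the logged evaluation steps from the results table.
    for step in (1000, 2000, 3000, 4000, 5000):
        lr = linear_schedule_lr(step, BASE_LR, WARMUP_STEPS, TOTAL_STEPS)
        print(f"step {step}: lr = {lr:.3e}")
```

Under these assumptions the rate peaks at step 184 and is already decaying at every logged evaluation step, so the later rows of the table were trained at progressively smaller learning rates.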