GodsonNtungi committed
Commit 743e75d · 1 Parent(s): 4b8c4a7

End of training

Files changed (1):
README.md (+14, -14)
README.md CHANGED
@@ -1,6 +1,6 @@
 ---
 license: apache-2.0
-base_model: Akashpb13/Swahili_xlsr
+base_model: GodsonNtungi/asr2
 tags:
 - generated_from_trainer
 model-index:
@@ -13,15 +13,15 @@ should probably proofread and complete it, then remove this comment. -->

 # asr2

-This model is a fine-tuned version of [Akashpb13/Swahili_xlsr](https://huggingface.co/Akashpb13/Swahili_xlsr) on the None dataset.
+This model is a fine-tuned version of [GodsonNtungi/asr2](https://huggingface.co/GodsonNtungi/asr2) on the None dataset.
 It achieves the following results on the evaluation set:
-- eval_loss: 0.2807
-- eval_wer: 0.3257
-- eval_runtime: 10.1611
-- eval_samples_per_second: 19.683
-- eval_steps_per_second: 2.46
-- epoch: 0.32
-- step: 200
+- eval_loss: 0.2567
+- eval_wer: 0.2528
+- eval_runtime: 6.694
+- eval_samples_per_second: 20.914
+- eval_steps_per_second: 2.689
+- epoch: 3.23
+- step: 700

 ## Model description

@@ -41,20 +41,20 @@ More information needed

 The following hyperparameters were used during training:
 - learning_rate: 0.0003
-- train_batch_size: 16
+- train_batch_size: 32
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 32
+- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 100
-- training_steps: 4000
+- training_steps: 1500
 - mixed_precision_training: Native AMP

 ### Framework versions

-- Transformers 4.36.0.dev0
+- Transformers 4.35.0
 - Pytorch 2.0.1
-- Datasets 2.14.6
+- Datasets 1.18.3
 - Tokenizers 0.14.1
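
The training script is not part of this commit, but the updated hyperparameter list maps directly onto Transformers `TrainingArguments`. A minimal sketch of that mapping, assuming the usual `Trainer`-based fine-tuning flow; `output_dir` and the `fp16` flag (standing in for "Native AMP") are assumptions, everything else mirrors the card:

```python
# Hedged reconstruction of the hyperparameters listed in the updated card.
# output_dir="asr2" is an assumption; the remaining values mirror the diff above.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="asr2",               # assumed
    learning_rate=3e-4,
    per_device_train_batch_size=32,  # train_batch_size in the card
    per_device_eval_batch_size=8,
    seed=42,
    gradient_accumulation_steps=2,   # 32 * 2 = 64 total_train_batch_size
    lr_scheduler_type="linear",
    warmup_steps=100,
    max_steps=1500,                  # training_steps in the card
    fp16=True,                       # "Native AMP" mixed precision
)
```

The Adam betas (0.9, 0.999) and epsilon (1e-08) in the card are the Transformers defaults, so they need no explicit arguments.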
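
To sanity-check the reported eval_wer, the checkpoint can be loaded and scored directly. A minimal sketch, assuming the model keeps the `Wav2Vec2ForCTC` architecture of the Akashpb13/Swahili_xlsr base it was originally fine-tuned from; `sample.wav`, the reference transcript, and the use of the `evaluate` library are placeholders and assumptions, not part of this commit:

```python
# Minimal inference + WER sketch (assumes a Wav2Vec2ForCTC checkpoint;
# "sample.wav" and the reference string are hypothetical placeholders).
import torch
import librosa
import evaluate
from transformers import Wav2Vec2Processor, Wav2Vec2ForCTC

processor = Wav2Vec2Processor.from_pretrained("GodsonNtungi/asr2")
model = Wav2Vec2ForCTC.from_pretrained("GodsonNtungi/asr2")
model.eval()

# Load 16 kHz mono audio, the sampling rate XLS-R-style models expect.
speech, _ = librosa.load("sample.wav", sr=16_000)
inputs = processor(speech, sampling_rate=16_000, return_tensors="pt")

with torch.no_grad():
    logits = model(inputs.input_values).logits

# Greedy CTC decoding.
pred_ids = torch.argmax(logits, dim=-1)
transcription = processor.batch_decode(pred_ids)[0]

# Word error rate against a reference transcript (placeholder string).
wer = evaluate.load("wer")
print(transcription)
print(wer.compute(predictions=[transcription],
                  references=["reference transcript here"]))
```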