ales commited on
Commit
0666270
·
1 Parent(s): 6829ef1

update model card README.md

Browse files
Files changed (2) hide show
  1. README.md +16 -14
  2. train.log +2 -0
README.md CHANGED
@@ -1,41 +1,38 @@
1
  ---
2
- language:
3
- - be
4
  license: apache-2.0
5
  tags:
6
- - whisper-event
7
  - generated_from_trainer
8
  datasets:
9
- - mozilla-foundation/common_voice_11_0
10
  metrics:
11
  - wer
12
  model-index:
13
- - name: Whisper Small Belarusian
14
  results:
15
  - task:
16
  name: Automatic Speech Recognition
17
  type: automatic-speech-recognition
18
  dataset:
19
- name: mozilla-foundation/common_voice_11_0 be
20
- type: mozilla-foundation/common_voice_11_0
21
  config: be
22
  split: validation
23
  args: be
24
  metrics:
25
  - name: Wer
26
  type: wer
27
- value: 60.43956043956044
28
  ---
29
 
30
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
31
  should probably proofread and complete it, then remove this comment. -->
32
 
33
- # Whisper Small Belarusian
34
 
35
- This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the mozilla-foundation/common_voice_11_0 be dataset.
36
  It achieves the following results on the evaluation set:
37
- - Loss: 0.6197
38
- - Wer: 60.4396
39
 
40
  ## Model description
41
 
@@ -54,14 +51,14 @@ More information needed
54
  ### Training hyperparameters
55
 
56
  The following hyperparameters were used during training:
57
- - learning_rate: 0.0001
58
  - train_batch_size: 32
59
  - eval_batch_size: 32
60
  - seed: 42
61
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
62
  - lr_scheduler_type: linear
63
  - lr_scheduler_warmup_steps: 5
64
- - training_steps: 100
65
  - mixed_precision_training: Native AMP
66
 
67
  ### Training results
@@ -78,6 +75,11 @@ The following hyperparameters were used during training:
78
  | 0.7832 | 0.8 | 80 | 0.6129 | 65.9341 |
79
  | 0.6031 | 0.9 | 90 | 0.5877 | 61.3553 |
80
  | 0.6678 | 1.0 | 100 | 0.5759 | 61.5385 |
 
 
 
 
 
81
 
82
 
83
  ### Framework versions
 
1
  ---
 
 
2
  license: apache-2.0
3
  tags:
 
4
  - generated_from_trainer
5
  datasets:
6
+ - common_voice_11_0
7
  metrics:
8
  - wer
9
  model-index:
10
+ - name: whisper-tiny-be-test
11
  results:
12
  - task:
13
  name: Automatic Speech Recognition
14
  type: automatic-speech-recognition
15
  dataset:
16
+ name: common_voice_11_0
17
+ type: common_voice_11_0
18
  config: be
19
  split: validation
20
  args: be
21
  metrics:
22
  - name: Wer
23
  type: wer
24
+ value: 55.67765567765568
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
28
  should probably proofread and complete it, then remove this comment. -->
29
 
30
+ # whisper-tiny-be-test
31
 
32
+ This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the common_voice_11_0 dataset.
33
  It achieves the following results on the evaluation set:
34
+ - Loss: 0.5387
35
+ - Wer: 55.6777
36
 
37
  ## Model description
38
 
 
51
  ### Training hyperparameters
52
 
53
  The following hyperparameters were used during training:
54
+ - learning_rate: 3.1578947368421056e-06
55
  - train_batch_size: 32
56
  - eval_batch_size: 32
57
  - seed: 42
58
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
59
  - lr_scheduler_type: linear
60
  - lr_scheduler_warmup_steps: 5
61
+ - training_steps: 150
62
  - mixed_precision_training: Native AMP
63
 
64
  ### Training results
 
75
  | 0.7832 | 0.8 | 80 | 0.6129 | 65.9341 |
76
  | 0.6031 | 0.9 | 90 | 0.5877 | 61.3553 |
77
  | 0.6678 | 1.0 | 100 | 0.5759 | 61.5385 |
78
+ | 0.4611 | 0.07 | 110 | 0.5625 | 57.6923 |
79
+ | 0.4451 | 0.13 | 120 | 0.5636 | 56.5934 |
80
+ | 0.3615 | 0.2 | 130 | 0.5490 | 61.1722 |
81
+ | 0.4055 | 0.27 | 140 | 0.5382 | 55.1282 |
82
+ | 0.2946 | 0.33 | 150 | 0.5387 | 55.6777 |
83
 
84
 
85
  ### Framework versions
train.log CHANGED
@@ -56,3 +56,5 @@
56
  {'loss': 0.4055, 'learning_rate': 8.96551724137931e-06, 'epoch': 0.27}
57
  {'eval_loss': 0.5382302403450012, 'eval_wer': 55.12820512820513, 'eval_runtime': 22.4274, 'eval_samples_per_second': 2.854, 'eval_steps_per_second': 0.089, 'epoch': 0.27}
58
  {'loss': 0.2946, 'learning_rate': 2.0689655172413796e-06, 'epoch': 0.33}
 
 
 
56
  {'loss': 0.4055, 'learning_rate': 8.96551724137931e-06, 'epoch': 0.27}
57
  {'eval_loss': 0.5382302403450012, 'eval_wer': 55.12820512820513, 'eval_runtime': 22.4274, 'eval_samples_per_second': 2.854, 'eval_steps_per_second': 0.089, 'epoch': 0.27}
58
  {'loss': 0.2946, 'learning_rate': 2.0689655172413796e-06, 'epoch': 0.33}
59
+ {'eval_loss': 0.53872150182724, 'eval_wer': 55.67765567765568, 'eval_runtime': 20.4177, 'eval_samples_per_second': 3.135, 'eval_steps_per_second': 0.098, 'epoch': 0.33}
60
+ {'train_runtime': 451.4438, 'train_samples_per_second': 10.633, 'train_steps_per_second': 0.332, 'train_loss': 0.13119232177734375, 'epoch': 0.33}