zerolat3ncy committed · Commit dad7967 · verified · 1 Parent(s): c9fe96e

faster-whisper-medium-chichewa

README.md CHANGED
@@ -4,6 +4,8 @@ license: apache-2.0
 base_model: openai/whisper-medium
 tags:
 - generated_from_trainer
+metrics:
+- wer
 model-index:
 - name: medium-model
   results: []
@@ -15,6 +17,10 @@ should probably proofread and complete it, then remove this comment. -->
 # medium-model
 
 This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.2383
+- Wer: 0.6660
+- Cer: 0.2985
 
 ## Model description
 
@@ -34,20 +40,31 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 6
+- train_batch_size: 2
 - eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size: 12
+- total_train_batch_size: 4
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 500
 - training_steps: 5000
 - mixed_precision_training: Native AMP
 
+### Training results
+
+| Training Loss | Epoch   | Step | Validation Loss | Wer    | Cer    |
+|:-------------:|:-------:|:----:|:---------------:|:------:|:------:|
+| 0.0575        | 13.1579 | 1000 | 1.9806          | 0.6920 | 0.3415 |
+| 0.0118        | 26.3158 | 2000 | 2.0510          | 0.6764 | 0.3321 |
+| 0.0039        | 39.4737 | 3000 | 2.1584          | 0.6647 | 0.2982 |
+| 0.0011        | 52.6316 | 4000 | 2.2279          | 0.6997 | 0.3405 |
+| 0.0009        | 65.7895 | 5000 | 2.2383          | 0.6660 | 0.2985 |
+
+
 ### Framework versions
 
 - Transformers 4.48.0
-- Pytorch 2.6.0+cu124
+- Pytorch 2.8.0+cu126
 - Datasets 3.6.0
-- Tokenizers 0.21.2
+- Tokenizers 0.21.4
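For context, the updated hyperparameters correspond to a `Seq2SeqTrainingArguments` setup along these lines. This is a sketch reconstructed from the card, assuming a single-GPU run (hence total_train_batch_size = 2 × 2 = 4) and a hypothetical `output_dir`; it is not the author's actual training script.

```python
# Minimal sketch of the updated training configuration; assumes a
# single-GPU run and the standard Seq2SeqTrainer setup. Dataset and
# model wiring are omitted.
from transformers import Seq2SeqTrainingArguments

training_args = Seq2SeqTrainingArguments(
    output_dir="medium-model",         # hypothetical, matches the model-index name
    learning_rate=1e-5,
    per_device_train_batch_size=2,     # was 6 before this commit
    per_device_eval_batch_size=8,
    gradient_accumulation_steps=2,     # effective train batch size: 2 * 2 = 4
    seed=42,
    optim="adamw_torch",               # AdamW with default betas=(0.9, 0.999), eps=1e-8
    lr_scheduler_type="cosine",
    warmup_steps=500,
    max_steps=5000,
    fp16=True,                         # "Native AMP" mixed precision
)
```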
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:77d0c4dbc02f5c791d090067a62dad5c06c328c903f5c232901ecb5d8e855560
+oid sha256:cb3cabbf5fb0424ff63ab7230121e01a041e7c7521d32fac4482092cc583fd70
 size 3055544304
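The updated `model.safetensors` carries the new fine-tuned weights. A minimal inference sketch, assuming the repo id `zerolat3ncy/faster-whisper-medium-chichewa` (inferred from the commit header, it may differ) and a local `sample.wav`:

```python
# Hedged usage sketch: transcribe audio with the fine-tuned checkpoint.
# The repo id and the audio file name are assumptions, not from the card.
from transformers import pipeline

asr = pipeline(
    "automatic-speech-recognition",
    model="zerolat3ncy/faster-whisper-medium-chichewa",
)
print(asr("sample.wav")["text"])
```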
runs/Nov05_06-56-09_e4580a34292e/events.out.tfevents.1762325781.e4580a34292e.483.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fb81d03a9d65d5bca6d1d18593e135a1089e3e0cfd8d99d1a46955bd9f62147e
+size 30149
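The added event file is the TensorBoard log behind the new training-results table. One way to read its scalars back, sketched with TensorBoard's `EventAccumulator`; the `eval/wer` tag name follows the usual `Trainer` logging convention and is an assumption here:

```python
# Sketch: recover logged metrics from the added tfevents file.
from tensorboard.backend.event_processing.event_accumulator import EventAccumulator

ea = EventAccumulator("runs/Nov05_06-56-09_e4580a34292e")  # log directory from this commit
ea.Reload()
print(ea.Tags()["scalars"])           # lists available tags, e.g. "eval/wer" (assumed)
for event in ea.Scalars("eval/wer"):  # assumed tag name
    print(event.step, event.value)
```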
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ddd5d9ef933889bf05b7f386f9a142abc9dd0d03347d1393ac1eec05a95d2f9a
-size 5496
+oid sha256:d3b243009ae2adc29d06fdcfe6aa897dec00a74ccf2d7e238f906a91388ecb71
+size 5905
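`training_args.bin` is the pickled `TrainingArguments` object the `Trainer` saves alongside the weights; its size changed because the arguments changed. A sketch for inspecting it: since it is a full pickle rather than plain tensors, PyTorch 2.6 and later require `weights_only=False`, so only load files you trust.

```python
# Sketch: inspect the updated training arguments.
import torch

args = torch.load("training_args.bin", weights_only=False)  # full pickle, trusted file only
print(args.per_device_train_batch_size)  # expected: 2 after this commit
print(args.lr_scheduler_type)            # expected: cosine
```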