Baselhany commited on
Commit
74f03e4
·
verified ·
1 Parent(s): 1a7872d

Model save

Browse files
README.md CHANGED
@@ -1,27 +1,28 @@
1
  ---
2
  library_name: transformers
3
- language:
4
- - ar
5
  license: apache-2.0
6
- base_model: openai/whisper-base
7
  tags:
8
  - generated_from_trainer
9
- metrics:
10
- - wer
11
  model-index:
12
- - name: Whisper base AR - BA
13
  results: []
14
  ---
15
 
16
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
  should probably proofread and complete it, then remove this comment. -->
18
 
19
- # Whisper base AR - BA
20
 
21
- This model is a fine-tuned version of [openai/whisper-base](https://huggingface.co/openai/whisper-base) on the quran-ayat-speech-to-text dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.0890
24
- - Wer: 0.1942
 
 
 
 
 
25
 
26
  ## Model description
27
 
@@ -49,38 +50,12 @@ The following hyperparameters were used during training:
49
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
50
  - lr_scheduler_type: linear
51
  - lr_scheduler_warmup_steps: 500
52
- - num_epochs: 20
53
  - mixed_precision_training: Native AMP
54
 
55
- ### Training results
56
-
57
- | Training Loss | Epoch | Step | Validation Loss | Wer |
58
- |:-------------:|:-------:|:----:|:---------------:|:------:|
59
- | 1.7224 | 1.0 | 301 | 0.0867 | 0.1888 |
60
- | 1.6242 | 2.0 | 602 | 0.0876 | 0.1970 |
61
- | 1.3485 | 3.0 | 903 | 0.0880 | 0.1932 |
62
- | 1.2337 | 4.0 | 1204 | 0.0895 | 0.1872 |
63
- | 1.0669 | 5.0 | 1505 | 0.0861 | 0.1932 |
64
- | 1.0507 | 6.0 | 1806 | 0.0855 | 0.1829 |
65
- | 0.9844 | 7.0 | 2107 | 0.0861 | 0.1892 |
66
- | 0.8618 | 8.0 | 2408 | 0.0844 | 0.1873 |
67
- | 0.7956 | 9.0 | 2709 | 0.0849 | 0.1964 |
68
- | 0.7654 | 10.0 | 3010 | 0.0842 | 0.1895 |
69
- | 0.6957 | 11.0 | 3311 | 0.0843 | 0.1848 |
70
- | 0.7042 | 12.0 | 3612 | 0.0837 | 0.1866 |
71
- | 0.6364 | 13.0 | 3913 | 0.0831 | 0.1929 |
72
- | 0.6357 | 14.0 | 4214 | 0.0829 | 0.1895 |
73
- | 0.6137 | 15.0 | 4515 | 0.0827 | 0.1904 |
74
- | 0.5909 | 16.0 | 4816 | 0.0824 | 0.1904 |
75
- | 0.5582 | 17.0 | 5117 | 0.0823 | 0.1941 |
76
- | 0.5409 | 18.0 | 5418 | 0.0824 | 0.1916 |
77
- | 0.5305 | 19.0 | 5719 | 0.0823 | 0.1895 |
78
- | 0.5137 | 19.9343 | 6000 | 0.0824 | 0.1902 |
79
-
80
-
81
  ### Framework versions
82
 
83
- - Transformers 4.51.3
84
- - Pytorch 2.6.0+cu124
85
- - Datasets 3.6.0
86
- - Tokenizers 0.21.1
 
1
  ---
2
  library_name: transformers
 
 
3
  license: apache-2.0
4
+ base_model: Baselhany/Graduation_Project_Distilation_Whisper_base3
5
  tags:
6
  - generated_from_trainer
 
 
7
  model-index:
8
+ - name: Graduation_Project_Distilation_Whisper_base3
9
  results: []
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
+ # Graduation_Project_Distilation_Whisper_base3
16
 
17
+ This model is a fine-tuned version of [Baselhany/Graduation_Project_Distilation_Whisper_base3](https://huggingface.co/Baselhany/Graduation_Project_Distilation_Whisper_base3) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - eval_loss: 0.0292
20
+ - eval_model_preparation_time: 0.0028
21
+ - eval_wer: 0.0968
22
+ - eval_runtime: 784.1659
23
+ - eval_samples_per_second: 3.826
24
+ - eval_steps_per_second: 0.478
25
+ - step: 0
26
 
27
  ## Model description
28
 
 
50
  - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
51
  - lr_scheduler_type: linear
52
  - lr_scheduler_warmup_steps: 500
53
+ - num_epochs: 5
54
  - mixed_precision_training: Native AMP
55
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
56
  ### Framework versions
57
 
58
+ - Transformers 4.51.1
59
+ - Pytorch 2.5.1+cu124
60
+ - Datasets 3.5.0
61
+ - Tokenizers 0.21.0
config.json CHANGED
@@ -53,7 +53,7 @@
53
  "pad_token_id": 50257,
54
  "scale_embedding": false,
55
  "torch_dtype": "float32",
56
- "transformers_version": "4.51.3",
57
  "use_cache": true,
58
  "use_weighted_layer_sum": false,
59
  "vocab_size": 51865
 
53
  "pad_token_id": 50257,
54
  "scale_embedding": false,
55
  "torch_dtype": "float32",
56
+ "transformers_version": "4.51.1",
57
  "use_cache": true,
58
  "use_weighted_layer_sum": false,
59
  "vocab_size": 51865
generation_config.json CHANGED
@@ -208,7 +208,7 @@
208
  "tie_word_embeddings": true,
209
  "tokenizer_class": null,
210
  "torchscript": false,
211
- "transformers_version": "4.51.3",
212
  "use_bfloat16": false,
213
  "use_weighted_layer_sum": false,
214
  "vocab_size": 51865
 
208
  "tie_word_embeddings": true,
209
  "tokenizer_class": null,
210
  "torchscript": false,
211
+ "transformers_version": "4.51.1",
212
  "use_bfloat16": false,
213
  "use_weighted_layer_sum": false,
214
  "vocab_size": 51865
runs/Jun23_14-04-30_622dd1a6c211/events.out.tfevents.1750688255.622dd1a6c211.19.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef949d0c92e2ba648f2b06ff9cfe4a54ede9d48114ae9049dfe88c67aa15c76d
3
+ size 404
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:01bd6b74ff3330c40b66014b7a8633f14d2e802969473d23195377f8dfda4e6c
3
  size 5496
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:789f58b69a0f91dab5c8b603a740843d5ecea78923fcf90739cac1ee6262547c
3
  size 5496