Nikolajvestergaard commited on
Commit
8fe48d7
·
1 Parent(s): 449f1d1

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -0
README.md ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - generated_from_trainer
5
+ metrics:
6
+ - wer
7
+ model-index:
8
+ - name: Japanese_Fine_Tuned_Whisper_Model
9
+ results: []
10
+ datasets:
11
+ - mozilla-foundation/common_voice_11_0
12
+ language:
13
+ - ja
14
+ ---
15
+
16
+ # Japanese_Fine_Tuned_Whisper_Model
17
+
18
+ This model is a fine-tuned version of [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny) on the Common Voice dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 0.549100
21
+ - Wer: 225.233037
22
+
23
+ ## Model description
24
+
25
+ The tiny Whisper model is fine-tuned on Japanese speech samples from the Common Voice dataset, based on which users can perform Automatic Speech Recognition in real time in Japanese.
26
+
27
+ ### Training hyperparameters
28
+
29
+ The following hyperparameters were used during training:
30
+ - learning_rate: 1e-05
31
+ - train_batch_size: 8
32
+ - eval_batch_size: 8
33
+ - seed: 42
34
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
35
+ - lr_scheduler_type: linear
36
+ - lr_scheduler_warmup_steps: 100
37
+ - training_steps: 1000
38
+ - mixed_precision_training: Native AMP
39
+
40
+ ### Training results
41
+
42
+ | Training Loss | Step | Validation Loss | Wer |
43
+ |:-------------:|:-----:|:-----------------:|:----------:|
44
+ | 0.8097 | 200 | 0.801917 | 601.560806 |
45
+ | 0.7200 | 400 | 0.783436 | 327.335790 |
46
+ | 0.6810 | 600 | 0.759281 | 254.064600 |
47
+ | 0.7351 | 800 | 0.747759 | 241.426404 |
48
+ | 0.5491 | 1000 | 0.747127 | 225.233037 |
49
+
50
+ ### Framework versions
51
+
52
+ - Transformers 4.27.0.dev0
53
+ - Pytorch 1.13.1+cu116
54
+ - Datasets 2.10.1
55
+ - Tokenizers 0.13.2