pere committed on
Commit bc7aef5 · 1 Parent(s): 66a66f1
README.md ADDED
@@ -0,0 +1,68 @@
+ ---
+ language:
+ - no
+ license: apache-2.0
+ tags:
+ - whisper-event
+ - norwegian
+ datasets:
+ - NbAiLab/NCC_S
+ - NbAiLab/NPSC
+ - NbAiLab/NST
+ metrics:
+ - wer
+ model-index:
+ - name: Whisper Small Norwegian Bokmål
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: FLEURS
+       type: google/fleurs
+       config: nb_no
+       split: test
+       args: nb_no
+     metrics:
+     - name: Wer
+       type: wer
+       value: 23.93
+ ---
+
+ # Whisper Small Norwegian Bokmål
+
+ This model is a fine-tuned version of [openai/whisper-medium](https://huggingface.co/openai/whisper-medium) trained on several datasets.
+
+ The model is still in the middle of a long training run. It currently achieves the following results on the evaluation set:
+ - Loss: 0.4780
+ - Wer: 23.93
+
+ ## Model description
+
+ The model is trained on a large corpus of roughly 5,000 hours of speech. The sources are subtitles from the Norwegian broadcaster NRK, transcribed speeches from the Norwegian parliament, and voice recordings from Norsk Språkteknologi.
+
+ ## Intended uses & limitations
+
+ The model will be free for everyone to use when it is finished.
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 3e-06
+ - train_batch_size: 128
+ - gradient_accumulation_steps: 2
+ - eval_batch_size: 32
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: constant with warmup
+ - lr_scheduler_warmup_steps: 1000
+ - training_steps: 50,000 (currently at 1,000)
+ - mixed_precision_training: fp16
+ - deepspeed: true
+
+ ### Training results
+ See [TensorBoard Metrics](https://huggingface.co/NbAiLab/whisper-small-nob/tensorboard)
+
+
+
+
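The hyperparameter list in the README above can be read as a small piece of arithmetic. The sketch below is illustrative only and is not the training code used for this model: it assumes that "constant with warmup" means a linear ramp from 0 to the base learning rate over the warmup steps, followed by a flat rate, and that `train_batch_size` is a per-device value. The function names `constant_with_warmup` and `samples_per_update` are hypothetical helpers introduced here; the number of devices is not stated in the README, so it is left out.

```python
# Values copied from the README's hyperparameter list.
LEARNING_RATE = 3e-06
WARMUP_STEPS = 1000
TRAIN_BATCH_SIZE = 128
GRADIENT_ACCUMULATION_STEPS = 2


def constant_with_warmup(step: int,
                         base_lr: float = LEARNING_RATE,
                         warmup_steps: int = WARMUP_STEPS) -> float:
    """Learning rate at a given optimizer step.

    Assumption: linear warmup from 0 to base_lr, then constant.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    return base_lr


def samples_per_update(batch_size: int = TRAIN_BATCH_SIZE,
                       accumulation: int = GRADIENT_ACCUMULATION_STEPS) -> int:
    """Samples contributing to one optimizer update on a single device."""
    return batch_size * accumulation


if __name__ == "__main__":
    # Step 1,000 is where the README reports the run currently stands.
    for step in (0, 500, 1000, 50000):
        print(step, constant_with_warmup(step))
    print(samples_per_update())  # 128 * 2 = 256
```

Under these assumptions, the reported checkpoint at step 1,000 sits exactly at the end of warmup, so the results listed in the README come from a model that has only just reached the full learning rate.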
runs/Dec15_19-44-15_dante/events.out.tfevents.1671129887.dante.1198326.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:7b9456836b3f65994e5aedb52c318fbd917b6481044ec0fbaeacb560c676134a
- size 10875
+ oid sha256:bb2820a34dec31732207a6a3ce568ff74e20f309d306614381064482cad718d5
+ size 13230
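The second file in this commit is a Git LFS pointer, not the TensorBoard event file itself: only the `oid` and `size` lines change. As a minimal sketch of how such a pointer can be read, the snippet below parses the two pointer versions shown in the diff and compares their sizes. `parse_lfs_pointer` is a hypothetical helper written for this illustration, not part of Git LFS or any library, and it assumes the simple `key value` line layout seen above.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into its fields.

    Assumes each non-empty line is 'key value', as in the diff above.
    """
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer contents before and after the commit, copied from the diff.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:7b9456836b3f65994e5aedb52c318fbd917b6481044ec0fbaeacb560c676134a
size 10875
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:bb2820a34dec31732207a6a3ce568ff74e20f309d306614381064482cad718d5
size 13230
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
growth = new["size"] - old["size"]  # 13230 - 10875 = 2355 bytes
```

The event file grew by 2,355 bytes, which is consistent with additional training metrics having been logged between the two commits.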