plabdev commited on
Commit
39fe236
·
verified ·
1 Parent(s): 8e36173

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -0
README.md ADDED
@@ -0,0 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ This model is based on whisper large v3 and trained on the dataset of jlvdoorn/atco2-asr-atcosim
2
+ It starts as an experimental model and seems to be abit overfit to air traffic vocabs but with better capturing of some of the radio words
3
+
4
+ ```
5
+ Training Metric:
6
+ eval/loss:0.06807401776313782
7
+ eval/runtime:1,922.2684
8
+ eval/samples_per_second:1.054
9
+ eval/steps_per_second:0.132
10
+ eval/wer:2.3947324274450597
11
+ train/epoch:7.1146245059288535
12
+ train/global_step:3,600
13
+ train/grad_norm:0.05588585510849953
14
+ train/learning_rate:0.00000406779661016949
15
+ train/loss:0.001
16
+ ```
17
+
18
+ hyperparameter:
19
+ ```
20
+ --model_name openai/whisper-large-v3 \
21
+ --train_batch_size 16 \
22
+ --eval_batch_size 8 \
23
+ --bf16 false \
24
+ --learning_rate 1e-5 \
25
+ --warmup_steps 100 \
26
+ --max_steps 6000 \
27
+ --save_steps 200 \
28
+ --eval_steps 200 \
29
+ --gradient_checkpointing true \
30
+ --output_dir ./whisper_atc_20241219
31
+ ```