seastar105 committed
Commit 9a6a930 · 1 Parent(s): 54a6e0f

Update README.md

Files changed (1)
  1. README.md +16 -11
README.md CHANGED
@@ -12,15 +12,18 @@ A model trained from OpenAI's whisper-base model on the datasets below
  - Low-quality telephone network speech recognition data (https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=571)
  - Broadcast content conversational speech recognition data (https://www.aihub.or.kr/aihubdata/data/view.do?dataSetSn=463)
 
  ```
- train_steps: 20000
- warmup_steps: 2000
  lr scheduler: linear warmup cosine decay
  max learning rate: 1e-4
- batch size: 256
  max_grad_norm: 1.0
  adamw_beta1: 0.9
  adamw_beta2: 0.98
  ```
 
  ### Evaluation
@@ -30,14 +33,16 @@ https://github.com/rtzr/Awesome-Korean-Speech-Recognition
  Results on the test sets from the repository above, excluding "주요 영역별 회의 음성" (meeting speech by major domain). In the table below, whisper_base_komixv2 is this model's performance.
 
 
- | Model                 | cv_15_ko | fleurs_ko | kcall_testset | kconf_test | kcounsel_test | klec_testset | kspon_clean | kspon_other |
- |-----------------------|----------|-----------|---------------|------------|---------------|--------------|-------------|-------------|
- | whisper_base          | 21.16    | 11.89     | 42.56         | 27.62      | 22.24         | 28.65        | 30.41       | 27.02       |
- | whisper_base_komix    | 15.42    | 7.16      | 20.86         | 14.24      | 12.64         | 13.44        | 12.26       | 12.12       |
- | whisper_base_komixv2  | 13.04    | 7.04      | 10.54         | 13.1       | 10.65         | 12.99        | 12.44       | 12.56       |
- | whisper_large_v3      | 5.11     | 3.72      | 5.45          | 9.35       | 3.83          | 8.46         | 15.08       | 12.89       |
- | whisper_turbo         | 5.38     | 3.95      | 5.89          | 9.77       | 4.21          | 9.27         | 16.49       | 13.54       |
- | whisper_turbo_lora    | 6.25     | 4.0       | 6.51          | 9.94       | 5.05          | 8.84         | 9.35        | 9.29        |
 
  ### Acknowledgement
  - This model was trained with support from Google's TRC program.
 
  - Low-quality telephone network speech recognition data (https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=571)
  - Broadcast content conversational speech recognition data (https://www.aihub.or.kr/aihubdata/data/view.do?dataSetSn=463)
 
+ Training setup
+
  ```
+ train_steps: 50000
+ warmup_steps: 500
  lr scheduler: linear warmup cosine decay
  max learning rate: 1e-4
+ batch size: 1024
  max_grad_norm: 1.0
  adamw_beta1: 0.9
  adamw_beta2: 0.98
+ adamw_eps: 1e-6
  ```
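The `lr scheduler: linear warmup cosine decay` setting above, together with `warmup_steps: 500`, `train_steps: 50000`, and `max learning rate: 1e-4`, can be sketched as a plain function. This is a minimal illustration and not the author's training code: the function name `lr_at_step`, a warmup that starts from zero, and a decay that ends at `min_lr = 0.0` are all assumptions the README does not state.

```python
import math


def lr_at_step(step, max_lr=1e-4, warmup_steps=500, train_steps=50000, min_lr=0.0):
    """Learning rate at a given optimizer step (hypothetical helper).

    Assumptions not stated in the README: warmup starts at 0 and the
    cosine phase decays to min_lr exactly at train_steps.
    """
    if step < warmup_steps:
        # Linear warmup: 0 at step 0, approaching max_lr at warmup_steps.
        return max_lr * step / warmup_steps
    # Fraction of the decay phase completed, clamped to [0, 1].
    progress = (step - warmup_steps) / max(1, train_steps - warmup_steps)
    progress = min(progress, 1.0)
    # Cosine decay from max_lr (progress 0) down to min_lr (progress 1).
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    for s in (0, 250, 500, 25250, 50000):
        print(s, lr_at_step(s))
```

Under these assumptions the rate peaks at step 500 and is halved midway through the decay phase. With the previous values (`warmup_steps: 2000`, `train_steps: 20000`) the same function applies by changing the two arguments.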
 
  ### Evaluation
 
  Results on the test sets from the repository above, excluding "주요 영역별 회의 음성" (meeting speech by major domain). In the table below, whisper_base_komixv2 is this model's performance.
 
 
+ | Model                    | cv_15_ko | fleurs_ko | kcall_testset | kconf_test | kcounsel_test | klec_testset | kspon_clean | kspon_other | Average |
+ |--------------------------|----------|-----------|---------------|------------|---------------|--------------|-------------|-------------|---------|
+ | whisper_base             | 21.16    | 11.89     | 42.56         | 27.62      | 22.24         | 28.65        | 30.41       | 27.02       | 26.44   |
+ | whisper_base_komix       | 15.42    | 7.16      | 20.86         | 14.24      | 12.64         | 13.44        | 12.26       | 12.12       | 13.52   |
+ | whisper_base_komixv2     | 10.27    | 5.14      | 6.23          | 10.86      | 7.01          | 10.38        | 9.98        | 9.99        | 8.73    |
+ | whisper_base_komixv2_phn | 12.81    | 8.27      | 9.5           | 13.26      | 11.33         | 14.24        | 13.11       | 13.3        | 11.98   |
+ | whisper_large_v2         | 6.58     | 3.74      | 22.26         | 13.88      | 8.95          | 13.84        | 15.51       | 13.6        | 12.29   |
+ | whisper_large_v3         | 5.11     | 3.72      | 5.45          | 9.35       | 3.83          | 8.46         | 15.08       | 12.89       | 7.99    |
+ | whisper_large_v3_turbo   | 5.38     | 3.95      | 5.89          | 9.77       | 4.21          | 9.27         | 16.49       | 13.54       | 8.56    |
+
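The `Average` column added in this hunk appears to be the unweighted mean of the eight per-test-set scores in each row. The README does not say how the column is computed or name the metric, so the check below is a sketch under that assumption, using four rows copied from the table. `macro_average` is a hypothetical helper name.

```python
# Per-test-set scores copied from four rows of the table, in column order:
# cv_15_ko, fleurs_ko, kcall_testset, kconf_test, kcounsel_test,
# klec_testset, kspon_clean, kspon_other.
scores = {
    "whisper_base": [21.16, 11.89, 42.56, 27.62, 22.24, 28.65, 30.41, 27.02],
    "whisper_base_komix": [15.42, 7.16, 20.86, 14.24, 12.64, 13.44, 12.26, 12.12],
    "whisper_base_komixv2": [10.27, 5.14, 6.23, 10.86, 7.01, 10.38, 9.98, 9.99],
    "whisper_large_v3": [5.11, 3.72, 5.45, 9.35, 3.83, 8.46, 15.08, 12.89],
}

# Values from the Average column for the same rows.
reported = {
    "whisper_base": 26.44,
    "whisper_base_komix": 13.52,
    "whisper_base_komixv2": 8.73,
    "whisper_large_v3": 7.99,
}


def macro_average(values):
    """Unweighted mean over test sets, rounded to two decimals (assumed)."""
    return round(sum(values) / len(values), 2)


averages = {name: macro_average(vals) for name, vals in scores.items()}

if __name__ == "__main__":
    for name, value in averages.items():
        print(f"{name}: computed {value}, reported {reported[name]}")
```

An unweighted mean treats every test set equally regardless of its size, so a set with few utterances moves the average as much as a large one.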
 
  ### Acknowledgement
  - This model was trained with support from Google's TRC program.