Commit Β·
9a6a930
1
Parent(s): 54a6e0f
Update README.md
Browse files
README.md
CHANGED
|
@@ -12,15 +12,18 @@ OpenAIμ whisper-base λͺ¨λΈμ μλ λ°μ΄ν°μ
μΌλ‘ νμ΅ν λͺ¨λΈμ
|
|
| 12 |
- μ μμ§ μ νλ§ μμ±μΈμ λ°μ΄ν° (https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=571)
|
| 13 |
- λ°©μ‘ μ½ν
μΈ λν체 μμ±μΈμ λ°μ΄ν° (https://www.aihub.or.kr/aihubdata/data/view.do?dataSetSn=463)
|
| 14 |
|
|
|
|
|
|
|
| 15 |
```
|
| 16 |
-
train_steps:
|
| 17 |
-
warmup_steps:
|
| 18 |
lr scheduler: linear warmup cosine decay
|
| 19 |
max learning rate: 1e-4
|
| 20 |
-
batch size:
|
| 21 |
max_grad_norm: 1.0
|
| 22 |
adamw_beta1: 0.9
|
| 23 |
adamw_beta2: 0.98
|
|
|
|
| 24 |
```
|
| 25 |
|
| 26 |
### Evaluation
|
|
@@ -30,14 +33,16 @@ https://github.com/rtzr/Awesome-Korean-Speech-Recognition
|
|
| 30 |
μ λ ν¬μ§ν 리μμ μ£Όμ μμλ³ νμ μμ±μ μ μΈν ν
μ€νΈμ
κ²°κ³Όμ
λλ€. μλ ν
μ΄λΈμμ whisper_base_komixv2κ° λ³Έ λͺ¨λΈ μ±λ₯μ
λλ€.
|
| 31 |
|
| 32 |
|
| 33 |
-
|
|
| 34 |
-
|
| 35 |
-
| whisper_base
|
| 36 |
-
| whisper_base_komix
|
| 37 |
-
|
|
| 38 |
-
|
|
| 39 |
-
|
|
| 40 |
-
|
|
|
|
|
|
|
|
| 41 |
|
| 42 |
### Acknowledgement
|
| 43 |
- λ³Έ λͺ¨λΈμ ꡬκΈμ TRC νλ‘κ·Έλ¨μ μ§μμΌλ‘ νμ΅νμ΅λλ€.
|
|
|
|
| 12 |
- μ μμ§ μ νλ§ μμ±μΈμ λ°μ΄ν° (https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&dataSetSn=571)
|
| 13 |
- λ°©μ‘ μ½ν
μΈ λν체 μμ±μΈμ λ°μ΄ν° (https://www.aihub.or.kr/aihubdata/data/view.do?dataSetSn=463)
|
| 14 |
|
| 15 |
+
Training setup
|
| 16 |
+
|
| 17 |
```
|
| 18 |
+
train_steps: 50000
|
| 19 |
+
warmup_steps: 500
|
| 20 |
lr scheduler: linear warmup cosine decay
|
| 21 |
max learning rate: 1e-4
|
| 22 |
+
batch size: 1024
|
| 23 |
max_grad_norm: 1.0
|
| 24 |
adamw_beta1: 0.9
|
| 25 |
adamw_beta2: 0.98
|
| 26 |
+
adamw_eps: 1e-6
|
| 27 |
```
|
| 28 |
|
| 29 |
### Evaluation
|
|
|
|
| 33 |
μ λ ν¬μ§ν 리μμ μ£Όμ μμλ³ νμ μμ±μ μ μΈν ν
μ€νΈμ
κ²°κ³Όμ
λλ€. μλ ν
μ΄λΈμμ whisper_base_komixv2κ° λ³Έ λͺ¨λΈ μ±λ₯μ
λλ€.
|
| 34 |
|
| 35 |
|
| 36 |
+
| Model | cv_15_ko | fleurs_ko | kcall_testset | kconf_test | kcounsel_test | klec_testset | kspon_clean | kspon_other | Average |
|
| 37 |
+
|--------------------------|----------|-----------|---------------|------------|---------------|--------------|-------------|-------------|---------|
|
| 38 |
+
| whisper_base | 21.16 | 11.89 | 42.56 | 27.62 | 22.24 | 28.65 | 30.41 | 27.02 | 26.44 |
|
| 39 |
+
| whisper_base_komix | 15.42 | 7.16 | 20.86 | 14.24 | 12.64 | 13.44 | 12.26 | 12.12 | 13.52 |
|
| 40 |
+
| whisper_base_komixv2 | 10.27 | 5.14 | 6.23 | 10.86 | 7.01 | 10.38 | 9.98 | 9.99 | 8.73 |
|
| 41 |
+
| whisper_base_komixv2_phn| 12.81 | 8.27 | 9.5 | 13.26 | 11.33 | 14.24 | 13.11 | 13.3 | 11.98 |
|
| 42 |
+
| whisper_large_v2 | 6.58 | 3.74 | 22.26 | 13.88 | 8.95 | 13.84 | 15.51 | 13.6 | 12.29 |
|
| 43 |
+
| whisper_large_v3 | 5.11 | 3.72 | 5.45 | 9.35 | 3.83 | 8.46 | 15.08 | 12.89 | 7.99 |
|
| 44 |
+
| whisper_large_v3_turbo | 5.38 | 3.95 | 5.89 | 9.77 | 4.21 | 9.27 | 16.49 | 13.54 | 8.56 |
|
| 45 |
+
|
| 46 |
|
| 47 |
### Acknowledgement
|
| 48 |
- λ³Έ λͺ¨λΈμ ꡬκΈμ TRC νλ‘κ·Έλ¨μ μ§μμΌλ‘ νμ΅νμ΅λλ€.
|