Update README.md
Browse files
README.md
CHANGED
|
@@ -46,7 +46,7 @@ The model was trained on approximately **6000 hours of high-quality Vietnamese s
|
|
| 46 |
---
|
| 47 |
|
| 48 |
## π Achievements
|
| 49 |
-
|
| 50 |
Comprehensive details about **training data**, **optimization strategies**, **architecture improvements**, and **evaluation methodologies** are available in the paper below:
|
| 51 |
|
| 52 |
π [Read the full paper on Overleaf](https://www.overleaf.com/read/wjntrgchhbgv#48aa25)
|
|
@@ -62,6 +62,11 @@ Comprehensive details about **training data**, **optimization strategies**, **ar
|
|
| 62 |
|
| 63 |
---
|
| 64 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 65 |
## π¬ Summary
|
| 66 |
The **ZipFormer-30M-RNNT-6000h** model demonstrates that a lightweight architecture can still achieve state-of-the-art accuracy for Vietnamese ASR.
|
| 67 |
It is designed for **fast deployment on CPU-based systems**, making it ideal for **real-time speech recognition**, **callbots**, and **embedded speech interfaces**.
|
|
|
|
| 46 |
---
|
| 47 |
|
| 48 |
## π Achievements
|
| 49 |
+
By training this model architecture on 4,000 hours of data, I **won First Place** in the **Vietnamese Language Speech Processing (VLSP)** competition **2025**.
|
| 50 |
Comprehensive details about **training data**, **optimization strategies**, **architecture improvements**, and **evaluation methodologies** are available in the paper below:
|
| 51 |
|
| 52 |
π [Read the full paper on Overleaf](https://www.overleaf.com/read/wjntrgchhbgv#48aa25)
|
|
|
|
| 62 |
|
| 63 |
---
|
| 64 |
|
| 65 |
+
## π Online Demo
|
| 66 |
+
|
| 67 |
+
You can try the model directly here:
|
| 68 |
+
π https://huggingface.co/spaces/hynt/k2-automatic-speech-recognition-demo
|
| 69 |
+
|
| 70 |
## π¬ Summary
|
| 71 |
The **ZipFormer-30M-RNNT-6000h** model demonstrates that a lightweight architecture can still achieve state-of-the-art accuracy for Vietnamese ASR.
|
| 72 |
It is designed for **fast deployment on CPU-based systems**, making it ideal for **real-time speech recognition**, **callbots**, and **embedded speech interfaces**.
|