hynt/Zipformer-30M-RNNT-6000h


hynt committed on Oct 17, 2025

Commit 65b7e84 · verified · 1 Parent(s): e3c9d0a

Update README.md

Files changed (1)

README.md +6 -1


@@ -46,7 +46,7 @@ The model was trained on approximately **6000 hours of high-quality Vietnamese s
 ---
 ## 🏆 Achievements
-This model architecture **won First Place** in the **Vietnamese Language Speech Processing (VLSP)** competition **2025**.
 Comprehensive details about **training data**, **optimization strategies**, **architecture improvements**, and **evaluation methodologies** are available in the paper below:
 👉 [Read the full paper on Overleaf](https://www.overleaf.com/read/wjntrgchhbgv#48aa25)
@@ -62,6 +62,11 @@ Comprehensive details about **training data**, **optimization strategies**, **ar
 ---
 ## 💬 Summary
 The **ZipFormer-30M-RNNT-6000h** model demonstrates that a lightweight architecture can still achieve state-of-the-art accuracy for Vietnamese ASR.
 It is designed for **fast deployment on CPU-based systems**, making it ideal for **real-time speech recognition**, **callbots**, and **embedded speech interfaces**.

 ---
 ## 🏆 Achievements
+By training this model architecture on 4,000 hours of data, I **won First Place** in the **Vietnamese Language Speech Processing (VLSP)** competition **2025**.
 Comprehensive details about **training data**, **optimization strategies**, **architecture improvements**, and **evaluation methodologies** are available in the paper below:
 👉 [Read the full paper on Overleaf](https://www.overleaf.com/read/wjntrgchhbgv#48aa25)
 ---
+## 🚀 Online Demo
+You can try the model directly here:
+👉 https://huggingface.co/spaces/hynt/k2-automatic-speech-recognition-demo
 ## 💬 Summary
 The **ZipFormer-30M-RNNT-6000h** model demonstrates that a lightweight architecture can still achieve state-of-the-art accuracy for Vietnamese ASR.
 It is designed for **fast deployment on CPU-based systems**, making it ideal for **real-time speech recognition**, **callbots**, and **embedded speech interfaces**.