hynt commited on
Commit
65b7e84
Β·
verified Β·
1 Parent(s): e3c9d0a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -1
README.md CHANGED
@@ -46,7 +46,7 @@ The model was trained on approximately **6000 hours of high-quality Vietnamese s
46
  ---
47
 
48
  ## πŸ† Achievements
49
- This model architecture **won First Place** in the **Vietnamese Language Speech Processing (VLSP)** competition **2025**.
50
  Comprehensive details about **training data**, **optimization strategies**, **architecture improvements**, and **evaluation methodologies** are available in the paper below:
51
 
52
  πŸ‘‰ [Read the full paper on Overleaf](https://www.overleaf.com/read/wjntrgchhbgv#48aa25)
@@ -62,6 +62,11 @@ Comprehensive details about **training data**, **optimization strategies**, **ar
62
 
63
  ---
64
 
 
 
 
 
 
65
  ## πŸ’¬ Summary
66
  The **ZipFormer-30M-RNNT-6000h** model demonstrates that a lightweight architecture can still achieve state-of-the-art accuracy for Vietnamese ASR.
67
  It is designed for **fast deployment on CPU-based systems**, making it ideal for **real-time speech recognition**, **callbots**, and **embedded speech interfaces**.
 
46
  ---
47
 
48
  ## πŸ† Achievements
49
+ By training this model architecture on 4,000 hours of data, I **won First Place** in the **Vietnamese Language Speech Processing (VLSP)** competition **2025**.
50
  Comprehensive details about **training data**, **optimization strategies**, **architecture improvements**, and **evaluation methodologies** are available in the paper below:
51
 
52
  πŸ‘‰ [Read the full paper on Overleaf](https://www.overleaf.com/read/wjntrgchhbgv#48aa25)
 
62
 
63
  ---
64
 
65
+ ## πŸš€ Online Demo
66
+
67
+ You can try the model directly here:
68
+ πŸ‘‰ https://huggingface.co/spaces/hynt/k2-automatic-speech-recognition-demo
69
+
70
  ## πŸ’¬ Summary
71
  The **ZipFormer-30M-RNNT-6000h** model demonstrates that a lightweight architecture can still achieve state-of-the-art accuracy for Vietnamese ASR.
72
  It is designed for **fast deployment on CPU-based systems**, making it ideal for **real-time speech recognition**, **callbots**, and **embedded speech interfaces**.