hynt
/

Zipformer-30M-RNNT-6000h

Model card Files Files and versions

hynt commited on Oct 17, 2025

Commit

e7e3117

·

verified ·

1 Parent(s): fb0bf94

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ On CPU, the model can transcribe a **12-second audio clip in just 0.4 seconds**,
 - **Language:** Vietnamese
 - **Loss Function:** RNN-Transducer (RNNT Loss)
 - **Framework:** PyTorch + k2
-- **Training strategy**: Carefully preprocess the data, apply an augmentation strategy based on the distribution of out-of-vocabulary (OOV) tokens and Refine the transcriptions using Whisper.
 - **Optimized for:** High-speed CPU inference
 ---

 - **Language:** Vietnamese
 - **Loss Function:** RNN-Transducer (RNNT Loss)
 - **Framework:** PyTorch + k2
+- **Training strategy**: Carefully preprocess the data, apply an augmentation strategy based on the distribution of out-of-vocabulary (OOV) tokens and refine the transcriptions using Whisper.
 - **Optimized for:** High-speed CPU inference
 ---