Update README.md
Browse files
README.md
CHANGED
|
@@ -15,7 +15,7 @@ On CPU, the model can transcribe a **12-second audio clip in just 0.4 seconds**,
|
|
| 15 |
- **Language:** Vietnamese
|
| 16 |
- **Loss Function:** RNN-Transducer (RNNT Loss)
|
| 17 |
- **Framework:** PyTorch + k2
|
| 18 |
-
- **Training strategy**: Carefully preprocess the data, apply an augmentation strategy based on the distribution of out-of-vocabulary (OOV) tokens and
|
| 19 |
- **Optimized for:** High-speed CPU inference
|
| 20 |
|
| 21 |
---
|
|
|
|
| 15 |
- **Language:** Vietnamese
|
| 16 |
- **Loss Function:** RNN-Transducer (RNNT Loss)
|
| 17 |
- **Framework:** PyTorch + k2
|
| 18 |
+
- **Training strategy**: Carefully preprocess the data, apply an augmentation strategy based on the distribution of out-of-vocabulary (OOV) tokens and refine the transcriptions using Whisper.
|
| 19 |
- **Optimized for:** High-speed CPU inference
|
| 20 |
|
| 21 |
---
|