Upload README.md with huggingface_hub
README.md
@@ -4,37 +4,75 @@ emoji: 🎙️
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 4.0.0
app_file: app.py
pinned: false
license: mit
---

# Valtec Vietnamese TTS - 5-Speaker System

🎙️ **Vietnamese Text-to-Speech with Regional Accents**

Vietnamese TTS system supporting 5 different voices with Northern and Southern regional accents.

## Features

- ✅ **5 Vietnamese Voices**: NF, SF, NM1, SM, NM2 (Northern/Southern, Male/Female)
- ✅ **Accurate G2P**: Vietnamese grapheme-to-phoneme conversion
- ✅ **High Quality**: VITS-based neural TTS
- ✅ **Fast Inference**: GPU-accelerated synthesis
- ✅ **Browser Demo**: ONNX Runtime Web version available

## Available Speakers

| Code | Region | Gender | Description |
|------|--------|--------|-------------|
| **NF** | Northern (Bắc) | Female | Formal, clear pronunciation |
| **SF** | Southern (Nam) | Female | Friendly, casual tone |
| **NM1** | Northern (Bắc) | Male | Professional voice |
| **SM** | Southern (Nam) | Male | Conversational style |
| **NM2** | Northern (Bắc) | Male | Authoritative tone |

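
For scripts that choose a voice programmatically, the speaker table can be mirrored as a small lookup. This is only a sketch: `SPEAKERS` and `find_speakers` are hypothetical names used for illustration and are not part of `valtec_tts`. Only the codes, regions, and genders come from the table.

```python
# Hypothetical helper, not part of valtec_tts: mirrors the speaker table.
SPEAKERS = {
    "NF": {"region": "Northern", "gender": "Female"},
    "SF": {"region": "Southern", "gender": "Female"},
    "NM1": {"region": "Northern", "gender": "Male"},
    "SM": {"region": "Southern", "gender": "Male"},
    "NM2": {"region": "Northern", "gender": "Male"},
}


def find_speakers(region=None, gender=None):
    """Return speaker codes matching the given region and/or gender."""
    return [
        code
        for code, info in SPEAKERS.items()
        if (region is None or info["region"] == region)
        and (gender is None or info["gender"] == gender)
    ]


print(find_speakers(region="Northern", gender="Male"))  # ['NM1', 'NM2']
```

The returned code can then be passed as the `speaker` argument shown in the Python API section.
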
## Usage

### Gradio Interface (This Space)

1. Enter Vietnamese text
2. Select a voice
3. Adjust synthesis parameters
4. Click "Generate Speech"

### Web Demo (Static HTML)

Switch to the "Web Demo" tab for browser-based inference using ONNX Runtime Web.

### Python API

```python
from valtec_tts import VietnameseTTS

tts = VietnameseTTS()
audio = tts.synthesize("Xin chào Việt Nam", speaker="NF")
```

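
The README does not say what `synthesize` returns, so the following is only a sketch of one way to save the result. It assumes the audio is a one-dimensional sequence of float samples in [-1.0, 1.0] at the 24 kHz rate listed under Model Info. `save_wav` is a hypothetical helper, and a generated tone stands in for the real output so the snippet runs without the package installed.

```python
import math
import struct
import wave

SAMPLE_RATE = 24_000  # 24 kHz, as listed under Model Info


def save_wav(samples, path, sample_rate=SAMPLE_RATE):
    """Write float samples in [-1.0, 1.0] as 16-bit mono PCM (assumed format)."""
    # Clip each sample, then scale it to the signed 16-bit range.
    pcm = [int(max(-1.0, min(1.0, s)) * 32767) for s in samples]
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)  # mono
        wav_file.setsampwidth(2)  # 2 bytes = 16 bits per sample
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(struct.pack(f"<{len(pcm)}h", *pcm))


# Stand-in for `audio = tts.synthesize(...)`: half a second of a 440 Hz tone.
audio = [
    0.3 * math.sin(2 * math.pi * 440 * n / SAMPLE_RATE)
    for n in range(SAMPLE_RATE // 2)
]
save_wav(audio, "output.wav")
```

If `synthesize` returns a NumPy array or a tensor instead, convert it to a flat list of floats (or adapt the packing step) before calling the helper.
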
## Model Info

- **Architecture**: VITS (Conditional Variational Autoencoder)
- **Speakers**: 5 (Northern/Southern Vietnamese accents)
- **Sample Rate**: 24 kHz
- **Model Size**: ~220 MB (PyTorch), ~165 MB (ONNX)

## Links

- 🏠 [GitHub Repository](https://github.com/valtecAI-team/valtec-tts)
- 📦 [ONNX Models](https://huggingface.co/valtecAI-team/valtec-tts-onnx)
- 🎯 [Edge Deployment Guide](https://github.com/valtecAI-team/valtec-tts/tree/main/deployments/edge)

## License

MIT License - Free for commercial and non-commercial use.

---

**Powered by Valtec AI Team** | Built with Gradio & ONNX Runtime