Commit f3f9dd6 (verified) by valtecAI-team · 1 parent: dbba096

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +54 -16
README.md CHANGED
@@ -4,37 +4,75 @@ emoji: 🎙️
  colorFrom: blue
  colorTo: purple
  sdk: gradio
- sdk_version: 5.38.0
+ sdk_version: 4.0.0
  app_file: app.py
  pinned: false
- license: cc-by-nc-4.0
+ license: mit
  ---
 
- # 🎙️ Valtec Vietnamese TTS
+ # Valtec Vietnamese TTS - 5-Speaker System
 
- Vietnamese Text-to-Speech system with high-quality voice synthesis.
+ 🎙️ **Vietnamese Text-to-Speech with Regional Accents**
+
+ Vietnamese TTS system supporting 5 different voices with Northern and Southern regional accents.
 
  ## Features
 
- - 🇻🇳 **Vietnamese TTS** with natural prosody and intonation
- - 👥 **Multi-speaker**: Male and Female voices
- - **Fast inference** on both CPU and GPU
- - 🎛️ **Adjustable settings**: Speed, noise scale, duration control
+ - **5 Vietnamese Voices**: NF, SF, NM1, SM, NM2 (Northern/Southern, Male/Female)
+ - **Accurate G2P**: Vietnamese grapheme-to-phoneme conversion
+ - **High Quality**: VITS-based neural TTS
+ - **Fast Inference**: GPU-accelerated synthesis
+ - ✅ **Browser Demo**: ONNX Runtime Web version available
+
+ ## Available Speakers
+
+ | Code | Region | Gender | Description |
+ |------|--------|--------|-------------|
+ | **NF** | Northern (Bắc) | Female | Formal, clear pronunciation |
+ | **SF** | Southern (Nam) | Female | Friendly, casual tone |
+ | **NM1** | Northern (Bắc) | Male | Professional voice |
+ | **SM** | Southern (Nam) | Male | Conversational style |
+ | **NM2** | Northern (Bắc) | Male | Authoritative tone |
 
  ## Usage
 
- 1. Enter Vietnamese text in the input box
- 2. Select a speaker (male/female)
- 3. Click "Tạo giọng nói" to generate audio
- 4. Listen and download the result
+ ### Gradio Interface (This Space)
+
+ 1. Enter Vietnamese text
+ 2. Select a voice
+ 3. Adjust synthesis parameters
+ 4. Click "Generate Speech"
+
+ ### Web Demo (Static HTML)
+
+ Switch to the "Web Demo" tab for browser-based inference using ONNX Runtime Web.
+
+ ### Python API
+
+ ```python
+ from valtec_tts import VietnameseTTS
+
+ tts = VietnameseTTS()
+ audio = tts.synthesize("Xin chào Việt Nam", speaker="NF")
+ ```
 
  ## Model Info
 
- - **Architecture**: VITS-based
- - **Parameters**: ~58M
+ - **Architecture**: VITS (Conditional Variational Autoencoder)
+ - **Speakers**: 5 (Northern/Southern Vietnamese accents)
  - **Sample Rate**: 24kHz
- - **Speakers**: Male, Female
+ - **Model Size**: ~220MB (PyTorch), ~165MB (ONNX)
+
+ ## Links
+
+ - 🏠 [GitHub Repository](https://github.com/valtecAI-team/valtec-tts)
+ - 📦 [ONNX Models](https://huggingface.co/valtecAI-team/valtec-tts-onnx)
+ - 🎯 [Edge Deployment Guide](https://github.com/valtecAI-team/valtec-tts/tree/main/deployments/edge)
 
  ## License
 
- This project is licensed under [CC BY-NC 4.0](https://creativecommons.org/licenses/by-nc/4.0/).
+ MIT License - Free for commercial and non-commercial use.
+
+ ---
+
+ **Powered by Valtec AI Team** | Built with Gradio & ONNX Runtime
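
The Python API snippet added in this commit stops at `audio = tts.synthesize(...)` and does not say what `audio` contains. As a minimal sketch, assuming it is a sequence of float samples in [-1.0, 1.0] at the README's 24 kHz sample rate (the README confirms neither the type nor the range), the five speaker codes can be validated and the samples written as 16-bit mono WAV with the standard library alone. `SPEAKERS`, `check_speaker`, and `to_wav_bytes` are hypothetical helper names, not part of `valtec_tts`, and a generated sine tone stands in for real model output.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 24_000  # "Sample Rate: 24kHz" from the README's Model Info section

# Speaker codes copied from the "Available Speakers" table in the README.
SPEAKERS = {
    "NF": ("Northern", "Female"),
    "SF": ("Southern", "Female"),
    "NM1": ("Northern", "Male"),
    "SM": ("Southern", "Male"),
    "NM2": ("Northern", "Male"),
}


def check_speaker(code: str) -> str:
    """Return the code unchanged if it is one of the five listed voices."""
    if code not in SPEAKERS:
        raise ValueError(
            f"unknown speaker {code!r}; expected one of {sorted(SPEAKERS)}"
        )
    return code


def to_wav_bytes(samples, sample_rate: int = SAMPLE_RATE) -> bytes:
    """Encode float samples as 16-bit mono PCM WAV bytes.

    Assumption: samples are floats in [-1.0, 1.0]; values outside are clipped.
    """
    pcm = bytearray()
    for s in samples:
        clipped = max(-1.0, min(1.0, float(s)))
        pcm += struct.pack("<h", int(round(clipped * 32767)))
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wf:
        wf.setnchannels(1)   # mono
        wf.setsampwidth(2)   # 2 bytes per sample = 16-bit
        wf.setframerate(sample_rate)
        wf.writeframes(bytes(pcm))
    return buf.getvalue()


# Stand-in for synthesized audio: 0.5 s of a 440 Hz tone (not model output).
tone = [
    0.3 * math.sin(2 * math.pi * 440 * n / SAMPLE_RATE)
    for n in range(SAMPLE_RATE // 2)
]
speaker = check_speaker("NF")
wav_bytes = to_wav_bytes(tone)
```

With real output, `to_wav_bytes(audio)` would take the place of the tone, provided the assumption about the sample format holds; if `synthesize` returns something else (a file path, a tensor, or integer PCM), the conversion step would need to change accordingly.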