DictoEaso commited on
Commit
6f4a2ab
Β·
verified Β·
1 Parent(s): 6fdab30

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +162 -16
README.md CHANGED
@@ -1,24 +1,170 @@
1
  ---
 
 
2
  tags:
3
- - text-to-speech
4
- - tts
5
- - onnx
6
- - kokoro
7
- - piper
 
8
  ---
9
 
10
  # DictoEaso Offline TTS Models
11
 
12
- This repo contains offline TTS assets for DictoEaso apps.
13
 
14
- ## Contents & Licenses
15
- - Kokoro (EN/ES/ZH/JA): onnx-community/Kokoro-82M-v1.0-ONNX (fp16 ONNX + selected voices). License: see upstream.
16
- - Piper (FR): model.onnx + tokens.txt. License: see upstream.
17
- - Piper (KO): model.onnx + tokens.txt. License: see upstream.
18
 
19
- ## Layout
20
- - kokoro/core_v1_fp16/onnx/model_fp16.onnx
21
- - kokoro/core_v1_fp16/voices/*.bin
22
- - piper/fr/model.onnx + tokens.txt
23
- - piper/ko/model.onnx + tokens.txt
24
- - voice_map.json
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: other
3
+ pipeline_tag: text-to-speech
4
  tags:
5
+ - text-to-speech
6
+ - tts
7
+ - onnx
8
+ - kokoro
9
+ - piper
10
+ - vits
11
  ---
12
 
13
  # DictoEaso Offline TTS Models
14
 
15
+ This repository contains **offline TTS assets** used by DictoEaso mobile/tablet apps.
16
 
17
+ It is an **asset bundle** (models + tokenizers/voices + mapping), not a full TTS runtime. Your app/runtime decides how to load and run each engine (Kokoro / Piper / VITS).
 
 
 
18
 
19
+ ---
20
+
21
+ ## What’s inside (Engines & Languages)
22
+
23
+ ### 1) Kokoro (ONNX FP16) β€” EN / ES / ZH / JA
24
+ - **One FP16 ONNX model** shared by multiple languages.
25
+ - We ship **a minimal set of voice binaries** (female + male per language) to keep the package small.
26
+
27
+ Included voices (selected):
28
+ - English: `af_bella` (female), `am_adam` (male)
29
+ - Spanish: `ef_dora` (female), `em_alex` (male)
30
+ - Chinese: `zf_xiaoxiao` (female), `zm_yunxi` (male)
31
+ - Japanese: `jf_alpha` (female), `jm_kumo` (male)
32
+
33
+ ### 2) Piper β€” FR
34
+ - Piper-style ONNX model + `tokens.txt` for French.
35
+
36
+ ### 3) Korean β€” VITS / Mimic3 (KO)
37
+ - Korean is provided as a **VITS/Mimic3** voice package:
38
+ - `model.onnx`
39
+ - `tokens.txt`
40
+ - `espeak-ng-data/` (phonemizer data required by many Mimic3/VITS pipelines)
41
+
42
+ > Note: KO is **not** the same engine as Piper or Kokoro. Treat it as a separate runtime path.
43
+
44
+ ---
45
+
46
+ ## Repository layout
47
+
48
+ .
49
+ β”œβ”€ kokoro/
50
+ β”‚ └─ core_v1_fp16/
51
+ β”‚ β”œβ”€ onnx/
52
+ β”‚ β”‚ └─ model_fp16.onnx
53
+ β”‚ β”œβ”€ voices/
54
+ β”‚ β”‚ β”œβ”€ af_bella.bin
55
+ β”‚ β”‚ β”œβ”€ am_adam.bin
56
+ β”‚ β”‚ β”œβ”€ ef_dora.bin
57
+ β”‚ β”‚ β”œβ”€ em_alex.bin
58
+ β”‚ β”‚ β”œβ”€ zf_xiaoxiao.bin
59
+ β”‚ β”‚ β”œβ”€ zm_yunxi.bin
60
+ β”‚ β”‚ β”œβ”€ jf_alpha.bin
61
+ β”‚ β”‚ └─ jm_kumo.bin
62
+ β”‚ β”œβ”€ config.json
63
+ β”‚ β”œβ”€ tokenizer.json
64
+ β”‚ └─ tokenizer_config.json
65
+ β”‚
66
+ β”œβ”€ piper/
67
+ β”‚ └─ fr/
68
+ β”‚ β”œβ”€ model.onnx
69
+ β”‚ └─ tokens.txt
70
+ β”‚
71
+ β”œβ”€ vits/
72
+ β”‚ └─ ko_kss_low/
73
+ β”‚ β”œβ”€ model.onnx
74
+ β”‚ β”œβ”€ tokens.txt
75
+ β”‚ └─ espeak-ng-data/
76
+ β”‚ └─ (many files)
77
+ β”‚
78
+ └─ voice_map.json
79
+
80
+ pgsql
81
+ Copy code
82
+
83
+ ---
84
+
85
+ ## voice_map.json
86
+
87
+ `voice_map.json` provides a stable mapping from language β†’ engine β†’ paths + default voices.
88
+
89
+ Example shape (your app should follow the real file in this repo):
90
+
91
+ ```json
92
+ {
93
+ "kokoro": {
94
+ "core": "kokoro/core_v1_fp16/onnx/model_fp16.onnx",
95
+ "voicesDir": "kokoro/core_v1_fp16/voices",
96
+ "defaults": {
97
+ "en": {"female":"af_bella","male":"am_adam"},
98
+ "es": {"female":"ef_dora","male":"em_alex"},
99
+ "zh": {"female":"zf_xiaoxiao","male":"zm_yunxi"},
100
+ "ja": {"female":"jf_alpha","male":"jm_kumo"}
101
+ }
102
+ },
103
+ "piper": {
104
+ "fr": {"model":"piper/fr/model.onnx","tokens":"piper/fr/tokens.txt"}
105
+ },
106
+ "vits": {
107
+ "ko": {
108
+ "model":"vits/ko_kss_low/model.onnx",
109
+ "tokens":"vits/ko_kss_low/tokens.txt",
110
+ "espeakData":"vits/ko_kss_low/espeak-ng-data"
111
+ }
112
+ }
113
+ }
114
+ How to download (recommended)
115
+ Using Hugging Face CLI (hf):
116
+
117
+ bash
118
+ Copy code
119
+ hf download DictoEaso-Official/offline-tts-models --local-dir ./offline-tts-models --local-dir-use-symlinks False
120
+ If you only want Kokoro core:
121
+
122
+ bash
123
+ Copy code
124
+ hf download DictoEaso-Official/offline-tts-models \
125
+ --include "kokoro/**" "voice_map.json" \
126
+ --local-dir ./offline-tts-models --local-dir-use-symlinks False
127
+ Notes for app/runtime integration
128
+ Kokoro
129
+ Load ONNX: kokoro/core_v1_fp16/onnx/model_fp16.onnx
130
+
131
+ Load tokenizer + config from the same folder
132
+
133
+ Pick voice by .bin name (without extension), e.g. af_bella, jm_kumo
134
+
135
+ Piper (FR)
136
+ Load model: piper/fr/model.onnx
137
+
138
+ Load tokens: piper/fr/tokens.txt
139
+
140
+ VITS/Mimic3 (KO)
141
+ Load model: vits/ko_kss_low/model.onnx
142
+
143
+ Load tokens: vits/ko_kss_low/tokens.txt
144
+
145
+ Many pipelines require espeak-ng-data/ to phonemize β†’ ensure your KO runtime knows how to use it.
146
+
147
+ Licenses / Third-party notices (IMPORTANT)
148
+ This repository aggregates multiple upstream projects and does not have a single unified license for all files.
149
+
150
+ The repo-level metadata uses license: other to reflect that the contents are mixed.
151
+
152
+ Each subcomponent has its own upstream license and attribution requirements.
153
+
154
+ Upstream sources (you must verify before redistribution)
155
+ Kokoro ONNX: derived from upstream onnx-community/Kokoro-82M-v1.0-ONNX (model + tokenizer/config + voices).
156
+
157
+ Piper FR: Piper-compatible voice (ONNX + tokens). License depends on the specific voice origin.
158
+
159
+ VITS/Mimic3 KO (kss_low): Mimic3/VITS voice package. License and attribution requirements are defined by its upstream distribution (and may differ from Kokoro/Piper).
160
+
161
+ Internal use tip: If your legal/compliance requires strict separation, consider splitting KO into a dedicated repo so licensing boundaries remain crystal-clear.
162
+
163
+ Change log policy
164
+ kokoro/core_v1_fp16/*: updated only when upstream Kokoro ONNX or selected voices change
165
+
166
+ piper/fr/*: updated per voice refresh
167
+
168
+ vits/ko_kss_low/*: updated per upstream KO voice refresh
169
+
170
+ voice_map.json: kept stable; changes are backward compatible when possible