DictoEaso Offline TTS Models
This repository contains offline TTS assets used by DictoEaso mobile/tablet apps.
It is an asset bundle (models + tokenizers/voices + mapping), not a full TTS runtime. Your app/runtime decides how to load and run each engine (Kokoro / Piper / VITS).
Whatβs inside (Engines & Languages)
1) Kokoro (ONNX FP16) β EN / ES / ZH / JA
- One FP16 ONNX model shared by multiple languages.
- We ship a minimal set of voice binaries (female + male per language) to keep the package small.
Included voices (selected):
- English:
af_bella(female),am_adam(male) - Spanish:
ef_dora(female),em_alex(male) - Chinese:
zf_xiaoxiao(female),zm_yunxi(male) - Japanese:
jf_alpha(female),jm_kumo(male)
2) Piper β FR
- Piper-style ONNX model +
tokens.txtfor French.
3) Korean β VITS / Mimic3 (KO)
- Korean is provided as a VITS/Mimic3 voice package:
model.onnxtokens.txtespeak-ng-data/(phonemizer data required by many Mimic3/VITS pipelines)
Note: KO is not the same engine as Piper or Kokoro. Treat it as a separate runtime path.
Repository layout
. ββ kokoro/ β ββ core_v1_fp16/ β ββ onnx/ β β ββ model_fp16.onnx β ββ voices/ β β ββ af_bella.bin β β ββ am_adam.bin β β ββ ef_dora.bin β β ββ em_alex.bin β β ββ zf_xiaoxiao.bin β β ββ zm_yunxi.bin β β ββ jf_alpha.bin β β ββ jm_kumo.bin β ββ config.json β ββ tokenizer.json β ββ tokenizer_config.json β ββ piper/ β ββ fr/ β ββ model.onnx β ββ tokens.txt β ββ vits/ β ββ ko_kss_low/ β ββ model.onnx β ββ tokens.txt β ββ espeak-ng-data/ β ββ (many files) β ββ voice_map.json
pgsql Copy code
voice_map.json
voice_map.json provides a stable mapping from language β engine β paths + default voices.
Example shape (your app should follow the real file in this repo):
{
"kokoro": {
"core": "kokoro/core_v1_fp16/onnx/model_fp16.onnx",
"voicesDir": "kokoro/core_v1_fp16/voices",
"defaults": {
"en": {"female":"af_bella","male":"am_adam"},
"es": {"female":"ef_dora","male":"em_alex"},
"zh": {"female":"zf_xiaoxiao","male":"zm_yunxi"},
"ja": {"female":"jf_alpha","male":"jm_kumo"}
}
},
"piper": {
"fr": {"model":"piper/fr/model.onnx","tokens":"piper/fr/tokens.txt"}
},
"vits": {
"ko": {
"model":"vits/ko_kss_low/model.onnx",
"tokens":"vits/ko_kss_low/tokens.txt",
"espeakData":"vits/ko_kss_low/espeak-ng-data"
}
}
}
How to download (recommended)
Using Hugging Face CLI (hf):
bash
Copy code
hf download DictoEaso-Official/offline-tts-models --local-dir ./offline-tts-models --local-dir-use-symlinks False
If you only want Kokoro core:
bash
Copy code
hf download DictoEaso-Official/offline-tts-models \
--include "kokoro/**" "voice_map.json" \
--local-dir ./offline-tts-models --local-dir-use-symlinks False
Notes for app/runtime integration
Kokoro
Load ONNX: kokoro/core_v1_fp16/onnx/model_fp16.onnx
Load tokenizer + config from the same folder
Pick voice by .bin name (without extension), e.g. af_bella, jm_kumo
Piper (FR)
Load model: piper/fr/model.onnx
Load tokens: piper/fr/tokens.txt
VITS/Mimic3 (KO)
Load model: vits/ko_kss_low/model.onnx
Load tokens: vits/ko_kss_low/tokens.txt
Many pipelines require espeak-ng-data/ to phonemize β ensure your KO runtime knows how to use it.
Licenses / Third-party notices (IMPORTANT)
This repository aggregates multiple upstream projects and does not have a single unified license for all files.
The repo-level metadata uses license: other to reflect that the contents are mixed.
Each subcomponent has its own upstream license and attribution requirements.
Upstream sources (you must verify before redistribution)
Kokoro ONNX: derived from upstream onnx-community/Kokoro-82M-v1.0-ONNX (model + tokenizer/config + voices).
Piper FR: Piper-compatible voice (ONNX + tokens). License depends on the specific voice origin.
VITS/Mimic3 KO (kss_low): Mimic3/VITS voice package. License and attribution requirements are defined by its upstream distribution (and may differ from Kokoro/Piper).
Internal use tip: If your legal/compliance requires strict separation, consider splitting KO into a dedicated repo so licensing boundaries remain crystal-clear.
Change log policy
kokoro/core_v1_fp16/*: updated only when upstream Kokoro ONNX or selected voices change
piper/fr/*: updated per voice refresh
vits/ko_kss_low/*: updated per upstream KO voice refresh
voice_map.json: kept stable; changes are backward compatible when possible