Text-to-Speech
ONNX
KittenTTS
English
tts
kokoro
piper
melotts
vits
vits2
styletts2
sherpa-onnx
on-device
threadcast
Instructions to use Pixel-Labs/threadcast-neural-models with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- KittenTTS
How to use Pixel-Labs/threadcast-neural-models with KittenTTS:
from kittentts import KittenTTS m = KittenTTS("Pixel-Labs/threadcast-neural-models") audio = m.generate("This high quality TTS model works without a GPU") # Save the audio import soundfile as sf sf.write('output.wav', audio, 24000) - Notebooks
- Google Colab
- Kaggle
Upload README.md
Browse files
README.md
CHANGED
|
@@ -7,6 +7,7 @@ tags:
|
|
| 7 |
- tts
|
| 8 |
- kokoro
|
| 9 |
- piper
|
|
|
|
| 10 |
- vits
|
| 11 |
- styletts2
|
| 12 |
- onnx
|
|
@@ -31,7 +32,7 @@ pipeline_tag: text-to-speech
|
|
| 31 |
|
| 32 |
---
|
| 33 |
|
| 34 |
-
Self-hosted mirror of the on-device neural TTS models used by **[ThreadCast](https://threadcast.app)** across both shipping platforms β the Chrome extension and the Android app.
|
| 35 |
|
| 36 |
This repository exists so each platform can ship a stable, version-pinned set of model weights without depending on the availability or rate-limits of upstream Hugging Face repos at runtime.
|
| 37 |
|
|
@@ -48,7 +49,9 @@ threadcast-neural-models/
|
|
| 48 |
β βββ neural-82m/ Kokoro StyleTTS2 β 1 model + 11 voice embeddings
|
| 49 |
β
|
| 50 |
βββ mobile-android/ β Android app β production zips fetched at runtime
|
| 51 |
-
βββ v1/
|
|
|
|
|
|
|
| 52 |
```
|
| 53 |
|
| 54 |
| Subtree | Format | Consumed by | Sub-README |
|
|
@@ -64,8 +67,9 @@ The two subtrees parallel each other on purpose β same engine families (Piper
|
|
| 64 |
|
| 65 |
| Engine | Architecture | Params | Per-voice cost | Quality tier |
|
| 66 |
|---|---|---|---|---|
|
| 67 |
-
| **`neural-28m`** | Piper VITS | ~28 M | One ONNX file per voice (~63 MB) | Standard β fast, CPU-friendly, single-thread WASM real-time on a laptop |
|
| 68 |
-
| **`neural-
|
|
|
|
| 69 |
|
| 70 |
---
|
| 71 |
|
|
@@ -74,6 +78,7 @@ The two subtrees parallel each other on purpose β same engine families (Piper
|
|
| 74 |
This repository **mirrors** upstream models for distribution stability. Each upstream project retains its own license:
|
| 75 |
|
| 76 |
- **Kokoro-82M:** Apache-2.0 ([upstream model card](https://huggingface.co/hexgrad/Kokoro-82M))
|
|
|
|
| 77 |
- **Piper voices:** MIT, with individual voice attributions in each `.onnx.json`
|
| 78 |
- **transformers.js, onnxruntime-web, onnxruntime-android:** Apache-2.0
|
| 79 |
- **sherpa-onnx:** Apache-2.0
|
|
|
|
| 7 |
- tts
|
| 8 |
- kokoro
|
| 9 |
- piper
|
| 10 |
+
- kittentts
|
| 11 |
- vits
|
| 12 |
- styletts2
|
| 13 |
- onnx
|
|
|
|
| 32 |
|
| 33 |
---
|
| 34 |
|
| 35 |
+
Self-hosted mirror of the on-device neural TTS models used by **[ThreadCast](https://threadcast.app)** across both shipping platforms β the Chrome extension and the Android app. Three engine families on Android (Piper VITS, KittenTTS-nano VITS, Kokoro StyleTTS2), two on the extension, one source of truth.
|
| 36 |
|
| 37 |
This repository exists so each platform can ship a stable, version-pinned set of model weights without depending on the availability or rate-limits of upstream Hugging Face repos at runtime.
|
| 38 |
|
|
|
|
| 49 |
β βββ neural-82m/ Kokoro StyleTTS2 β 1 model + 11 voice embeddings
|
| 50 |
β
|
| 51 |
βββ mobile-android/ β Android app β production zips fetched at runtime
|
| 52 |
+
βββ v1/ 8 zips: 1 shared espeak + 5 per-voice Piper
|
| 53 |
+
+ 1 KittenTTS-nano ("Local AI Plus")
|
| 54 |
+
+ 1 Kokoro ("Local AI Studio")
|
| 55 |
```
|
| 56 |
|
| 57 |
| Subtree | Format | Consumed by | Sub-README |
|
|
|
|
| 67 |
|
| 68 |
| Engine | Architecture | Params | Per-voice cost | Quality tier |
|
| 69 |
|---|---|---|---|---|
|
| 70 |
+
| **`neural-28m`** | Piper VITS | ~28 M | One ONNX file per voice (~63 MB) | Standard β fast, CPU-friendly, single-thread WASM real-time on a laptop. Surfaced on Android as **Local AI Lite**, on the extension as **AI Neural CPU**. |
|
| 71 |
+
| **`neural-15m`** | KittenTTS-nano VITS | ~15 M | Single fp16 model + 8 speaker embeddings (one ~26 MB file serves all) | Sweet spot β 8 voices with style-vector switching at a fraction of the storage cost. Android-only, surfaced as **Local AI Plus**. |
|
| 72 |
+
| **`neural-82m`** | Kokoro StyleTTS2 | ~82 M | Single model + 256-dim style vectors per voice (one ~325 MB file serves all) | Premium β more natural prosody, GPU-accelerated on Chrome (WebGPU); CPU-only on Android (perf-gated). Surfaced on Android as **Local AI Studio**, on the extension as **AI Neural GPU**. |
|
| 73 |
|
| 74 |
---
|
| 75 |
|
|
|
|
| 78 |
This repository **mirrors** upstream models for distribution stability. Each upstream project retains its own license:
|
| 79 |
|
| 80 |
- **Kokoro-82M:** Apache-2.0 ([upstream model card](https://huggingface.co/hexgrad/Kokoro-82M))
|
| 81 |
+
- **KittenTTS-nano (v0.1):** Apache-2.0 ([upstream model card](https://huggingface.co/KittenML/kitten-tts-nano-0.1))
|
| 82 |
- **Piper voices:** MIT, with individual voice attributions in each `.onnx.json`
|
| 83 |
- **transformers.js, onnxruntime-web, onnxruntime-android:** Apache-2.0
|
| 84 |
- **sherpa-onnx:** Apache-2.0
|