Pixel-Labs/threadcast-neural-models

@@ -7,6 +7,7 @@ tags:
   - tts
   - kokoro
   - piper
   - vits
   - styletts2
   - onnx
@@ -31,7 +32,7 @@ pipeline_tag: text-to-speech
 ---
-Self-hosted mirror of the on-device neural TTS models used by **[ThreadCast](https://threadcast.app)** across both shipping platforms — the Chrome extension and the Android app. Two engines, two packagings, one source of truth.
 This repository exists so each platform can ship a stable, version-pinned set of model weights without depending on the availability or rate limits of upstream Hugging Face repos at runtime.
@@ -48,7 +49,9 @@ threadcast-neural-models/
 │   └── neural-82m/                     Kokoro StyleTTS2 — 1 model + 11 voice embeddings
 │
 └── mobile-android/                     ← Android app — production zips fetched at runtime
-    └── v1/                             7 zips: 1 shared + 5 per-voice Piper + 1 Kokoro
 ```
 | Subtree | Format | Consumed by | Sub-README |
@@ -64,8 +67,9 @@ The two subtrees parallel each other on purpose — same engine families (Piper
 | Engine | Architecture | Params | Per-voice cost | Quality tier |
 |---|---|---|---|---|
-| **`neural-28m`** | Piper VITS | ~28 M | One ONNX file per voice (~63 MB) | Standard — fast, CPU-friendly, single-thread WASM real-time on a laptop |
-| **`neural-82m`** | Kokoro StyleTTS2 | ~82 M | Single model + 256-dim style vectors per voice (one ~325 MB file serves all) | Premium — more natural prosody, GPU-accelerated on Chrome (WebGPU); CPU-only on Android (perf-gated) |
 ---
@@ -74,6 +78,7 @@ The two subtrees parallel each other on purpose — same engine families (Piper
 This repository **mirrors** upstream models for distribution stability. Each upstream project retains its own license:
 - **Kokoro-82M:** Apache-2.0 ([upstream model card](https://huggingface.co/hexgrad/Kokoro-82M))
 - **Piper voices:** MIT, with individual voice attributions in each `.onnx.json`
 - **transformers.js, onnxruntime-web, onnxruntime-android:** Apache-2.0
 - **sherpa-onnx:** Apache-2.0

   - tts
   - kokoro
   - piper
+  - kittentts
   - vits
   - styletts2
   - onnx
 ---
+Self-hosted mirror of the on-device neural TTS models used by **[ThreadCast](https://threadcast.app)** across both shipping platforms — the Chrome extension and the Android app. Three engine families on Android (Piper VITS, KittenTTS-nano VITS, Kokoro StyleTTS2), two on the extension, one source of truth.
 This repository exists so each platform can ship a stable, version-pinned set of model weights without depending on the availability or rate limits of upstream Hugging Face repos at runtime.
 │   └── neural-82m/                     Kokoro StyleTTS2 — 1 model + 11 voice embeddings
 │
 └── mobile-android/                     ← Android app — production zips fetched at runtime
+    └── v1/                             8 zips: 1 shared espeak + 5 per-voice Piper
+                                        + 1 KittenTTS-nano ("Local AI Plus")
+                                        + 1 Kokoro ("Local AI Studio")
 ```
 | Subtree | Format | Consumed by | Sub-README |
 | Engine | Architecture | Params | Per-voice cost | Quality tier |
 |---|---|---|---|---|
+| **`neural-28m`** | Piper VITS | ~28 M | One ONNX file per voice (~63 MB) | Standard — fast, CPU-friendly, single-thread WASM real-time on a laptop. Surfaced on Android as **Local AI Lite**, on the extension as **AI Neural CPU**. |
+| **`neural-15m`** | KittenTTS-nano VITS | ~15 M | Single fp16 model + 8 speaker embeddings (one ~26 MB file serves all) | Sweet spot — 8 voices with style-vector switching in ~26 MB total, versus ~63 MB per Piper voice or ~325 MB for Kokoro. Android-only, surfaced as **Local AI Plus**. |
+| **`neural-82m`** | Kokoro StyleTTS2 | ~82 M | Single model + 256-dim style vectors per voice (one ~325 MB file serves all) | Premium — more natural prosody, GPU-accelerated on Chrome (WebGPU); CPU-only on Android (perf-gated). Surfaced on Android as **Local AI Studio**, on the extension as **AI Neural GPU**. |
 ---
 This repository **mirrors** upstream models for distribution stability. Each upstream project retains its own license:
 - **Kokoro-82M:** Apache-2.0 ([upstream model card](https://huggingface.co/hexgrad/Kokoro-82M))
+- **KittenTTS-nano (v0.1):** Apache-2.0 ([upstream model card](https://huggingface.co/KittenML/kitten-tts-nano-0.1))
 - **Piper voices:** MIT, with individual voice attributions in each `.onnx.json`
 - **transformers.js, onnxruntime-web, onnxruntime-android:** Apache-2.0
 - **sherpa-onnx:** Apache-2.0
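
Reviewer note: the "Per-voice cost" column in the updated engine table can be checked with rough arithmetic. The sketch below is illustrative only and is not part of the repository. The `Engine` class and its field names are hypothetical. The sizes are the approximate figures quoted in the table. The voice counts are assumptions taken from the tree listing: 5 Piper voices in `mobile-android/v1`, 8 KittenTTS-nano speaker embeddings, and the 11 Kokoro voice embeddings listed for the extension subtree. Zip compression, the shared espeak archive, and the size of the embedding files themselves are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Engine:
    """Hypothetical record mirroring one row of the engine table."""
    name: str
    file_mb: int          # approximate size of one model file, in MB
    voices: int           # number of voices assumed for this estimate
    per_voice_file: bool  # True when every voice ships its own model file

    def total_mb(self) -> int:
        # Per-voice engines pay the file size once per voice;
        # shared-model engines pay it once in total.
        return self.file_mb * self.voices if self.per_voice_file else self.file_mb

    def mb_per_voice(self) -> float:
        return self.total_mb() / self.voices


ENGINES = [
    Engine("neural-28m", file_mb=63, voices=5, per_voice_file=True),
    Engine("neural-15m", file_mb=26, voices=8, per_voice_file=False),
    Engine("neural-82m", file_mb=325, voices=11, per_voice_file=False),
]


def summarize(engines):
    """Return one human-readable line per engine."""
    return [
        f"{e.name}: ~{e.total_mb()} MB total, ~{e.mb_per_voice():.1f} MB per voice"
        for e in engines
    ]


if __name__ == "__main__":
    for line in summarize(ENGINES):
        print(line)
```

Under these assumptions the per-voice Piper packaging grows linearly with the number of voices, while the two shared-model engines stay flat, which is the trade-off the table's "Per-voice cost" column describes.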