Spaces:

WSYBYT
/

ybtts

Running

App Files Files Community

Major Update: Kokoro-82M with 54 Premium Voices

#8

by masbudjj - opened Oct 22, 2025

base: refs/heads/main

←

from: refs/pr/8

Discussion Files changed

WS YB YT org Oct 22, 2025

🎙️ Kokoro-82M Implementation - 54 Premium Voices

Major Changes:

✅ Replace SpeechT5 with Kokoro-82M
✅ 54 premium voices (American & British)
✅ StyleTTS 2 architecture (82M parameters)
✅ Gradio backend for better UX
✅ HF Inference API integration

Voice Categories:

🇺🇸 American Female (11 voices)
🇺🇸 American Male (8 voices)
🇬🇧 British Female (4 voices)
🇬🇧 British Male (4 voices)

Technology:

Model: hexgrad/Kokoro-82M
Architecture: StyleTTS 2 + ISTFTNet
Backend: Gradio 4.x
API: Hugging Face Inference

Features:

54 unique voice characters
Speed control (0.5x - 2x)
High-quality audio output
Natural prosody & emotion
Fast generation (~2-5s)

Major Update: Kokoro-82M with 54 Premium Voices1ee51d6b

masbudjj changed pull request status to merged Oct 22, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment