Major Update: Kokoro-82M with 54 Premium Voices
#8 opened by masbudjj
🎙️ Kokoro-82M Implementation - 54 Premium Voices
Major Changes:
- ✅ Replace SpeechT5 with Kokoro-82M
- ✅ 54 premium voices (American & British)
- ✅ StyleTTS 2 architecture (82M parameters)
- ✅ Gradio backend for better UX
- ✅ HF Inference API integration
Voice Categories:
- 🇺🇸 American Female (11 voices)
- 🇺🇸 American Male (8 voices)
- 🇬🇧 British Female (4 voices)
- 🇬🇧 British Male (4 voices)
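As a rough illustration of how the categories above could be built for a voice picker, here is a minimal sketch that groups voice IDs by a two-letter prefix. The prefix scheme (`af`, `am`, `bf`, `bm`), the `group_voices` helper, and the placeholder IDs are assumptions for illustration, not code taken from this PR.

```python
from collections import defaultdict

# Assumed mapping from voice-ID prefix to the categories listed above.
CATEGORY_BY_PREFIX = {
    "af": "American Female",
    "am": "American Male",
    "bf": "British Female",
    "bm": "British Male",
}


def group_voices(voice_ids):
    """Group voice IDs (e.g. 'af_example') by category, keeping input order."""
    groups = defaultdict(list)
    for voice_id in voice_ids:
        # Everything before the first underscore is treated as the prefix.
        prefix = voice_id.split("_", 1)[0]
        category = CATEGORY_BY_PREFIX.get(prefix, "Other")
        groups[category].append(voice_id)
    return dict(groups)


if __name__ == "__main__":
    # Placeholder IDs, not real voice names.
    print(group_voices(["af_one", "af_two", "bm_three"]))
```

Unknown prefixes fall into an "Other" bucket, so the picker does not drop a voice it cannot classify.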
Technology:
- Model: hexgrad/Kokoro-82M
- Architecture: StyleTTS 2 + ISTFTNet
- Backend: Gradio 4.x
- API: Hugging Face Inference
Features:
- 54 unique voice characters
- Speed control (0.5x - 2x)
- High-quality audio output
- Natural prosody & emotion
- Fast generation (~2-5s)
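The speed range above (0.5x - 2x) suggests validating user input before it reaches the model. A minimal sketch, assuming the backend takes a plain float multiplier; `clamp_speed` and the constant names are hypothetical and not part of this PR.

```python
MIN_SPEED = 0.5  # lower bound from the feature list
MAX_SPEED = 2.0  # upper bound from the feature list


def clamp_speed(speed):
    """Return the requested speed limited to the supported 0.5x - 2x range."""
    return max(MIN_SPEED, min(MAX_SPEED, float(speed)))


if __name__ == "__main__":
    for requested in (0.1, 1.25, 3):
        print(requested, "->", clamp_speed(requested))
```

Clamping, rather than rejecting, keeps the UI responsive when a value slightly outside the range is submitted. Raising an error instead would be an equally reasonable choice.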
masbudjj changed pull request status to merged