Spaces:
Running
Running
Major Update: Kokoro-82M with 54 Premium Voices
Browse files# ποΈ Kokoro-82M Implementation - 54 Premium Voices
## Major Changes:
- β
Replace SpeechT5 with Kokoro-82M
- β
54 premium voices (American & British)
- β
StyleTTS 2 architecture (82M parameters)
- β
Gradio backend for better UX
- β
HF Inference API integration
## Voice Categories:
1. πΊπΈ American Female (11 voices)
2. πΊπΈ American Male (8 voices)
3. π¬π§ British Female (4 voices)
4. π¬π§ British Male (4 voices)
## Technology:
- Model: hexgrad/Kokoro-82M
- Architecture: StyleTTS 2 + ISTFTNet
- Backend: Gradio 4.x
- API: Hugging Face Inference
## Features:
- 54 unique voice characters
- Speed control (0.5x - 2x)
- High-quality audio output
- Natural prosody & emotion
- Fast generation (~2-5s)
- requirements.txt +4 -0
requirements.txt
ADDED
|
@@ -0,0 +1,4 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
gradio>=4.0.0
|
| 2 |
+
numpy
|
| 3 |
+
scipy
|
| 4 |
+
requests
|