Text to Speech
Generate natural speech in English, Hindi or Hinglish
Text Input
Detected: English
Ctrl+Enter to generate
Voice
Active Voice
Default Voice
Parameters
âļ Generate Speech
Generation History
đĩ
Your generations appear here
Voice Library
Save, manage and reuse cloned voices permanently
Add New Voice
Voice Sample (5â30 seconds, clean audio)
đ
Upload WAV or MP3
No background music â clear speech only
đž Save to Library
Saved Voices
đ
No voices saved yet â add one above
Voice Design
Describe a voice in plain words â AI generates the parameters
Describe Your Voice
Preview Text
⨠Design Voice
Example Prompts
Young Indian woman, warm and friendly
Deep mature male, calm and authoritative
Energetic teen, very expressive and fast
Professional newsreader, neutral accent
Soft spoken elderly woman, slow and gentle
Excited sports commentator, loud and fast
Dubbing
Translate and re-voice any video or audio file
â ī¸ Free CPU Warning
Dubbing on free CPU takes 10â20 minutes per minute of video.
Start with a short clip to test. Upgrade to GPU for faster processing.
Upload Media
đŦ
Upload video or audio file
MP4, MKV, AVI, MP3, WAV â max 100MB
Source Language
Auto Detect
English
Hindi
Spanish
French
German
Japanese
Chinese
Target Language
English
Hindi
Spanish
French
German
Japanese
Chinese
Dubbing Voice (optional)
Default Voice
đŦ Start Dubbing
API Reference
OpenAI-compatible endpoints â drop-in replacement
Text to Speech
POST /v1/audio/speech
{
"input": "Hello I am Cortana",
"emotion": 0.5, // 0.0 neutral â 1.0 expressive
"speed": 1.0 // 0.5x to 2.0x
}
Returns: audio/mpeg stream
Copy
Voice Cloning (on-the-fly)
POST /v1/audio/speech/clone
{
"input": "Hello I am Cortana",
"voice_b64": "base64_encoded_wav",
"emotion": 0.5
}
Returns: audio/mpeg stream
Copy
List Voices
GET /v1/voices
Returns: { "voices": [ { "id": "...", "name": "...", "lang": "..." } ] }
CURL Example
curl -X POST "https://YOUR_SPACE.hf.space/v1/audio/speech" \
-H "Authorization: Bearer YOUR_HF_TOKEN" \
-H "Content-Type: application/json" \
-d '{"input": "Hello I am Cortana", "emotion": 0.5}' \
--output speech.mp3