F5-TTS
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Generate detailed prompts from any image
Apply the motion of a video on a portrait
Generate speech from text using a reference voice
Generate custom images from text prompts with Stable Diffusion 3