Xtts
Generate audio from text with custom speakers
Audio-based Lip Sync for Talking Head Video Editing
Generate animated videos from images and prompts
Dub videos in another language with cloned voice
Get an LLM Assistant personality idea from an image
Motion Controlled Video Generation
Generate audio‑driven videos from an image and pose data
Quickly edit the expression of a face