Soul-AILab/SoulX-Podcast-1.7B-dialect
Text-to-Speech • Updated • 78 • 25
Generate personalized images preserving your face identity
Replace objects in images using prompts or reference images
Generate speech in a cloned voice from a short audio sample
Generate music from text descriptions and optional melodies
Transcribe or translate audio and YouTube videos to text
Transcribe audio files to text instantly