h1t/TCD-SDXL-LoRA
Text-to-Image • Updated • 1.62k • 117
Transcribe audio to text in various languages
Text-to-speech (TTS) with Next-gen Kaldi
Generate voice with text or audio input
Generate high-quality speech from text using an audio prompt
Generate speech in a cloned voice from a short audio sample
Generate a talking face video from an image and audio