Convert text into speech in multiple languages
Generate speech from text and an audio prompt
Transcribe audio and generate responses based on prompts
Visualize articulatory features of a sentence
Audio Conditioned LipSync with Latent Diffusion Models