Generate realistic audio from text
Generate audio from text using a reference voice
Generate speech in a cloned voice from reference audio
Generate text using open source models
Describe and highlight entities in images