Generate speech from text using a reference voice
Engage in multimedia chat with LLMs and ML models
Generate realistic audio from text