Engage in multimedia chat with LLMs and other ML models
Transcribe audio files into text
Generate images from text prompts