Generate detailed images from text prompts
Generate answers to questions using text models
Translate a video's audio into another language