Transcribe audio files or YouTube videos into text
Identify human poses in images
Generate images from text prompts