Generate captions for images
Fill in image areas using prompts and masks
Remove background from images