Generate speech from text using a reference voice
Image generator/identifier/reposer
Transform images based on text instructions