Highlight elements in GUI images based on instructions
Generate speech from text using a reference voice