Detect UI elements in screenshots and return their details as JSON
Generate visual attention heatmaps from uploaded images