Analyze images to generate detailed prompts
Generate a depth map from an image
Image to 3D with DPT + 3D Point Cloud
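
The last item pairs a depth estimate with a 3D point cloud. As a rough illustration of that step, the sketch below back-projects a depth map into 3D points using a pinhole camera model. It assumes the depth map is already available (for example, from a DPT model) and that its values are distances along the camera axis. The function name `depth_to_point_cloud` and the intrinsics `fx`, `fy`, `cx`, `cy` are hypothetical choices for this example, not taken from any particular tool.

```python
def depth_to_point_cloud(depth, fx, fy, cx, cy):
    """Back-project a depth map (a list of rows) into a list of (x, y, z) points.

    Assumes a pinhole camera: fx, fy are focal lengths in pixels and
    (cx, cy) is the principal point. These are illustrative parameters.
    """
    points = []
    for v, row in enumerate(depth):        # v: pixel row index
        for u, z in enumerate(row):        # u: pixel column index
            if z <= 0:                     # skip pixels with no valid depth
                continue
            x = (u - cx) * z / fx          # horizontal offset scaled by depth
            y = (v - cy) * z / fy          # vertical offset scaled by depth
            points.append((x, y, z))
    return points


# Tiny 2x2 depth map with one invalid (zero) pixel.
depth = [[1.0, 2.0],
         [0.0, 4.0]]
cloud = depth_to_point_cloud(depth, fx=1.0, fy=1.0, cx=0.5, cy=0.5)
print(cloud)
```

If the estimator returns relative or inverse depth rather than metric depth, the values would need to be rescaled before this step, and the resulting cloud would only be correct up to that scale.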