Generate depth maps from any image
Summarize videos to shorter clips
Generate depth maps for video frames