Annotate and summarize video content
Run app using Pixi, install dependencies as needed
Extract text from images using OCR