OmniParser: turn your LLM into a GUI agent
Nanonets / olmOCR / RolmOCR / Aya-Vision / Qwen2-VL-OCR
Analysis of data on an invoice