GutenOCR: A Grounded Vision-Language Front-End for Documents Paper • 2601.14490 • Published 8 days ago • 35 • 7
CommonForms: A Large, Diverse Dataset for Form Field Detection Paper • 2509.16506 • Published Sep 20, 2025 • 22
Large Language Models for Page Stream Segmentation Paper • 2408.11981 • Published Aug 21, 2024 • 3
OCR Collection Data and models for optical character recognition • 6 items • Updated 6 days ago • 3