Segment document layouts into text, images, and tables
Convert PDFs and images to structured text and layout data