# pdfsys-parser-pipeline Region-level OCR backend for scanned PDFs with simple layouts. **Stub only — not yet implemented.** Will take a `LayoutDocument` from the cache, crop each region at the configured DPI, and run OCR (RapidOCR / PaddleOCR-classic) on each crop individually. Produces an `ExtractedDoc` following the same schema as parser-mupdf.