yin
docs: add project README, CONTRIBUTING guide, and per-package READMEs
b8ca6f2
# pdfsys-parser-pipeline
Region-level OCR backend for scanned PDFs with simple layouts. **Stub only — not yet implemented.**
Will take a `LayoutDocument` from the cache, crop each region at the configured DPI, and run OCR (RapidOCR / PaddleOCR-classic) on each crop individually. Produces an `ExtractedDoc` following the same schema as parser-mupdf.