"""pdfsys-parser-pipeline — OCR-pipeline backend.

Handles the "needs-ocr AND no complex content" branch. Reads the cached
LayoutDocument produced by pdfsys-layout-analyser, renders each region via
PyMuPDF, runs line-level OCR (RapidOCR / PaddleOCR-classic, selectable via
config), and assembles the Markdown output. CPU-friendly.
"""

__version__ = "0.0.1"
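
# The flow described in the module docstring (render each region, run
# line-level OCR, assemble Markdown) could be sketched roughly as below.
# This is a minimal illustration, not the package's actual implementation:
# `Region`, `assemble_markdown`, and the `render` / `ocr_lines` callables are
# hypothetical names, and the real LayoutDocument schema is not shown in this
# file. Rendering (PyMuPDF) and OCR (RapidOCR / PaddleOCR-classic) are
# injected as plain callables so that the sketch assumes nothing about those
# libraries' APIs.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class Region:
    """Hypothetical stand-in for one region of a cached LayoutDocument."""
    page: int
    bbox: Tuple[float, float, float, float]  # (x0, y0, x1, y1), assumed layout
    kind: str  # assumed labels: "title" or "text"


# Assumed backend interfaces: a renderer turns a region into image bytes,
# and an OCR engine turns image bytes into a list of recognised lines.
RenderFn = Callable[[Region], bytes]
OcrFn = Callable[[bytes], List[str]]


def assemble_markdown(regions: Iterable[Region],
                      render: RenderFn,
                      ocr_lines: OcrFn) -> str:
    """Render each region, OCR it line by line, and join the results.

    Regions are assumed to arrive in reading order already.
    """
    blocks: List[str] = []
    for region in regions:
        image = render(region)
        # Drop empty or whitespace-only lines returned by the OCR engine.
        lines = [ln.strip() for ln in ocr_lines(image) if ln.strip()]
        if not lines:
            continue
        text = " ".join(lines)
        # Titles become level-1 headings; everything else is a paragraph.
        blocks.append(f"# {text}" if region.kind == "title" else text)
    return "\n\n".join(blocks)
```

# Passing the renderer and OCR engine in as callables mirrors the
# "selectable via config" idea: swapping OCR backends would only mean
# passing a different `ocr_lines` function.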