Introduction
This is the PP-Doclayoutv3 model weights for the PaddlePaddle framework. Get safetensors weights at PP-DocLayoutV3_safetensors
PP-DocLayoutV3 is specifically engineered to handle non-planar document images. It can directly predict multi-point bounding boxes for layout elements—as opposed to standard two-point boxes—and determine logical reading orders for skewed and curved surfaces within a single forward pass, significantly reducing cascading errors. This model is an essential component of PaddleOCR-VL-1.5, providing crucial layout analysis for the high-precision parsing of various real-world documents in PaddleOCR-VL.
Model Architecture
Visualization
Light Variation
Skewing
Screen-photo
Curving
Citation
If you find PP-DocLayoutV3 helpful, feel free to give us a star and citation.
comming soon
- Downloads last month
- -
Collection including PaddlePaddle/PP-DocLayoutV3
Collection
Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
•
4 items
•
Updated
•
1