PaddlePaddle
/

PP-DocLayoutV3

Image Segmentation

layout_detection

Model card Files Files and versions

OCR-Format

#5

by BigTiger78 - opened Mar 5

base: refs/heads/main

←

from: refs/pr/5

Discussion Files changed

This PR is in draft mode

Files changed (1) hide show

README.md +0 -4

README.md CHANGED Viewed

@@ -47,10 +47,6 @@ This is the PP-Doclayoutv3 model weights for the PaddlePaddle framework. Get saf
 **PP-DocLayoutV3 is specifically engineered to handle non-planar document images. It can directly predict multi-point bounding boxes for layout elements—as opposed to standard two-point boxes—and determine logical reading orders for skewed and curved surfaces within a single forward pass, significantly reducing cascading errors.** This model is an essential component of PaddleOCR-VL-1.5, providing crucial layout analysis for the high-precision parsing of various real-world documents in PaddleOCR-VL.
-This work has been accepted to ECCV 2026! 🎉
 ### **Model Architecture**


47
48	PP-DocLayoutV3 is specifically engineered to handle non-planar document images. It can directly predict multi-point bounding boxes for layout elements—as opposed to standard two-point boxes—and determine logical reading orders for skewed and curved surfaces within a single forward pass, significantly reducing cascading errors. This model is an essential component of PaddleOCR-VL-1.5, providing crucial layout analysis for the high-precision parsing of various real-world documents in PaddleOCR-VL.
49




50
51	### Model Architecture
52