Layout Analysis Module of PaddleOCR-VL-1.5

repo HuggingFace ModelScope HuggingFace ModelScope Discord X License

🔥 Official Website | 📝 Technical Report

Introduction

This is the model weights for PP-DocLayoutv3 in safetensors format. Get PaddlePaddle weights at PP-DocLayoutV3

PP-DocLayoutV3 is specifically engineered to handle non-planar document images. It can directly predict multi-point bounding boxes for layout elements—as opposed to standard two-point boxes—and determine logical reading orders for skewed and curved surfaces within a single forward pass, significantly reducing cascading errors. This model is an essential component of PaddleOCR-VL-1.5, providing crucial layout analysis for the high-precision parsing of various real-world documents in PaddleOCR-VL.

Model Architecture

Visualization

Light Variation

Skewing

Screen-photo

Curving

Citation

If you find PP-DocLayoutV3 helpful, feel free to give us a star and citation.

comming soon
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including PaddlePaddle/PP-DocLayoutV3_safetensors

Paper for PaddlePaddle/PP-DocLayoutV3_safetensors