---
license: apache-2.0
pipeline_tag: image-segmentation
tags:
- PaddleOCR
- PaddlePaddle
- image-segmentation
- ocr
- layout
- layout_detection
language:
- en
- zh
- multilingual
library_name: PaddleOCR
---
<div align="center">
<h1 align="center">
Layout Analysis Module of PaddleOCR-VL-1.5
</h1>
[GitHub](https://github.com/PaddlePaddle/PaddleOCR)
[Hugging Face](https://huggingface.co/PaddlePaddle/PP-DocLayoutV3)
[ModelScope](https://modelscope.cn/models/PaddlePaddle/PP-DocLayoutV3)
[Hugging Face Demo](https://huggingface.co/spaces/PaddlePaddle/PaddleOCR-VL-1.5_Online_Demo)
[ModelScope Demo](https://modelscope.cn/studios/PaddlePaddle/PaddleOCR-VL-1.5_Online_Demo/summary)
[Discord](https://discord.gg/JPmZXDsEEK)
[X](https://x.com/PaddlePaddle)
[License](./LICENSE)
**🔥 [Official Website](https://www.paddleocr.com)** |
**📝 [Technical Report](https://arxiv.org/pdf/2510.14528)**
</div>
## Introduction
This repository contains the PP-DocLayoutV3 model weights for the PaddlePaddle framework. The safetensors weights are available at [PP-DocLayoutV3_safetensors](https://huggingface.co/PaddlePaddle/PP-DocLayoutV3_safetensors).
**PP-DocLayoutV3 is specifically engineered to handle non-planar document images. Instead of standard two-point bounding boxes, it directly predicts multi-point bounding boxes for layout elements and determines the logical reading order on skewed and curved surfaces within a single forward pass, significantly reducing cascading errors.** The model is an essential component of PaddleOCR-VL-1.5, where it provides the layout analysis needed for high-precision parsing of real-world documents.
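To make the difference between multi-point and two-point boxes concrete, here is a minimal, library-independent sketch. It does not call the model; the polygon coordinates and the helper names (`enclosing_box`, `polygon_area`, `box_area`) are hypothetical and chosen only for illustration. It compares the area of a skewed four-point region with the axis-aligned two-point box that would be needed to enclose it.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max)


def enclosing_box(polygon: List[Point]) -> Box:
    """Smallest axis-aligned two-point box that contains the polygon."""
    xs = [x for x, _ in polygon]
    ys = [y for _, y in polygon]
    return (min(xs), min(ys), max(xs), max(ys))


def polygon_area(polygon: List[Point]) -> float:
    """Area of a simple polygon via the shoelace formula."""
    twice_area = 0.0
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        twice_area += x1 * y2 - x2 * y1
    return abs(twice_area) / 2.0


def box_area(box: Box) -> float:
    """Area of an axis-aligned box."""
    x_min, y_min, x_max, y_max = box
    return (x_max - x_min) * (y_max - y_min)


# Hypothetical multi-point region for a skewed text block (illustrative values,
# not actual model output).
skewed_block: List[Point] = [(10.0, 20.0), (110.0, 40.0), (105.0, 65.0), (5.0, 45.0)]

box = enclosing_box(skewed_block)
fill_ratio = polygon_area(skewed_block) / box_area(box)

print("two-point box:", box)
print("fraction of the box covered by the element:", round(fill_ratio, 3))
```

In this toy case, roughly half of the two-point box lies outside the element, which is the kind of slack a multi-point box avoids on skewed or curved pages.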
### **Model Architecture**
<div align="center">
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/PP-DocLayoutV3.png" width="800"/>
</div>
## Visualization
### Light Variation
<div align="center">
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/layout_lighting.jpg" width="800"/>
</div>
### Skewing
<div align="center">
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/layout_skew.jpg" width="800"/>
</div>
### Screen-photo
<div align="center">
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/layout_screen.jpg" width="800"/>
</div>
### Curving
<div align="center">
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/layout_curv.jpg" width="800"/>
</div>
## Citation
If you find PP-DocLayoutV3 helpful, please consider giving us a star and citing our work.
```bibtex
coming soon
```