|
|
--- |
|
|
license: apache-2.0 |
|
|
pipeline_tag: image-segmentation |
|
|
tags: |
|
|
- PaddleOCR |
|
|
- PaddlePaddle |
|
|
- image-segmentation |
|
|
- ocr |
|
|
- layout |
|
|
- layout_detection |
|
|
language: |
|
|
- en |
|
|
- zh |
|
|
- multilingual |
|
|
library_name: PaddleOCR |
|
|
--- |
|
|
|
|
|
<div align="center"> |
|
|
|
|
|
|
|
|
<h1 align="center"> |
|
|
|
|
|
Layout Analysis Module of PaddleOCR-VL-1.5 |
|
|
|
|
|
</h1> |
|
|
|
|
|
[](https://github.com/PaddlePaddle/PaddleOCR) |
|
|
[](https://huggingface.co/PaddlePaddle/PP-DocLayoutV3) |
|
|
[](https://modelscope.cn/models/PaddlePaddle/PP-DocLayoutV3) |
|
|
[](https://huggingface.co/spaces/PaddlePaddle/PaddleOCR-VL-1.5_Online_Demo) |
|
|
[](https://modelscope.cn/studios/PaddlePaddle/PaddleOCR-VL-1.5_Online_Demo/summary) |
|
|
[](https://discord.gg/JPmZXDsEEK) |
|
|
[](https://x.com/PaddlePaddle) |
|
|
[](./LICENSE) |
|
|
|
|
|
**🔥 [Official Website](https://www.paddleocr.com)** | |
|
|
**📝 [Technical Report](https://arxiv.org/pdf/2510.14528)** |
|
|
|
|
|
</div> |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
## Introduction |
|
|
|
|
|
This is the PP-Doclayoutv3 model weights for the PaddlePaddle framework. Get safetensors weights at [PP-DocLayoutV3_safetensors](https://huggingface.co/PaddlePaddle/PP-DocLayoutV3_safetensors) |
|
|
|
|
|
**PP-DocLayoutV3 is specifically engineered to handle non-planar document images. It can directly predict multi-point bounding boxes for layout elements—as opposed to standard two-point boxes—and determine logical reading orders for skewed and curved surfaces within a single forward pass, significantly reducing cascading errors.** This model is an essential component of PaddleOCR-VL-1.5, providing crucial layout analysis for the high-precision parsing of various real-world documents in PaddleOCR-VL. |
|
|
|
|
|
|
|
|
### **Model Architecture** |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/PP-DocLayoutV3.png" width="800"/> |
|
|
</div> |
|
|
|
|
|
|
|
|
## Visualization |
|
|
|
|
|
|
|
|
### Light Variation |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/layout_lighting.jpg" width="800"/> |
|
|
</div> |
|
|
|
|
|
|
|
|
### Skewing |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/layout_skew.jpg" width="800"/> |
|
|
</div> |
|
|
|
|
|
|
|
|
### Screen-photo |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/layout_screen.jpg" width="800"/> |
|
|
</div> |
|
|
|
|
|
|
|
|
### Curving |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/paddleocr_vl_1_5/layout_curv.jpg" width="800"/> |
|
|
</div> |
|
|
|
|
|
|
|
|
## Citation |
|
|
|
|
|
If you find PP-DocLayoutV3 helpful, feel free to give us a star and citation. |
|
|
|
|
|
```bibtex |
|
|
comming soon |
|
|
``` |