Image-Text-to-Text
PaddleOCR
Safetensors
English
Chinese
multilingual
paddleocr_vl
ERNIE4.5
PaddlePaddle
image-to-text
ocr
document-parse
layout
table
formula
chart
seal
spotting
conversational
custom_code
Eval Results
Instructions to use PaddlePaddle/PaddleOCR-VL-1.5 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PaddleOCR
How to use PaddlePaddle/PaddleOCR-VL-1.5 with PaddleOCR:
# See https://www.paddleocr.ai/latest/version3.x/pipeline_usage/PaddleOCR-VL.html to installation from paddleocr import PaddleOCRVL pipeline = PaddleOCRVL(pipeline_version="v1.5") output = pipeline.predict("path/to/document_image.png") for res in output: res.print() res.save_to_json(save_path="output") res.save_to_markdown(save_path="output") - Notebooks
- Google Colab
- Kaggle
pdf表格识别
#7
by SauceL - opened
如何重现官网对pdf内表格识别的效果,本地运行一直输出....。官网导出markdown里面输出了规范的表格html代码
同问,为什么官网在线体验的表格输出结果和本地部署的有所差异,在线体验的效果比自己部署的更好
@SauceL @pbob 二者的模型和代码完全一致,是不是部署的方式不对呢?这个是两阶段的模型,需要一起部署。或者查看文档:www.paddleocr.ai
里边提供纯离线部署的镜像,可以通过docker compose的方式部署成API。