Instructions to use PaddlePaddle/PaddleOCR-VL with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PaddleOCR
How to use PaddlePaddle/PaddleOCR-VL with PaddleOCR:
    # See https://www.paddleocr.ai/latest/version3.x/pipeline_usage/PaddleOCR-VL.html for installation
    from paddleocr import PaddleOCRVL

    pipeline = PaddleOCRVL(pipeline_version="v1")
    output = pipeline.predict("path/to/document_image.png")
    for res in output:
        res.print()
        res.save_to_json(save_path="output")
        res.save_to_markdown(save_path="output")

- Notebooks
- Google Colab
- Kaggle
Images only? Is PDF not supported?
#3
by stephonye - opened
Many scanned documents, including large multi-page ones, are in PDF format. Is it only the demo that does not support PDF, or does local deployment lack PDF support as well?
PaddleOCR-VL provides built-in support for PDF recognition. However, due to the limited computational resources in the demo environment, inference on multi-page PDF documents is temporarily unavailable. You can deploy the model locally to explore its full capabilities.
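For local deployment, a minimal sketch of batch-parsing scanned PDFs might look like the following. It reuses only the calls shown in the usage snippet on this page (`PaddleOCRVL`, `predict`, `save_to_json`, `save_to_markdown`) and assumes that `predict()` accepts a PDF path in the same way it accepts an image path, yielding results for the document's pages. The helper names `find_pdfs` and `parse_pdfs` and the `scans` folder are hypothetical, introduced here for illustration only.

```python
from pathlib import Path


def find_pdfs(root):
    """Return all PDF files under `root`, sorted, matching the suffix case-insensitively."""
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.suffix.lower() == ".pdf"
    )


def parse_pdfs(pdf_paths, out_dir="output"):
    """Run PaddleOCR-VL over each PDF and save JSON and Markdown results to `out_dir`."""
    # Imported lazily so find_pdfs() can be used without paddleocr installed.
    from paddleocr import PaddleOCRVL

    # Build the pipeline once and reuse it for every document.
    pipeline = PaddleOCRVL(pipeline_version="v1")
    for pdf_path in pdf_paths:
        # Assumption: predict() takes a PDF path just as it takes an image path.
        for res in pipeline.predict(str(pdf_path)):
            res.save_to_json(save_path=out_dir)
            res.save_to_markdown(save_path=out_dir)
```

Calling `parse_pdfs(find_pdfs("scans"))` would then process every PDF under a local `scans` folder. For very large documents, memory use and runtime depend on the local hardware, so it may be worth trying a short PDF first.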