PaddleOCR ONNX Models

Pre-converted PaddleOCR models in ONNX format for use with Kreuzberg.

Models

| File | Type | Version | Size | Description |
|------|------|---------|------|-------------|
| `ch_PP-OCRv4_det_infer.onnx` | Detection | PP-OCRv4 | 4.5 MB | Text region detection (language-agnostic) |
| `ch_ppocr_mobile_v2.0_cls_infer.onnx` | Classification | PPOCRv2 | 572 KB | Text angle classification (0° vs 180°) |
| `en_PP-OCRv4_rec_infer.onnx` | Recognition | PP-OCRv4 | 7.3 MB | English text recognition |

PaddleOCR uses a three-stage pipeline:

1. Detection (det): Locates text regions in the image using a DB (Differentiable Binarization) network. The detection model is language-agnostic: it identifies bounding boxes around text regardless of script.
2. Classification (cls): Determines text orientation (upright vs upside-down) so that rotated text can be corrected before recognition.
3. Recognition (rec): Reads characters from each detected text region. This is the only language-specific model.
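
The control flow of the three stages can be sketched in plain Python. This is a minimal illustration, not Kreuzberg's or PaddleOCR's actual implementation: the names `run_pipeline`, `crop`, and `rotate_180` are hypothetical, the three stage callables stand in for whatever wraps each ONNX model, and boxes are assumed to be axis-aligned rectangles (real DB detectors typically return quadrilaterals).

```python
from typing import Callable, List, Sequence, Tuple

Image = List[List[int]]          # toy grayscale image: rows of pixel values
Box = Tuple[int, int, int, int]  # (top, left, bottom, right), end-exclusive (assumed layout)


def crop(image: Image, box: Box) -> Image:
    """Cut the axis-aligned region described by `box` out of `image`."""
    top, left, bottom, right = box
    return [row[left:right] for row in image[top:bottom]]


def rotate_180(image: Image) -> Image:
    """Rotate by 180 degrees: reverse the row order and each row."""
    return [row[::-1] for row in image[::-1]]


def run_pipeline(
    image: Image,
    detect: Callable[[Image], Sequence[Box]],   # stand-in for the det model
    is_upside_down: Callable[[Image], bool],    # stand-in for the cls model
    recognize: Callable[[Image], str],          # stand-in for the rec model
) -> List[Tuple[Box, str]]:
    """Run detection, then orientation correction, then recognition per region."""
    results: List[Tuple[Box, str]] = []
    for box in detect(image):                     # stage 1: find text regions
        region = crop(image, box)
        if is_upside_down(region):                # stage 2: 0° vs 180°
            region = rotate_180(region)
        results.append((box, recognize(region)))  # stage 3: read the text
    return results
```

Because the stages are passed in as callables, the same loop works with stub functions for testing or with wrappers around the three `.onnx` files. Only the `recognize` callable would need to change to support another language.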

These models are derived from PaddlePaddle/PaddleOCR, licensed under Apache 2.0.

Detection model (ch_PP-OCRv4_det_infer.onnx): Pre-converted ONNX model sourced from SWHL/RapidOCR on HuggingFace (PP-OCRv4 folder). Originally trained by PaddlePaddle as ch_PP-OCRv4_det_infer.
Classification model (ch_ppocr_mobile_v2.0_cls_infer.onnx): Pre-converted ONNX model sourced from SWHL/RapidOCR on HuggingFace (PP-OCRv1 folder). Originally trained by PaddlePaddle as ch_ppocr_mobile_v2.0_cls_infer.
Recognition model (en_PP-OCRv4_rec_infer.onnx): Converted from PaddlePaddle format to ONNX using paddle2onnx (opset 14). The original PaddlePaddle model (en_PP-OCRv4_rec_infer) was downloaded from PaddlePaddle's official distribution.
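
A conversion like the one described for the recognition model could be reproduced roughly as follows. This is a sketch under assumptions: only the tool (paddle2onnx) and the opset (14) come from this document. The helper name `build_paddle2onnx_command` is hypothetical, and the input directory name and the `inference.pdmodel` / `inference.pdiparams` file names are assumed to follow the usual layout of PaddleOCR inference archives.

```python
from typing import List


def build_paddle2onnx_command(
    model_dir: str,
    save_file: str,
    opset: int = 14,  # opset stated in this README for the recognition model
) -> List[str]:
    """Build the argument list for a paddle2onnx CLI invocation."""
    return [
        "paddle2onnx",
        "--model_dir", model_dir,
        "--model_filename", "inference.pdmodel",     # assumed file name
        "--params_filename", "inference.pdiparams",  # assumed file name
        "--save_file", save_file,
        "--opset_version", str(opset),
    ]


# Assumed directory name: the extracted PaddlePaddle model folder.
cmd = build_paddle2onnx_command(
    "en_PP-OCRv4_rec_infer",
    "en_PP-OCRv4_rec_infer.onnx",
)
print(" ".join(cmd))
```

To actually run the conversion, pass `cmd` to `subprocess.run(cmd, check=True)` in an environment where paddle2onnx is installed.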

The original PaddleOCR models are licensed under Apache License 2.0.
