Commit History

Implement PP-OCRv5 using official model names
edb3860
Running

Marc Allen Lopez commited on

Enable PP-OCRv5 support with fallback
e4fceaf

Marc Allen Lopez commited on

textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors
5b041e6

Marc Allen Lopez commited on

textsense-ocr: add libgl1 and libgomp1 to fix OpenCV/Paddle runtime deps
d29ee6e

Marc Allen Lopez commited on

Switch OCR backend to PaddleOCR\n\n- Replace MiniCPM inference with PaddleOCR (CPU)\n- Simplify requirements to Paddle and PaddleOCR only\n- Use slim Python image; drop CUDA dependencies\n- Keep API compatible: /extract returns text lines\n
aafb1d3

Marc Allen Lopez commited on

Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality
88f879c

Marc Allen Lopez commited on

Fix Qwen2-VL model loading - use correct model class
e0b420c

Marc Allen Lopez commited on

Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct
778d70f

Marc Allen Lopez commited on

Switch to PyTorch devel image and set CUDA env vars for flash_attn compilation
5595512

Marc Allen Lopez commited on

Add flash_attn back and install build tools for dots.ocr model compatibility
2540b0e

Marc Allen Lopez commited on

Update transformers to >=4.55.0 for dots.ocr compatibility
56f307f

Marc Allen Lopez commited on

Fix build: remove flash-attn dependency and add git to Dockerfile for dots.ocr compatibility
284f016

Marc Allen Lopez commited on

Fix dependencies: pin exact versions and update PyTorch base image for dots.ocr compatibility
d8616e1

Marc Allen Lopez commited on

Add missing dependencies for dots.ocr VLM model (transformers, accelerate, torch, etc.)
5c9be52

Marc Allen Lopez commited on

Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies
cef0c83

Marc Allen Lopez commited on

OCR: switch to TrOCR printed model and better decoding (beam search)
30a9f52

Marc Allen Lopez commited on

Improve error handling for network issues and connection failures
2b73fdf

Marc Allen Lopez commited on

Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction
198cd52

Marc Allen Lopez commited on

initial commit
3a448fd
verified

mlopez6132 commited on