textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors
5b041e6
Marc Allen Lopez committed on
textsense-ocr: add libgl1 and libgomp1 to fix OpenCV/Paddle runtime deps
d29ee6e
Marc Allen Lopez committed on
Switch OCR backend to PaddleOCR

- Replace MiniCPM inference with PaddleOCR (CPU)
- Simplify requirements to Paddle and PaddleOCR only
- Use slim Python image; drop CUDA dependencies
- Keep API compatible: /extract returns text lines
aafb1d3
Marc Allen Lopez committed on
Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality
88f879c
Marc Allen Lopez committed on
Fix Qwen2-VL model loading - use correct model class
e0b420c
Marc Allen Lopez committed on
Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct
778d70f
Marc Allen Lopez committed on
Switch to PyTorch devel image and set CUDA env vars for flash_attn compilation
5595512
Marc Allen Lopez committed on
Add flash_attn back and install build tools for dots.ocr model compatibility
2540b0e
Marc Allen Lopez committed on
Update transformers to >=4.55.0 for dots.ocr compatibility
56f307f
Marc Allen Lopez committed on
Fix build: remove flash-attn dependency and add git to Dockerfile for dots.ocr compatibility
284f016
Marc Allen Lopez committed on
Fix dependencies: pin exact versions and update PyTorch base image for dots.ocr compatibility
d8616e1
Marc Allen Lopez committed on
Add missing dependencies for dots.ocr VLM model (transformers, accelerate, torch, etc.)
5c9be52
Marc Allen Lopez committed on
Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies
cef0c83
Marc Allen Lopez committed on
OCR: switch to TrOCR printed model and better decoding (beam search)
30a9f52
Marc Allen Lopez committed on
Improve error handling for network issues and connection failures
2b73fdf
Marc Allen Lopez committed on
Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction