Spaces:

mlopez6132
/

textsense-ocr

Running

App Files Files Community

Commit History

Implement PP-OCRv5 using official model names

edb3860

Running

Marc Allen Lopez commited on Sep 15, 2025

Enable PP-OCRv5 support with fallback

e4fceaf

Marc Allen Lopez commited on Sep 15, 2025

textsense-ocr: set PPOCR_HOME to writable /tmp and create it to avoid permission errors

5b041e6

Marc Allen Lopez commited on Aug 13, 2025

textsense-ocr: add libgl1 and libgomp1 to fix OpenCV/Paddle runtime deps

d29ee6e

Marc Allen Lopez commited on Aug 13, 2025

Switch OCR backend to PaddleOCR\n\n- Replace MiniCPM inference with PaddleOCR (CPU)\n- Simplify requirements to Paddle and PaddleOCR only\n- Use slim Python image; drop CUDA dependencies\n- Keep API compatible: /extract returns text lines\n

aafb1d3

Marc Allen Lopez commited on Aug 13, 2025

Replace Qwen2-VL with MiniCPM-V-4 for OCR functionality

88f879c

Marc Allen Lopez commited on Aug 13, 2025

Fix Qwen2-VL model loading - use correct model class

e0b420c

Marc Allen Lopez commited on Aug 12, 2025

Fix dots.ocr model error by switching to Qwen2-VL-2B-Instruct

778d70f

Marc Allen Lopez commited on Aug 12, 2025

Switch to PyTorch devel image and set CUDA env vars for flash_attn compilation

5595512

Marc Allen Lopez commited on Aug 12, 2025

Add flash_attn back and install build tools for dots.ocr model compatibility

2540b0e

Marc Allen Lopez commited on Aug 12, 2025

Update transformers to >=4.55.0 for dots.ocr compatibility

56f307f

Marc Allen Lopez commited on Aug 12, 2025

Fix build: remove flash-attn dependency and add git to Dockerfile for dots.ocr compatibility

284f016

Marc Allen Lopez commited on Aug 12, 2025

Fix dependencies: pin exact versions and update PyTorch base image for dots.ocr compatibility

d8616e1

Marc Allen Lopez commited on Aug 12, 2025

Add missing dependencies for dots.ocr VLM model (transformers, accelerate, torch, etc.)

5c9be52

Marc Allen Lopez commited on Aug 12, 2025

Replace TrOCR with rednote-hilab/dots.ocr; add VLM pipeline and dependencies

cef0c83

Marc Allen Lopez commited on Aug 12, 2025

OCR: switch to TrOCR printed model and better decoding (beam search)

30a9f52

Marc Allen Lopez commited on Aug 12, 2025

Improve error handling for network issues and connection failures

2b73fdf

Marc Allen Lopez commited on Aug 12, 2025

Add TrOCR OCR functionality with FastAPI - Docker container with TrOCR model for image-to-text extraction

198cd52

Marc Allen Lopez commited on Aug 12, 2025

initial commit

3a448fd
verified

mlopez6132 commited on Aug 12, 2025