Spaces:

alfonsovelp
/

llm_document

Sleeping

Alfonso Velasco commited on Oct 16, 2025

Commit

c9f61a8

1 Parent(s): e850a96

adding readme

Files changed (1) hide show

README.md ADDED Viewed

+---
+title: LayoutLMv3 Document Extraction
+emoji: 📄
+colorFrom: blue
+colorTo: green
+sdk: docker
+app_file: app.py
+pinned: false
+---
+# LayoutLMv3 Document Extraction with OCR
+This Space provides document extraction with bounding boxes using LayoutLMv3 and Tesseract OCR.
+## Features
+- PDF processing with OCR
+- Image text extraction
+- Bounding box coordinates for each text element
+- Multi-page PDF support
+## API Usage
+Send POST requests to `/extract` with base64-encoded PDF or image:
+```python