Alfonso Velasco commited on
Commit
c9f61a8
·
1 Parent(s): e850a96

adding readme

Browse files
Files changed (1) hide show
  1. README.md +24 -0
README.md ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ title: LayoutLMv3 Document Extraction
3
+ emoji: 📄
4
+ colorFrom: blue
5
+ colorTo: green
6
+ sdk: docker
7
+ app_file: app.py
8
+ pinned: false
9
+ ---
10
+
11
+ # LayoutLMv3 Document Extraction with OCR
12
+
13
+ This Space provides document extraction with bounding boxes using LayoutLMv3 and Tesseract OCR.
14
+
15
+ ## Features
16
+ - PDF processing with OCR
17
+ - Image text extraction
18
+ - Bounding box coordinates for each text element
19
+ - Multi-page PDF support
20
+
21
+ ## API Usage
22
+
23
+ Send POST requests to `/extract` with base64-encoded PDF or image:
24
+ ```python