coderprabhat commited on
Commit
1c1eb03
Β·
1 Parent(s): ef35d07

README.md added

Browse files
Files changed (1) hide show
  1. README.md +24 -6
README.md CHANGED
@@ -1,12 +1,30 @@
1
  ---
2
- title: OlmOCR
3
- emoji: πŸŒ–
4
- colorFrom: red
5
- colorTo: indigo
6
  sdk: gradio
7
- sdk_version: 5.49.1
8
  app_file: app.py
 
9
  pinned: false
 
10
  ---
11
 
12
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ title: olmOCR Document OCR (CPU)
3
+ emoji: πŸ“„
4
+ colorFrom: blue
5
+ colorTo: green
6
  sdk: gradio
7
+ sdk_version: 4.0.0
8
  app_file: app.py
9
+ python_version: 3.11
10
  pinned: false
11
+ license: apache-2.0
12
  ---
13
 
14
+ # olmOCR: Document OCR with Vision Language Models (CPU Version)
15
+
16
+ This Space uses the olmOCR model to extract text from PDF and image files, optimized for CPU deployment.
17
+
18
+ ## Features
19
+ - PDF and image support (PNG, JPEG)
20
+ - Page-by-page processing for PDFs
21
+ - Optimized for CPU inference
22
+ - Free tier deployment
23
+
24
+ ## Performance Notes
25
+ - Processing time: 30-90 seconds per page on CPU
26
+ - Image resolution reduced to 1024px for efficiency
27
+ - Uses greedy decoding for faster inference
28
+
29
+ ## Model
30
+ Uses `allenai/olmOCR-2-7B-1025` optimized for CPU.