olmOCR / README.md
coderprabhat
fix : bugs
55a0a6c

A newer version of the Gradio SDK is available: 6.1.0

Upgrade
metadata
title: olmOCR Document OCR (CPU)
emoji: πŸ“„
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.49.1
app_file: app.py
python_version: 3.11
pinned: false
license: apache-2.0

olmOCR: Document OCR with Vision Language Models (CPU Version)

This Space uses the olmOCR model to extract text from PDF and image files, optimized for CPU deployment.

Features

  • PDF and image support (PNG, JPEG)
  • Page-by-page processing for PDFs
  • Optimized for CPU inference
  • Free tier deployment

Performance Notes

  • Processing time: 30-90 seconds per page on CPU
  • Image resolution reduced to 1024px for efficiency
  • Uses greedy decoding for faster inference

Model

Uses allenai/olmOCR-2-7B-1025 optimized for CPU.