metadata
title: ImageToOCRPdf
emoji: 📄
colorFrom: blue
colorTo: purple
sdk: streamlit
sdk_version: 1.36.0
app_file: app.py
pinned: false

📄 Image / PDF → Searchable PDF (OCR)

This Hugging Face Space converts images (PNG/JPG/JPEG) and PDF files into fully searchable OCR PDFs using Tesseract OCR and Poppler.

✔ Supports images
✔ Supports multi-page PDFs
✔ Works fully in-browser
✔ Download the final searchable PDF


🚀 How It Works

This app uses:

  • Streamlit for UI
  • Tesseract OCR for text extraction
  • pdf2image to convert PDFs into images
  • Pillow for image processing
  • Poppler backend for high-quality PDF rendering
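The components listed above could fit together roughly as in the sketch below. This is not the Space's actual `app.py`: the function names are hypothetical, the 300 DPI setting is an arbitrary choice, and `pypdf` is an assumption used only to merge the per-page PDFs (it is not one of the listed dependencies). Third-party imports are kept inside the functions so the module loads even when Tesseract or Poppler is missing.

```python
from io import BytesIO
from typing import Iterable


def load_pages(data: bytes, filename: str) -> list:
    """Return PIL images: one per PDF page, or a single uploaded image."""
    if filename.lower().endswith(".pdf"):
        from pdf2image import convert_from_bytes  # requires Poppler on the system
        return convert_from_bytes(data, dpi=300)  # 300 DPI is an assumed setting
    from PIL import Image
    return [Image.open(BytesIO(data))]


def ocr_page_to_pdf(image) -> bytes:
    """Run Tesseract on one page and return a single-page searchable PDF."""
    import pytesseract  # requires the tesseract binary on the system
    return pytesseract.image_to_pdf_or_hocr(image, extension="pdf")


def merge_pdfs(pdf_pages: Iterable[bytes]) -> bytes:
    """Concatenate single-page PDFs (assumes pypdf, not listed in this README)."""
    from pypdf import PdfWriter
    writer = PdfWriter()
    for page_bytes in pdf_pages:
        writer.append(BytesIO(page_bytes))
    out = BytesIO()
    writer.write(out)
    return out.getvalue()


def to_searchable_pdf(data: bytes, filename: str,
                      load=load_pages, ocr=ocr_page_to_pdf,
                      merge=merge_pdfs) -> bytes:
    """Chain the stages: render pages, OCR each one, merge the results."""
    return merge(ocr(page) for page in load(data, filename))
```

Passing the stages in as parameters is only a convenience for swapping in stubs while testing; a real app could call the three functions directly.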

📥 Supported Upload Types

  • PNG
  • JPG / JPEG
  • PDF (multi-page supported)
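As an illustration of how an app might enforce these upload types, here is a minimal sketch of an extension check. The helper name `upload_kind` and the constant `ALLOWED_EXTENSIONS` are hypothetical and not taken from this Space's code.

```python
from pathlib import Path

# Extensions taken from the list of supported upload types above.
ALLOWED_EXTENSIONS = {".png", ".jpg", ".jpeg", ".pdf"}


def upload_kind(filename: str) -> str:
    """Classify an upload as 'pdf' or 'image', rejecting anything else."""
    suffix = Path(filename).suffix.lower()  # case-insensitive match
    if suffix not in ALLOWED_EXTENSIONS:
        raise ValueError(f"Unsupported file type: {suffix or filename!r}")
    return "pdf" if suffix == ".pdf" else "image"
```

In a Streamlit UI the same restriction can usually be expressed up front by passing `type=["png", "jpg", "jpeg", "pdf"]` to `st.file_uploader`; the helper above then only decides which processing path a file takes.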

▢️ Run Locally

Install Python dependencies