OCRQuest / README.md
sprakhil's picture
updated
a561de2

A newer version of the Gradio SDK is available: 6.7.0

Upgrade
metadata
title: OCRQuest
emoji: 🔥
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

OCRQuest

OCR and Keyword Search Web Application

Overview

This project is a web-based prototype that performs Optical Character Recognition (OCR) on an uploaded image containing text in Hindi and English. The application also implements a basic keyword search functionality based on the extracted text.

Virtual Environment setup

Setup virtual environment using "python -m venv ocr-env" and activated using "ocr-env\Scripts\activate"

How can we run locally

Implement ocr and gradio(for user interface) in a python file(here I have app.py) save file and run it. We can run file using terminal by "python app.py" command. Gradio will launch the application and provide a local URL where we can test it in our browser.

Deployment Process

  • I have deployed this application on Hugging Face Spaces.
  • To deploy on Hugging Face Spaces:
  • 1.Create a repository on Hugging Face under the Spaces tab.
  • 2.Add app.py file and requirements.txt file that lists all dependencies
  • 3.Push the repository to Hugging Face, and it will automatically deploy the application.

Features

  • Upload an image in JPEG, PNG, or other common formats.
  • Extract text from the uploaded image using EasyOCR.
  • Search for a keyword in the extracted text.
  • Display the extracted text and highlight keyword occurrences.

Technologies Used

  • Python
  • EasyOCR
  • Gradio
  • Huggingface Transformers
  • PyTorch
  • PIL (Python Imaging Library)

Setup and Installation

  1. Clone this repository:
    git clone https://github.com/sprakhil/OCR_Project