Spaces:

sprakhil
/

OCRQuest

Sleeping

App Files Files Community

OCRQuest / README.md

sprakhil

updated

a561de2 over 1 year ago

preview code

raw

history blame contribute delete

1.78 kB

A newer version of the Gradio SDK is available: 6.7.0

Upgrade

metadata

title: OCRQuest
emoji: 🔥
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

OCRQuest

OCR and Keyword Search Web Application

Overview

This project is a web-based prototype that performs Optical Character Recognition (OCR) on an uploaded image containing text in Hindi and English. The application also implements a basic keyword search functionality based on the extracted text.

Virtual Environment setup

Setup virtual environment using "python -m venv ocr-env" and activated using "ocr-env\Scripts\activate"

How can we run locally

Implement ocr and gradio(for user interface) in a python file(here I have app.py) save file and run it. We can run file using terminal by "python app.py" command. Gradio will launch the application and provide a local URL where we can test it in our browser.

Deployment Process

I have deployed this application on Hugging Face Spaces.
To deploy on Hugging Face Spaces:
1.Create a repository on Hugging Face under the Spaces tab.
2.Add app.py file and requirements.txt file that lists all dependencies
3.Push the repository to Hugging Face, and it will automatically deploy the application.

Features

Upload an image in JPEG, PNG, or other common formats.
Extract text from the uploaded image using EasyOCR.
Search for a keyword in the extracted text.
Display the extracted text and highlight keyword occurrences.

Technologies Used

Python
EasyOCR
Gradio
Huggingface Transformers
PyTorch
PIL (Python Imaging Library)

Setup and Installation

Clone this repository:

git clone https://github.com/sprakhil/OCR_Project