title: OCRQuest
emoji: 🔥
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
OCRQuest
OCR and Keyword Search Web Application
Overview
This project is a web-based prototype that performs Optical Character Recognition (OCR) on an uploaded image containing Hindi and English text. It also provides basic keyword search over the extracted text.
Virtual Environment Setup
Create a virtual environment with "python -m venv ocr-env" and activate it on Windows with "ocr-env\Scripts\activate" (on macOS or Linux, use "source ocr-env/bin/activate").
How to Run Locally
Implement the OCR logic and the Gradio user interface in a Python file (app.py in this repository), save it, and run it from a terminal with "python app.py". Gradio launches the application and prints a local URL where it can be tested in a browser.
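As a rough illustration of what such an app.py could look like, here is a minimal sketch. It is not the actual code of this project: the function names (join_ocr_lines, get_reader, extract_text, build_demo) are hypothetical, and it assumes EasyOCR is used with the "hi" and "en" language codes and that the image is passed to Gradio as a file path.

```python
from functools import lru_cache


def join_ocr_lines(results):
    """Join a list of recognized strings into one block of text, skipping blanks."""
    return "\n".join(s.strip() for s in results if s.strip())


@lru_cache(maxsize=1)
def get_reader():
    # Imported lazily: loading EasyOCR and its models is slow, so do it once.
    import easyocr

    # Assumption: Hindi ("hi") and English ("en") are the two target languages.
    return easyocr.Reader(["hi", "en"])


def extract_text(image_path):
    """Run OCR on the image at image_path and return the recognized text."""
    # detail=0 makes readtext return plain strings instead of (box, text, score).
    results = get_reader().readtext(image_path, detail=0)
    return join_ocr_lines(results)


def build_demo():
    """Build a simple Gradio interface around extract_text (hypothetical layout)."""
    import gradio as gr

    return gr.Interface(
        fn=extract_text,
        inputs=gr.Image(type="filepath", label="Image"),
        outputs=gr.Textbox(label="Extracted text"),
        title="OCRQuest",
    )
```

Under these assumptions, adding `build_demo().launch()` at the end of the file and running "python app.py" would start the local server.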
Deployment Process
- I have deployed this application on Hugging Face Spaces.
- To deploy on Hugging Face Spaces:
  1. Create a repository on Hugging Face under the Spaces tab.
  2. Add the app.py file and a requirements.txt file that lists all dependencies.
  3. Push the repository to Hugging Face, which then deploys the application automatically.
Features
- Upload an image in JPEG, PNG, or other common formats.
- Extract text from the uploaded image using EasyOCR.
- Search for a keyword in the extracted text.
- Display the extracted text and highlight keyword occurrences.
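The keyword search and highlighting features could be sketched as follows. This is an assumption about one possible approach, not the project's actual implementation: the helper name highlight_keyword and the "**" marker are hypothetical, and matching is done case-insensitively on the literal keyword.

```python
import re


def highlight_keyword(text, keyword, marker="**"):
    """Wrap every occurrence of keyword in marker; return (new_text, count)."""
    keyword = keyword.strip()
    if not keyword:
        # Nothing to search for: return the text unchanged.
        return text, 0
    # re.escape treats the keyword as literal text, not as a regex pattern.
    pattern = re.compile(re.escape(keyword), re.IGNORECASE)
    highlighted, count = pattern.subn(
        lambda m: f"{marker}{m.group(0)}{marker}", text
    )
    return highlighted, count
```

For example, `highlight_keyword("Hello world, hello again", "hello")` wraps both occurrences and reports a count of 2. The same function works on Hindi text, since the match is a plain substring search.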
Technologies Used
- Python
- EasyOCR
- Gradio
- Hugging Face Transformers
- PyTorch
- PIL (Python Imaging Library)
Setup and Installation
- Clone this repository:
git clone https://github.com/sprakhil/OCR_Project