OCRQuest / README.md
sprakhil's picture
updated
a561de2
---
title: OCRQuest
emoji: 🔥
colorFrom: indigo
colorTo: green
sdk: gradio
sdk_version: 4.44.0
app_file: app.py
pinned: false
---
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
# OCRQuest
# OCR and Keyword Search Web Application
## Overview
This project is a web-based prototype that performs Optical Character Recognition (OCR) on an uploaded image containing text in Hindi and English. The application also implements a basic keyword search functionality based on the extracted text.
## Virtual Environment setup
Setup virtual environment using "python -m venv ocr-env"
and activated using "ocr-env\Scripts\activate"
## How can we run locally
Implement ocr and gradio(for user interface) in a python file(here I have app.py) save file and run it. We can run file using terminal by "python app.py" command. Gradio will launch the application and provide a local URL where we can test it in our browser.
## Deployment Process
- I have deployed this application on Hugging Face Spaces.
- To deploy on Hugging Face Spaces:
- 1.Create a repository on Hugging Face under the Spaces tab.
- 2.Add app.py file and requirements.txt file that lists all dependencies
- 3.Push the repository to Hugging Face, and it will automatically deploy the application.
## Features
- Upload an image in JPEG, PNG, or other common formats.
- Extract text from the uploaded image using EasyOCR.
- Search for a keyword in the extracted text.
- Display the extracted text and highlight keyword occurrences.
## Technologies Used
- Python
- EasyOCR
- Gradio
- Huggingface Transformers
- PyTorch
- PIL (Python Imaging Library)
## Setup and Installation
1. Clone this repository:
```bash
git clone https://github.com/sprakhil/OCR_Project