Spaces:

Jaimodiji
/

Report-Generator

Running

App Files Files Community

Report-Generator / README.md

Jaimodiji

Upload README.md with huggingface_hub

fc526fe about 1 month ago

preview code

raw

history blame contribute delete

4.56 kB

metadata

title: Report Generator
emoji: 📄
colorFrom: blue
colorTo: indigo
sdk: docker
pinned: false
app_port: 7680

DocuPDF: Smart Scanner & PDF Generator

DocuPDF is a web-based utility that transforms images of documents—like test papers, notes, or book pages—into a polished, organized, and enhanced PDF. It provides powerful 8-point perspective crop, image enhancement tools, and detailed metadata entry, all within your browser.

✨ Key Features

Dual Input Support

PDF Upload: Upload a PDF document and extract individual pages as images
Image Upload: Upload multiple image files directly (PNG, JPG, JPEG, GIF, BMP)

Advanced Cropping

Powerful 8-point perspective correction tool
Draw multiple crop boxes on a single page/image
Touch-friendly UI with magnifying loupe for precision

Image Enhancement

Fine-tune scans with adjustments for:
- Brightness
- Contrast
- Gamma

Metadata Management

Add structured metadata to each question:
- Question Number
- Subject
- Status (Correct/Wrong/Unattempted)
- Marked Solution
- Actual Solution
- Time Taken

Smart Features

Automatic question number extraction using NVIDIA NIM OCR (when API key is provided)
Dashboard for session management
Custom PDF generation with filtering options

Modern UI

Responsive design that works on desktop and mobile
Dark theme for comfortable extended use
Keyboard shortcuts for power users

🔧 Tech Stack

Backend: Flask (Python)
Image Processing: OpenCV, Pillow, PyMuPDF
Frontend: HTML5, Bootstrap 5, JavaScript
Database: SQLite
OCR: NVIDIA NIM API (optional)

⚙️ Installation & Setup

Prerequisites

Python 3.7+
pip package installer

Step-by-Step Guide

Clone the Repository

git clone <repository-url>
cd Report-Generator

Create and Activate a Virtual Environment (Recommended)

# Create the environment
python -m venv venv

# Activate it
# On Windows:
.\venv\Scripts\activate
# On macOS/Linux:
source venv/bin/activate

Install Dependencies
```
pip install -r requirements.txt
```
Run the Application
```
python app.py
```
Open in Browser Navigate to http://127.0.0.1:1302 in your web browser

Environment Variables (Optional)

To enable the automatic question number extraction feature, set the NVIDIA_API_KEY environment variable:

# On Linux/macOS:
export NVIDIA_API_KEY="your-api-key-here"

# On Windows:
set NVIDIA_API_KEY=your-api-key-here

# Or create a .env file with:
NVIDIA_API_KEY=your-api-key-here

If you don't set this variable, the application will still work but the automatic question number extraction feature will be disabled.

📖 How to Use

Workflow Options

PDF Workflow:
- Upload a PDF document
- Each page is converted to an image
- Crop and enhance individual pages
- Enter question details
- Generate final PDF
Image Workflow:
- Upload multiple image files directly
- Crop and enhance individual images
- Enter question details
- Generate final PDF

Step 1: Choose Input Method

Select either PDF upload or multiple image upload from the main page

Step 2: Crop & Enhance

For each page/image, draw crop boxes around questions
Use the sliders to adjust brightness, contrast, and gamma
Save and continue to the next page/image

Step 3: Enter Details

Fill in metadata for each extracted question
Use productivity features like "Same Subject for All"
Extract question numbers automatically (if NVIDIA API is configured)

Step 4: Generate & Download

Choose PDF name and layout options
Filter questions by status if needed
Generate and download your final document

📁 Project Structure

/
├── uploads/              # Stores original user uploads (temporary)
├── processed/            # Stores cropped & enhanced images (temporary)
├── output/               # Stores the final generated PDFs
├── templates/            # Contains all HTML files
├── app.py                # The main Flask application logic
├── strings.py            # Route constants and string definitions
├── requirements.txt      # List of Python dependencies
└── README.md             # This file

🧪 Testing

Run the test suite to verify functionality:

python test.py

📄 License

This project is licensed under the MIT License.