File size: 2,057 Bytes
008e1fb
 
 
 
 
4c1fc4e
008e1fb
 
 
 
 
b0bec61
 
a112630
 
b0bec61
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
---
title: Bill Invoice Scanner Pro
emoji: 🧾
colorFrom: blue
colorTo: indigo
sdk: docker
sdk_version: 1.42.0
app_file: app.py
pinned: false
---

# 🧾 Invoice Scanner Pro

πŸš€ **Live Application:** [https://huggingface.co/spaces/Divya499/Bill-Invoice-Scanner-Pro](https://huggingface.co/spaces/Divya499/Bill-Invoice-Scanner-Pro)

## πŸ“– Project Description
Invoice Scanner Pro is a highly capable, GPU-accelerated web application built on Streamlit and EasyOCR. It rapidly automates financial data processing by utilizing regex rules to extract vendor names, precise transaction dates, and total amounts directly from uploaded receipts and invoices. Featuring an interactive dashboard, users can easily perform human-in-the-loop data corrections, push verified information into a local SQLite database, and seamlessly export their records into instantly updated real-time CSV or Excel spreadsheets.

## πŸ“‚ Folder Structure

```text
Bill Invoice detector/
β”œβ”€β”€ bill_scanner/             # Main Source Package
β”‚   β”œβ”€β”€ app.py                # Streamlit Dashboard Entrypoint
β”‚   β”œβ”€β”€ benchmark_sroie.py    # SROIE Benchmarking Script
β”‚   β”œβ”€β”€ database.py           # SQLite Wrapper & Persistence 
β”‚   β”œβ”€β”€ extractor.py          # Field Parsing & Regex Rules
β”‚   └── ocr.py                # Wrapper around EasyOCR
β”œβ”€β”€ SROIE_Dataset/            # Benchmark images and texts
β”œβ”€β”€ tests/                    # Unit tests for the system
β”œβ”€β”€ scripts/                  # Helper processing scripts
β”œβ”€β”€ requirements.txt          # Python dependencies
β”œβ”€β”€ LICENSE                   # Project software license
└── README.md                 # Project documentation
```

## βš™οΈ Installation & Usage

1. **Install Requirements:**
   Make sure you have PyTorch installed for your specific CUDA version (e.g., cu118). Then install the requirements:
   ```bash
   pip install -r requirements.txt
   ```

2. **Run the Dashboard:**
   ```bash
   cd bill_scanner
   streamlit run app.py
   ```