File size: 1,420 Bytes
38f4471
 
 
 
 
 
f4be98d
38f4471
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
f4be98d
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
---
title: Automated Insurance Claim Validation System
emoji: 📄
colorFrom: blue
colorTo: red
sdk: gradio
sdk_version: 5.27.0
app_file: app.py
pinned: false
license: mit
---

# Automated Insurance Claim Validation System

This project automates the validation process for insurance claims using image processing and NLP techniques.

## Features:
- Support for both PDF and image files
- Multiple page processing
- EasyOCR for reliable text extraction
- BERT-based text validation
- Document classification using pre-trained models
- Export results to Excel
- User-friendly Gradio interface

## System Requirements:
- Python 3.9+
- Required Python packages (see requirements.txt)
- System dependencies: tesseract-ocr, poppler-utils

## Installation:
```bash
pip install -r requirements.txt
```

## Usage:
1. Upload an insurance claim document (PDF or image)
2. The system will:
   - Process all pages in the document
   - Extract text using EasyOCR
   - Validate the extracted text
   - Classify each page
   - Generate a downloadable report
3. Download the Excel report for detailed analysis

## Models Used:
- OCR: EasyOCR
- Text Classification: DistilBERT (distilbert-base-uncased-finetuned-sst-2-english)
- Document Classification: Donut (naver-clova-ix/donut-base-finetuned-rvlcdip)

## Live Demo:
Access the live demo at: https://huggingface.co/spaces/anoopreddyyeddula/Automated-Insurance-Claim-Validation-System