File size: 1,420 Bytes
38f4471 f4be98d 38f4471 f4be98d | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 | ---
title: Automated Insurance Claim Validation System
emoji: 📄
colorFrom: blue
colorTo: red
sdk: gradio
sdk_version: 5.27.0
app_file: app.py
pinned: false
license: mit
---
# Automated Insurance Claim Validation System
This project automates the validation process for insurance claims using image processing and NLP techniques.
## Features:
- Support for both PDF and image files
- Multiple page processing
- EasyOCR for reliable text extraction
- BERT-based text validation
- Document classification using pre-trained models
- Export results to Excel
- User-friendly Gradio interface
## System Requirements:
- Python 3.9+
- Required Python packages (see requirements.txt)
- System dependencies: tesseract-ocr, poppler-utils
## Installation:
```bash
pip install -r requirements.txt
```
## Usage:
1. Upload an insurance claim document (PDF or image)
2. The system will:
- Process all pages in the document
- Extract text using EasyOCR
- Validate the extracted text
- Classify each page
- Generate a downloadable report
3. Download the Excel report for detailed analysis
## Models Used:
- OCR: EasyOCR
- Text Classification: DistilBERT (distilbert-base-uncased-finetuned-sst-2-english)
- Document Classification: Donut (naver-clova-ix/donut-base-finetuned-rvlcdip)
## Live Demo:
Access the live demo at: https://huggingface.co/spaces/anoopreddyyeddula/Automated-Insurance-Claim-Validation-System |