---
title: Invoice RAG with MCP
emoji: 📄
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 5.33.1
app_file: app.py
tags:
  - Agents-MCP-Hackathon
  - mcp-server-track
pinned: false
license: mit
---

# 📄 Invoice RAG System with MCP Integration

An intelligent system for processing and querying invoice data using RAG (Retrieval-Augmented Generation) with MCP (Model Context Protocol) support.

## DEMO
[Demo video](https://drive.google.com/file/d/1R6LG7qwtsaHLWZMv02NeJK_cqXaa8e-n/view?usp=sharing)

## 🚀 Features

- **PDF Invoice Processing**: Extract and analyze invoice data
- **RAG System**: Intelligent document retrieval and question answering
- **Multiple LLM Support**: Groq integration with various models
- **MCP Integration**: Expose tools via Model Context Protocol
- **Interactive UI**: Clean Gradio interface

## 🔧 Setup

1. **Set API Keys**: Add your `GROQ_API_KEY` in the Space settings
2. **Upload Invoices**: Use the training tab to process your PDFs
3. **Query Data**: Ask questions about your invoices
4. **MCP Tools**: Access structured data extraction tools

## 📝 Usage

1. **Train**: Upload invoice PDFs to train the RAG system
2. **Query**: Ask natural language questions about your invoices
3. **Extract**: Use MCP tools for structured data extraction
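
To make the train and query steps more concrete, here is a minimal, standard-library-only sketch of the retrieval half of RAG. It is an illustration, not the code in `app.py`: the bag-of-words cosine scoring, the helper names (`tokenize`, `cosine`, `retrieve`), and the sample invoice text are all assumptions made up for this example. A real pipeline would typically use learned embeddings and pass the retrieved chunks to the LLM as context.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(question: str, chunks: list[str], k: int = 1) -> list[str]:
    """Return the k chunks most similar to the question."""
    q = Counter(tokenize(question))
    ranked = sorted(
        chunks,
        key=lambda c: cosine(q, Counter(tokenize(c))),
        reverse=True,
    )
    return ranked[:k]


# Made-up invoice snippets standing in for text extracted from uploaded PDFs.
chunks = [
    "Invoice INV-001 from Acme Corp, total due 1200 USD",
    "Invoice INV-002 from Globex, total due 450 USD",
]

# The best-matching chunk is what would be handed to the LLM as context.
top = retrieve("What is the total due to Globex?", chunks)
print(top)
```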

## 🔑 Required Environment Variables

- `GROQ_API_KEY`: Your Groq API key for LLM access
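
If you run the app outside the Space, it can help to fail early when the key is missing. The sketch below shows one way to do that; the helper name `get_groq_api_key` and the error message are hypothetical and not taken from `app.py`.

```python
import os
from typing import Mapping, Optional


def get_groq_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return GROQ_API_KEY from the environment, or raise a clear error.

    `env` defaults to os.environ; passing a plain dict makes the helper
    easy to exercise without touching the real environment.
    """
    source = os.environ if env is None else env
    key = source.get("GROQ_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "GROQ_API_KEY is not set. Add it in the Space settings "
            "or export it in your shell before starting the app."
        )
    return key
```

Calling `get_groq_api_key()` once at startup surfaces a missing key immediately, rather than on the first LLM request.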

## 🛠️ MCP Integration

The system exposes these MCP tools:
- `query_invoice_info`: Extract information from invoices
- `get_invoice_summary`: Get a summary of processed invoices
- `extract_specific_field`: Extract specific fields
- `list_available_invoices`: List available invoice sources
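
MCP clients invoke tools with a JSON-RPC 2.0 `tools/call` request. The sketch below only builds such a request for `query_invoice_info` and does not send it. The argument name `query` and the helper `build_tool_call` are assumptions for illustration; check the tool schema returned by the server's `tools/list` response for the actual parameter names.

```python
import json
from typing import Any


def build_tool_call(request_id: int, tool_name: str, arguments: dict[str, Any]) -> dict[str, Any]:
    """Build a JSON-RPC 2.0 request that asks an MCP server to run a tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# "query" is a hypothetical argument name, not confirmed by this README.
request = build_tool_call(
    1,
    "query_invoice_info",
    {"query": "What is the total amount on the latest invoice?"},
)
print(json.dumps(request, indent=2))
```

In practice an MCP client library handles the transport and session setup, so you would rarely build this payload by hand. It is shown here only to make the shape of a tool call visible.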