---
title: Multi-Rag
emoji: 🤖
colorFrom: blue
colorTo: green
sdk: docker
app_file: main.py
pinned: false
short_description: This is the Multi-Rag Agent
---


<div align="center">
  <h1>🚀 Multi-RAG AI Pipeline</h1>
  <p><strong>Advanced Multi-Agent RAG Orchestration powered by LangGraph, AWS Bedrock, and FAISS</strong></p>

  [![Python](https://img.shields.io/badge/Python-3.12+-blue.svg)](https://www.python.org/)
  [![LangGraph](https://img.shields.io/badge/Framework-LangGraph-orange.svg)](https://github.com/langchain-ai/langgraph)
  [![FastAPI](https://img.shields.io/badge/Backend-FastAPI-green.svg)](https://fastapi.tiangolo.com/)
  [![FAISS](https://img.shields.io/badge/VectorDB-FAISS-red.svg)](https://github.com/facebookresearch/faiss)
</div>

---

## 📖 Overview

**Multi-RAG AI** is a multi-agent RAG (Retrieval-Augmented Generation) pipeline for answering questions over your documents. It uses **LangGraph** for orchestration: an autonomous "Orchestrator" agent decides which specialized workers (PDF, DOCX, TXT, Images, Web Search) are needed to answer a complex user query.

### Why Multi-RAG?
- **Intelligent Fan-out**: The orchestrator can trigger multiple workers in parallel to gather information from different sources.
- **Dynamic Routing**: Automatically detects file types and routes tasks to specialized loaders.
- **OCR Integration**: Built-in support for image processing and optical character recognition.
- **Web Search Fallback**: If local documents are insufficient, the agents can autonomously search the live web.
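The routing, fan-out, and fallback ideas above can be sketched in plain Python. This is only an illustration using the standard library, not the project's LangGraph implementation: the worker names, `route`, `run_worker`, `web_search`, and `answer_sources` are all hypothetical, and the two worker functions are stand-ins that return placeholder strings.

```python
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

# Hypothetical worker names keyed by file extension; the real names may differ.
WORKER_BY_EXTENSION = {
    ".pdf": "pdf_worker",
    ".docx": "docx_worker",
    ".txt": "txt_worker",
    ".png": "image_worker",
    ".jpg": "image_worker",
    ".jpeg": "image_worker",
}


def route(file_name: str) -> str:
    """Pick a worker from the file extension (case-insensitive)."""
    suffix = Path(file_name).suffix.lower()
    return WORKER_BY_EXTENSION.get(suffix, "unsupported")


def run_worker(worker: str, file_name: str, query: str) -> list[str]:
    """Stand-in for a real loader + retriever; returns placeholder snippets."""
    if worker == "unsupported":
        return []
    return [f"{worker} result for '{query}' from {file_name}"]


def web_search(query: str) -> list[str]:
    """Stand-in for a live web search call."""
    return [f"web result for '{query}'"]


def answer_sources(files: list[str], query: str) -> list[str]:
    """Fan out to one worker per file in parallel, then fall back to the web."""
    with ThreadPoolExecutor() as pool:
        # pool.map keeps results in the same order as the input files.
        batches = list(pool.map(lambda f: run_worker(route(f), f, query), files))
    snippets = [snippet for batch in batches for snippet in batch]
    # Only search the web when no local worker produced anything.
    return snippets or web_search(query)
```

In this sketch the web search runs only when every local worker comes back empty; a real orchestrator could just as well schedule it alongside the document workers.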

---

## 🏗️ Architecture

The system is built as a nested graph structure, providing a clean separation between high-level orchestration and low-level specialized tasks.

### 1. Main Orchestration Graph
The main graph handles the interaction between the user, the orchestrator, and the final chat response.

![Main Graph Architecture](./graph.png)

### 2. Worker Sub-Graph
The worker sub-graph is responsible for specialized information retrieval from various file formats.

![Worker Sub-Graph](./worker_sub_graph.png)

---

## ✨ Key Features

- **📂 Multi-Format Support**:
  - **PDF**: Deep document parsing.
  - **DOCX**: Microsoft Word document integration.
  - **TXT**: Plain text analysis.
  - **Images (OCR)**: Extraction of text from PNG/JPG using specialized loaders.
- **🤖 Autonomous Orchestration**: Uses a Llama-3.3-70B model on **AWS Bedrock**, with a manual JSON fallback mechanism to make structured output more reliable.
- **🔍 Hybrid Retrieval**: Combines local FAISS vector stores with real-time Google Search integration.
- **🧠 Persistence & Memory**: Full multi-turn conversation support with LangGraph checkpointers.
- **⚡ Modern Tech Stack**: Built with `uv` for fast dependency management and `FastAPI` for the backend.
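One way a manual JSON fallback like the one mentioned above could work is to scan the model's raw reply for the first decodable JSON object when the reply is not clean JSON. The sketch below is an assumption about the general technique, not the project's actual code: `extract_json_object`, `parse_plan`, and the `"workers"` key are hypothetical names chosen for illustration.

```python
import json
from typing import Any, Optional


def extract_json_object(raw: str) -> Optional[dict[str, Any]]:
    """Return the first JSON object embedded in raw, or None if there is none."""
    decoder = json.JSONDecoder()
    start = raw.find("{")
    while start != -1:
        try:
            # raw_decode parses one JSON value starting at the given index
            # and ignores any trailing text after it.
            value, _ = decoder.raw_decode(raw, start)
        except json.JSONDecodeError:
            value = None
        if isinstance(value, dict):
            return value
        start = raw.find("{", start + 1)
    return None


def parse_plan(raw: str) -> dict[str, Any]:
    """Parse an orchestrator reply, defaulting to an empty worker list."""
    plan = extract_json_object(raw) or {}
    workers = plan.get("workers", [])  # "workers" is a hypothetical schema key
    if not isinstance(workers, list):
        workers = []
    return {"workers": [w for w in workers if isinstance(w, str)]}
```

Falling back to an empty worker list keeps the caller's control flow simple: a malformed reply degrades to "no workers selected" instead of raising.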

---

## 🛠️ Tech Stack

- **Core**: [Python 3.12](https://www.python.org/)
- **Orchestration**: [LangGraph](https://github.com/langchain-ai/langgraph) & [LangChain](https://github.com/langchain-ai/langchain)
- **Large Language Models**: [AWS Bedrock](https://aws.amazon.com/bedrock/) (Llama 3.3 70B)
- **Vector Storage**: [FAISS](https://github.com/facebookresearch/faiss)
- **Embeddings**: [HuggingFace](https://huggingface.co/) (all-MiniLM-L6-v2)
- **Backend API**: [FastAPI](https://fastapi.tiangolo.com/)
- **Package Management**: [uv](https://github.com/astral-sh/uv)

---

## 🚀 Getting Started

### Prerequisites
- Python 3.12+
- `uv` installed (`pip install uv`)
- AWS Credentials (for Bedrock access)

### 1. Installation
```bash
# Clone the repository
git clone https://github.com/VashuTheGreat/Multi-Rag.git
cd Multi-Rag

# Install dependencies
uv sync
```

### 2. Environment Setup
Create a `.env` file in the root directory:
```env
# AWS Bedrock Config
AWS_ACCESS_KEY_ID=your_access_key
AWS_SECRET_ACCESS_KEY=your_secret_key
AWS_REGION_NAME=us-east-1

# Tooling (e.g., Search API keys if applicable)
# ...
```
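A small startup check can catch a missing or blank setting before the first Bedrock call fails. The following is a minimal sketch that assumes the three variables from the `.env` example have already been loaded into the process environment; `REQUIRED_VARS` and `missing_settings` are hypothetical names, not part of the project.

```python
import os
from collections.abc import Mapping

# Variable names taken from the .env example above.
REQUIRED_VARS = ("AWS_ACCESS_KEY_ID", "AWS_SECRET_ACCESS_KEY", "AWS_REGION_NAME")


def missing_settings(env: Mapping[str, str] = os.environ) -> list[str]:
    """Return the required variable names that are unset or blank."""
    return [name for name in REQUIRED_VARS if not env.get(name, "").strip()]
```

Calling `missing_settings()` at startup and aborting with a clear message when the list is non-empty tends to be easier to debug than a credentials error raised deep inside a request.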

### 3. Run the Application
```bash
# Start the FastAPI server
uv run main.py
```
Navigate to `http://127.0.0.1:8000` to start chatting with your documents!

---

## 📂 Project Structure

```text
Multi-Rag/
├── api/                # FastAPI Endpoints & Controllers
├── src/
│   └── MultiRag/
│       ├── components/ # Core graph runners & embedders
│       ├── graph/      # LangGraph definitions (Main & Worker)
│       ├── models/     # Pydantic state & output schemas
│       ├── nodes/      # Individual graph node implementations
│       ├── prompts/    # LLM system prompts
│       └── utils/      # Ingestion & document processing utilities
├── static/             # Frontend assets (CSS, JS)
├── templates/          # Jinja2 HTML templates
└── db/                 # Local FAISS index persistence
```

---

<div align="center">
  <p>Built with 💖 for the future of Agentic RAG.</p>
</div>