Space: Abeshith/RAG-Project

github-actions[bot] committed on Dec 4, 2025

Commit dfa6a46

1 Parent(s): db5b6fe

Deploy from GitHub Actions

Files changed (8)

Dockerfile +6 -4
README.md +173 -34
project/model/reranking.py +1 -1
project/model/retriever.py +3 -4
project/pipeline/agents.py +1 -1
project/pipeline/rag.py +1 -1
project/source/data_preparation.py +1 -1
requirements.txt +1 -1

Dockerfile CHANGED

@@ -2,14 +2,16 @@ FROM python:3.11-slim
 WORKDIR /app
-COPY requirements.txt ./
-RUN pip install --no-cache-dir -r requirements.txt
 COPY . .
-EXPOSE 7860
 ENV PYTHONUNBUFFERED=1
-CMD ["uvicorn", "app:app", "--host", "0.0.0.0", "--port", "7860"]

 WORKDIR /app
+COPY pyproject.toml ./
+COPY requirements.txt* ./
+RUN pip install --no-cache-dir uv && \
+    uv pip install --system --no-cache -r pyproject.toml
 COPY . .
+EXPOSE 8000
 ENV PYTHONUNBUFFERED=1
+CMD ["python", "app.py"]
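
Note on the new entry point: with `CMD ["python", "app.py"]` the server is no longer started by the `uvicorn` CLI, so `app.py` presumably launches it itself on port 8000. The sketch below shows one way such a script might pick its bind address. The `HOST` and `PORT` variable names and the `resolve_bind` helper are assumptions for illustration; they do not appear in this commit.

```python
import os
from typing import Mapping, Tuple


def resolve_bind(env: Mapping[str, str] = os.environ) -> Tuple[str, int]:
    """Return (host, port) for the web server.

    HOST and PORT are hypothetical variable names; the defaults mirror the
    Dockerfile in this commit (all interfaces, port 8000).
    """
    host = env.get("HOST", "0.0.0.0")
    port = int(env.get("PORT", "8000"))
    return host, port
```

Inside `app.py`, a main guard could then call `uvicorn.run(app, host=host, port=port)` with the values returned by `resolve_bind()`, assuming `app` is the FastAPI instance.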

README.md CHANGED

@@ -1,47 +1,186 @@
----
-title: RAG Project - Learn with Transformers
-emoji: 🤖
-colorFrom: purple
-colorTo: blue
-sdk: docker
-pinned: false
----
-# RAG Project - Learn with Transformers
-A production-ready Corrective Retrieval-Augmented Generation (CRAG) system built with LangChain, LangGraph, and FastAPI.
-## Features
-- **Intelligent Document Grading**: LLM evaluates retrieved documents for relevance
-- **Query Transformation**: Rewrites queries for better retrieval
-- **Web Search Fallback**: Tavily API integration when local docs insufficient
-- **Advanced Retrieval**: FAISS + FastEmbed + FlashRank reranking
-- **Agent Workflow**: LangGraph state machine with conditional routing
 ## Tech Stack
-- **LLM**: Groq (openai/gpt-oss-120b)
-- **Embeddings**: FastEmbed (BAAI/bge-small-en-v1.5)
-- **Vector Store**: FAISS
-- **Reranker**: FlashRank (rank-T5-flan)
-- **Framework**: LangChain 0.3.x + LangGraph
-- **Web**: FastAPI + Uvicorn
-## Environment Variables
-Required secrets (set in Space Settings):
-- `GROQ_API_KEY`
-- `GOOGLE_API_KEY`
-- `LANGSMITH_API_KEY` (optional)
-- `TAVILY_API_KEY` (optional)
 ## How It Works
-1. User query → FAISS retrieval with MMR
-2. FlashRank reranking
-3. LLM grades document relevance
-4. If poor quality → Transform query + Web search
-5. Generate answer with Groq LLM
-The app will automatically download "Attention Is All You Need" paper from ArXiv on first run.

+# RAG Project - Learn About Transformers
+A production-ready **Corrective Retrieval-Augmented Generation (CRAG)** system built with LangChain, LangGraph, and FastAPI. This project implements an intelligent RAG pipeline that not only retrieves relevant documents but also **validates, corrects, and improves** retrieval quality through an agent-based workflow.
+## What Makes This Different from Traditional RAG?
+### Traditional RAG:
+```
+Query → Retrieve Documents → Generate Answer
+```
+**Problem**: If retrieved documents are irrelevant or low-quality, the answer will be poor.
+### This Project (Corrective RAG):
+```
+Query → Retrieve → Grade Quality → Transform Query if Needed → Web Search if Necessary → Generate
+```
+**Solution**: Intelligent agent workflow that **self-corrects** by grading document relevance and taking corrective actions.
+## Architecture
+```mermaid
+graph LR
+    A[User Query] --> B[Retrieve]
+    B --> C[FAISS+MMR]
+    C --> D[Rerank]
+    D --> E{Grade}
+    E -->|Relevant| F[Generate]
+    E -->|Partial| G[Filter]
+    E -->|Poor| H[Transform]
+    G --> F
+    H --> I[Web Search]
+    I --> F
+    F --> J[Groq LLM]
+    J --> K[Answer]
+```
+## Key Features
+### 1. **Intelligent Document Grading**
+- LLM evaluates retrieved documents for relevance
+- Filters out low-quality results automatically
+- Ensures only useful context reaches generation
+### 2. **Query Transformation**
+- Rewrites ambiguous or poor queries
+- Improves retrieval on second attempt
+- Adaptive query refinement
+### 3. **Web Search Fallback**
+- Tavily API integration for external knowledge
+- Activates when local documents insufficient
+- Combines local + web results
+### 4. **Advanced Retrieval Stack**
+- **FAISS** vector store with MMR search
+- **FastEmbed** (BAAI/bge-small-en-v1.5) embeddings
+- **FlashRank** (rank-T5-flan) reranking
+- Self-query retriever support
+### 5. **LangGraph Agent Workflow**
+- State machine orchestration
+- Conditional routing logic
+- Transparent decision-making
 ## Tech Stack
+| Component | Technology |
+|-----------|------------|
+| **LLM** | Groq (openai/gpt-oss-120b) |
+| **Embeddings** | FastEmbed (BAAI/bge-small-en-v1.5) |
+| **Vector Store** | FAISS |
+| **Reranker** | FlashRank (rank-T5-flan) |
+| **Agent Framework** | LangGraph |
+| **RAG Framework** | LangChain 0.3.x |
+| **Web Search** | Tavily API |
+| **Web Framework** | FastAPI + Uvicorn |
+| **Observability** | LangSmith (optional) |
+| **Document Source** | "Attention Is All You Need" (Transformer paper) |
+## Project Structure
+```
+RAG Project/
+├── project/
+│   ├── config/
+│   │   └── config.yaml              # Model & pipeline configuration
+│   ├── logger/
+│   │   └── logging.py               # Centralized logging
+│   ├── exception/
+│   │   └── except.py                # Custom exception handling
+│   ├── utils/
+│   │   ├── config_loader.py         # YAML config loader
+│   │   └── model_loader.py          # LLM & embedding initialization
+│   ├── source/
+│   │   └── data_preparation.py      # PDF/ArXiv document loading
+│   ├── model/
+│   │   ├── retriever.py             # FAISS retriever with MMR
+│   │   └── reranking.py             # FlashRank reranking
+│   ├── prompts/
+│   │   └── prompt_template.py       # RAG, Router, WebSearch prompts
+│   └── pipeline/
+│       ├── rag.py                   # Core RAG pipeline
+│       └── agents.py                # CRAG agent workflow
+├── templates/
+│   └── index.html                   # Web UI template
+├── static/
+│   └── styles.css                   # Purple gradient theme
+├── data/
+│   └── attention-is-all-you-need.pdf
+├── app.py                           # FastAPI application
+├── main.py                          # CLI entry point
+├── Dockerfile                       # Docker containerization
+└── requirements.txt                 # Dependencies
+```
+## Quick Start
+### 1. Clone & Install
+```bash
+git clone https://github.com/Abeshith/RAG-Project-PipeLine.git
+cd RAG-Project-PipeLine
+pip install -r requirements.txt
+```
+### 2. Set Environment Variables
+Create `.env` file:
+```env
+GROQ_API_KEY=your_groq_api_key
+GOOGLE_API_KEY=your_google_api_key
+LANGSMITH_API_KEY=your_langsmith_key
+TAVILY_API_KEY=your_tavily_key
+```
+### 3. Run Web Interface
+```bash
+python app.py
+```
+Visit: http://localhost:8000
+### 4. Run CLI
+```bash
+python main.py
+```
+## Docker Deployment
+### Build & Run
+```bash
+docker build -t rag-project .
+docker run -d -p 8000:8000 --env-file .env rag-project
+```
 ## How It Works
+### Workflow Example
+**Query**: "What is the attention mechanism in transformers?"
+1. **Retrieval**: FAISS finds top 3 most similar chunks from "Attention Is All You Need" paper
+2. **Reranking**: FlashRank reorders by relevance (top 3 kept)
+3. **Grading**: LLM evaluates each document:
+   - ✅ Doc 1: Relevant (explains attention)
+   - ✅ Doc 2: Relevant (shows formula)
+   - ❌ Doc 3: Not relevant (talks about training data)
+4. **Decision**: 2/3 relevant → Use filtered docs
+5. **Generation**: Groq LLM synthesizes answer from relevant docs
+6. **Output**: Comprehensive answer with LaTeX formulas (rendered via MathJax)
+### When Retrieval Fails
+**Query**: "What are the latest improvements to transformers in 2024?"
+1. **Retrieval**: Finds documents from 2017 paper
+2. **Grading**: ❌ All documents marked "not relevant" (outdated info)
+3. **Transform**: Rewrites query → "Recent transformer architecture improvements 2024"
+4. **Web Search**: Tavily searches current web content
+5. **Generation**: Answer combines paper fundamentals + recent developments
+## Web Interface Features
+- **Modern UI**: Purple gradient design with responsive layout
+- **MathJax Integration**: Renders LaTeX formulas beautifully
+- **Transformer Visualization**: Architecture diagram in header
+- **Real-time Search**: Fast async FastAPI backend
+- **Error Handling**: Graceful degradation with user-friendly messages
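
Note on the grading step: the README's architecture diagram routes to Generate, Filter, or Transform depending on how many retrieved documents the LLM grades as relevant. A minimal sketch of that decision in plain Python follows. The function names and the exact thresholds (all relevant, some relevant, none relevant) are assumptions inferred from the README text, not the project's `agents.py`.

```python
from typing import List, Literal

Route = Literal["generate", "filter", "transform"]


def route_after_grading(grades: List[bool]) -> Route:
    """Pick the next workflow step from per-document relevance grades."""
    relevant = sum(grades)  # True counts as 1
    if grades and relevant == len(grades):
        return "generate"   # every document is relevant
    if relevant > 0:
        return "filter"     # keep only the relevant documents, then generate
    return "transform"      # nothing usable: rewrite the query, then web search


def keep_relevant(docs: List[str], grades: List[bool]) -> List[str]:
    """Drop the documents that were graded as not relevant."""
    return [doc for doc, ok in zip(docs, grades) if ok]
```

For the README's first worked example (two of three documents relevant), `route_after_grading([True, True, False])` selects the filter branch, and `keep_relevant` passes the two relevant documents on to generation.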

project/model/reranking.py CHANGED

@@ -1,5 +1,5 @@
 from typing import List
-from langchain_core.documents import Document
 from flashrank.Ranker import Ranker, RerankRequest
 from project.utils.config_loader import load_config
 from project.logger.logging import get_logger

 from typing import List
+from langchain.schema import Document
 from flashrank.Ranker import Ranker, RerankRequest
 from project.utils.config_loader import load_config
 from project.logger.logging import get_logger
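
Note on the import change: this commit moves `Document` from `langchain_core.documents` to `langchain.schema` in five files. Which path resolves depends on the installed LangChain version, so a tolerant import can be useful while versions are in flux. The helper below is a hypothetical sketch of that idea and is not part of the commit.

```python
import importlib
from typing import Any, Iterable, Tuple


def import_first(candidates: Iterable[Tuple[str, str]]) -> Any:
    """Return the first attribute that can be imported from (module, name) pairs."""
    errors = []
    for module_path, attr in candidates:
        try:
            module = importlib.import_module(module_path)
            return getattr(module, attr)
        except (ImportError, AttributeError) as exc:
            # Remember why this candidate failed and try the next one.
            errors.append(f"{module_path}.{attr}: {exc}")
    raise ImportError("no candidate could be imported: " + "; ".join(errors))
```

A module could then write `Document = import_first([("langchain_core.documents", "Document"), ("langchain.schema", "Document")])` and work with either layout.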

project/model/retriever.py CHANGED

@@ -1,8 +1,8 @@
 from typing import List, Optional
-from langchain_core.documents import Document
 from langchain_community.vectorstores import FAISS
-from langchain.chains.query_constructor.schema import AttributeInfo
-from langchain.retrievers import SelfQueryRetriever
 from project.utils.model_loader import ModelLoader
 from project.utils.config_loader import load_config
 from project.logger.logging import get_logger
@@ -91,4 +91,3 @@ class DocumentRetriever:
         logger.info(f"Base retriever configured with {search_type} search")
         return self.retriever

 from typing import List, Optional
+from langchain.schema import Document
 from langchain_community.vectorstores import FAISS
+from langchain.chains.query_constructor.base import AttributeInfo
+from langchain.retrievers.self_query.base import SelfQueryRetriever
 from project.utils.model_loader import ModelLoader
 from project.utils.config_loader import load_config
 from project.logger.logging import get_logger
         logger.info(f"Base retriever configured with {search_type} search")
         return self.retriever
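
Note on `search_type`: the README describes the base retriever as FAISS with MMR (maximal marginal relevance) search. As a reminder of what MMR does, here is a small self-contained sketch in plain Python: each step picks the candidate that best trades similarity to the query against similarity to what is already selected. The function names and the `lambda_mult` default are illustrative assumptions; the project itself delegates this to the vector store.

```python
import math
from typing import List, Sequence


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors (0.0 if either has zero length)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def mmr_select(query: Sequence[float],
               candidates: Sequence[Sequence[float]],
               k: int = 3,
               lambda_mult: float = 0.5) -> List[int]:
    """Return indices of up to k candidates chosen by maximal marginal relevance."""
    selected: List[int] = []
    remaining = list(range(len(candidates)))
    while remaining and len(selected) < k:
        def score(i: int) -> float:
            relevance = cosine(query, candidates[i])
            # Highest similarity to anything already picked (0.0 if none yet).
            redundancy = max(
                (cosine(candidates[i], candidates[j]) for j in selected),
                default=0.0,
            )
            return lambda_mult * relevance - (1 - lambda_mult) * redundancy
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected
```

With `lambda_mult=1.0` the selection reduces to plain similarity ranking; lower values penalize near-duplicate chunks more heavily.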

project/pipeline/agents.py CHANGED

@@ -2,7 +2,7 @@ import os
 from typing import List, Literal
 from typing_extensions import TypedDict
 from pydantic import BaseModel, Field
-from langchain_core.documents import Document
 from langchain_core.output_parsers import StrOutputParser
 from langgraph.graph import END, StateGraph, START
 from project.pipeline.rag import RAGPipeline

 from typing import List, Literal
 from typing_extensions import TypedDict
 from pydantic import BaseModel, Field
+from langchain.schema import Document
 from langchain_core.output_parsers import StrOutputParser
 from langgraph.graph import END, StateGraph, START
 from project.pipeline.rag import RAGPipeline

project/pipeline/rag.py CHANGED

@@ -1,5 +1,5 @@
 from typing import List, Dict, Any
-from langchain_core.documents import Document
 from langchain_core.output_parsers import StrOutputParser
 from langchain_core.runnables import RunnablePassthrough
 from project.source.data_preparation import DataPreparation

 from typing import List, Dict, Any
+from langchain.schema import Document
 from langchain_core.output_parsers import StrOutputParser
 from langchain_core.runnables import RunnablePassthrough
 from project.source.data_preparation import DataPreparation

project/source/data_preparation.py CHANGED

@@ -3,7 +3,7 @@ from pathlib import Path
 from typing import List, Optional
 from langchain_community.document_loaders import PyPDFLoader, ArxivLoader
 from langchain_text_splitters import RecursiveCharacterTextSplitter
-from langchain_core.documents import Document
 from project.logger.logging import get_logger
 logger = get_logger(__name__)

 from typing import List, Optional
 from langchain_community.document_loaders import PyPDFLoader, ArxivLoader
 from langchain_text_splitters import RecursiveCharacterTextSplitter
+from langchain.schema import Document
 from project.logger.logging import get_logger
 logger = get_logger(__name__)

requirements.txt CHANGED

@@ -13,9 +13,9 @@ langchain-google-genai>=2.0.5
 langchain-groq>=0.2.0
 langgraph>=0.2.0
 pypdf>=6.4.0
 python-dotenv>=1.2.1
 python-multipart>=0.0.20
 rapidocr-onnxruntime>=1.4.4
 tiktoken>=0.12.0
 uvicorn>=0.34.0
-pymupdf

 langchain-groq>=0.2.0
 langgraph>=0.2.0
 pypdf>=6.4.0
+pymupdf
 python-dotenv>=1.2.1
 python-multipart>=0.0.20
 rapidocr-onnxruntime>=1.4.4
 tiktoken>=0.12.0
 uvicorn>=0.34.0