File size: 954 Bytes
eea2580
 
 
 
 
 
 
 
 
 
bf10662
eea2580
bf10662
 
eea2580
bf10662
eea2580
bf10662
eea2580
bf10662
eea2580
bf10662
eea2580
bf10662
eea2580
 
 
 
 
 
 
bf10662
aee5618
 
eea2580
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
---
title: PDF Chatbot  RAG Pipeline
emoji: 📄
colorFrom: indigo
colorTo: purple
sdk: streamlit
sdk_version: "1.35.0"
app_file: app.py
pinned: false
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference


# 📄 PDF Chatbot – RAG Pipeline

This Space hosts an end-to-end **Retrieval-Augmented Generation (RAG)** pipeline that allows users to upload PDFs and ask questions about their content.

The system extracts text, chunks it intelligently, embeds it into a vector database, and retrieves relevant context to answer queries using a large language model (LLM).

---

## 🚀 Features

- 🔹 PDF upload support  
- 🔹 Automatic text extraction  
- 🔹 Smart document chunking  
- 🔹 Vector storage using ChromaDB  
- 🔹 LLM-powered question answering  
- 🔹 Streamlit-based interface  
- 🔹 Clean RAG pipeline implementation (`src/rag_pipeline.py`)

---

## 🏗️ Project Structure