Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
|
@@ -1,32 +1,43 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# ๐ฌ AI Research Companion (Groq + LangChain + FAISS)
|
| 2 |
|
| 3 |
-
An advanced **AI-powered research assistant** that helps you analyze academic papers, ask natural language questions, generate engaging summaries, and discover related research
|
| 4 |
|
| 5 |
---
|
| 6 |
|
| 7 |
## ๐ Features
|
| 8 |
|
| 9 |
-
|
| 10 |
-
|
| 11 |
-
|
| 12 |
-
|
| 13 |
-
|
| 14 |
-
|
| 15 |
-
|
| 16 |
|
| 17 |
---
|
| 18 |
|
| 19 |
## ๐ Tech Stack
|
| 20 |
|
| 21 |
-
-
|
| 22 |
-
-
|
| 23 |
-
-
|
| 24 |
-
-
|
| 25 |
-
-
|
| 26 |
-
-
|
| 27 |
-
-
|
| 28 |
-
-
|
| 29 |
-
-
|
| 30 |
|
| 31 |
---
|
| 32 |
|
|
@@ -37,58 +48,3 @@ An advanced **AI-powered research assistant** that helps you analyze academic pa
|
|
| 37 |
```bash
|
| 38 |
git clone https://github.com/bobbythomas985/Research_Assistant
|
| 39 |
cd Research_Assistant
|
| 40 |
-
```
|
| 41 |
-
### 2๏ธโฃ Install Dependencies
|
| 42 |
-
```bash
|
| 43 |
-
pip install -r requirements.txt
|
| 44 |
-
```
|
| 45 |
-
### 3๏ธโฃ Set Up Your API Key
|
| 46 |
-
Export your Groq API key as an environment variable:
|
| 47 |
-
**Linux / macOS**
|
| 48 |
-
```bash
|
| 49 |
-
export GROQ_API_KEY="your_api_key_here"
|
| 50 |
-
```
|
| 51 |
-
**Windows**
|
| 52 |
-
```powershell
|
| 53 |
-
setx GROQ_API_KEY "your_api_key_here"
|
| 54 |
-
```
|
| 55 |
-
Alternatively, replace the placeholder in **app.py**:
|
| 56 |
-
```python
|
| 57 |
-
GROQ_API_KEY = os.getenv("GROQ_API_KEY", "your-api-key")
|
| 58 |
-
```
|
| 59 |
-
### 4๏ธโฃ Run the App
|
| 60 |
-
```bash
|
| 61 |
-
python app.py
|
| 62 |
-
```
|
| 63 |
-
|
| 64 |
-
---
|
| 65 |
-
|
| 66 |
-
## ๐ฅ๏ธ How It Works
|
| 67 |
-
|
| 68 |
-
1๏ธโฃ **Upload a PDF**
|
| 69 |
-
๐ The system extracts all text from the research paper.
|
| 70 |
-
|
| 71 |
-
2๏ธโฃ **Process & Embed**
|
| 72 |
-
๐ Splits the extracted text into overlapping chunks and creates a **FAISS vector index** using **HuggingFace embeddings** for efficient semantic search.
|
| 73 |
-
|
| 74 |
-
3๏ธโฃ **Ask Questions**
|
| 75 |
-
โ User questions are converted into embeddings and matched with the most relevant chunks from the document.
|
| 76 |
-
|
| 77 |
-
4๏ธโฃ **LLM Answer Generation**
|
| 78 |
-
๐ค Groqโs `llama-3.3-70b-versatile` model is used to generate accurate, context-aware answers with a custom prompt.
|
| 79 |
-
|
| 80 |
-
5๏ธโฃ **Summarize & Discover Papers**
|
| 81 |
-
๐ Generates engaging, structured summaries of the document and retrieves similar papers from **arXiv** for further reading.
|
| 82 |
-
|
| 83 |
-
|
| 84 |
-
## ๐ฎ Future Improvements
|
| 85 |
-
|
| 86 |
-
- ๐ **Multi-document support** โ Build a single knowledge base from multiple PDFs
|
| 87 |
-
- ๐ท **LLM Reranking** โ Use cross-encoder reranking for better context selection
|
| 88 |
-
- ๐ **Clickable Source References** โ Jump directly to relevant sections inside the PDF
|
| 89 |
-
- ๐ **Deploy on Hugging Face Spaces / Streamlit Cloud** โ Make it public and shareable
|
| 90 |
-
- ๐ **Multilingual Q&A** โ Integrate translation for global research accessibility
|
| 91 |
-
|
| 92 |
-
---
|
| 93 |
-
|
| 94 |
-
> *Empowering researchers to go from papers โ insights โ new discoveries.*
|
|
|
|
| 1 |
+
---
|
| 2 |
+
title: AI Research Companion
|
| 3 |
+
emoji: ๐ฌ
|
| 4 |
+
colorFrom: indigo
|
| 5 |
+
colorTo: blue
|
| 6 |
+
sdk: gradio
|
| 7 |
+
sdk_version: 4.0.0
|
| 8 |
+
app_file: app.py
|
| 9 |
+
pinned: false
|
| 10 |
+
---
|
| 11 |
+
|
| 12 |
# ๐ฌ AI Research Companion (Groq + LangChain + FAISS)
|
| 13 |
|
| 14 |
+
An advanced **AI-powered research assistant** that helps you analyze academic papers, ask natural language questions, generate engaging summaries, and discover related research โ all from a modern, tabbed **Gradio interface**.
|
| 15 |
|
| 16 |
---
|
| 17 |
|
| 18 |
## ๐ Features
|
| 19 |
|
| 20 |
+
- **PDF Upload & Text Extraction** โ Extracts full text from research papers
|
| 21 |
+
- **Chunking & Vector Embedding** โ Uses LangChain + HuggingFace embeddings for semantic search
|
| 22 |
+
- **Groq LLM Q&A** โ Powered by `llama-3.3-70b-versatile` for accurate answers
|
| 23 |
+
- **Cited Source References** โ Shows the exact text chunks used
|
| 24 |
+
- **Research Paper Summaries** โ Creates clear, engaging summaries
|
| 25 |
+
- **Similar Paper Discovery** โ Uses arXiv API to find related studies
|
| 26 |
+
- **Beautiful Multi-Tab UI** โ Custom-styled Gradio interface
|
| 27 |
|
| 28 |
---
|
| 29 |
|
| 30 |
## ๐ Tech Stack
|
| 31 |
|
| 32 |
+
- Python 3.9+
|
| 33 |
+
- Gradio
|
| 34 |
+
- LangChain
|
| 35 |
+
- FAISS
|
| 36 |
+
- HuggingFace Sentence Transformers (`all-mpnet-base-v2`)
|
| 37 |
+
- Groq API
|
| 38 |
+
- PyPDF2
|
| 39 |
+
- Feedparser
|
| 40 |
+
- Custom CSS
|
| 41 |
|
| 42 |
---
|
| 43 |
|
|
|
|
| 48 |
```bash
|
| 49 |
git clone https://github.com/bobbythomas985/Research_Assistant
|
| 50 |
cd Research_Assistant
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|