---
title: AI Chatbot File Web Image Audio
emoji: 🤖
colorFrom: indigo
colorTo: pink
sdk: gradio
sdk_version: 5.34.2
app_file: app.py
pinned: false
license: mit
short_description: AI Chatbot with RAG - Ask from File, Web, Image, or Audio
---

# 🧠 AI Chatbot with File, Web, OCR & Audio (Gradio + Groq)

A multimodal AI assistant that can answer questions using content from:
- 📄 Uploaded `.txt`, `.pdf`, `.docx`, `.csv` files
- 🌐 Any website URL (RAG)
- 🖼️ Images (OCR with Tesseract)
- 🎧 Audio files (transcription with Whisper)
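
The four input types above imply some kind of routing step before the content reaches the model. The following is a minimal sketch of one way to do that, not the app's actual code: `classify_source` and the image and audio extension sets are hypothetical, and only the document extensions come from the list above.

```python
from pathlib import Path
from urllib.parse import urlparse

# Document extensions are taken from the list above.
DOCUMENT_EXTS = {".txt", ".pdf", ".docx", ".csv"}
# Assumption: the README does not say which image/audio formats are accepted.
IMAGE_EXTS = {".png", ".jpg", ".jpeg"}
AUDIO_EXTS = {".wav", ".mp3"}


def classify_source(source: str) -> str:
    """Return 'web', 'document', 'image', 'audio', or 'unknown' for a path or URL."""
    # Anything with an http(s) scheme is treated as a website to fetch.
    if urlparse(source).scheme in ("http", "https"):
        return "web"
    # Otherwise decide by file extension, case-insensitively.
    ext = Path(source).suffix.lower()
    if ext in DOCUMENT_EXTS:
        return "document"
    if ext in IMAGE_EXTS:
        return "image"
    if ext in AUDIO_EXTS:
        return "audio"
    return "unknown"
```

A caller could use the returned label to pick the matching extractor (text parsing, page fetch, OCR, or transcription) and reject `'unknown'` inputs early.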

---

## 🚀 Features

- Chat with files (PDF, DOCX, TXT, CSV)
- Extract info from websites
- Perform OCR on images
- Transcribe audio to text
- Keeps a separate chat history for each file and URL
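
Per-source chat history can be pictured as a mapping from a source identifier (a file name or URL) to its own list of question/answer turns. The sketch below is an illustration under that assumption only; `ChatHistoryStore` and its methods are hypothetical names, and the app may store history differently.

```python
from collections import defaultdict


class ChatHistoryStore:
    """Hypothetical in-memory store: one list of (question, answer) turns per source."""

    def __init__(self) -> None:
        self._histories: dict[str, list[tuple[str, str]]] = defaultdict(list)

    def append(self, source_id: str, question: str, answer: str) -> None:
        # source_id is assumed to be a file name or URL chosen by the caller.
        self._histories[source_id].append((question, answer))

    def get(self, source_id: str) -> list[tuple[str, str]]:
        # Return a copy so callers cannot mutate stored history by accident;
        # .get() avoids creating an empty entry for unseen sources.
        return list(self._histories.get(source_id, []))


store = ChatHistoryStore()
store.append("report.pdf", "What is the total?", "The total is in section 3.")
store.append("https://example.com", "Who wrote this?", "The page does not say.")
```

Keying by source means switching between a file and a URL in the UI does not mix their conversations.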

---

## 🛠️ Tech Stack

- [Gradio UI](https://gradio.app)
- [Groq LLaMA 3](https://groq.com/)
- [Tesseract OCR](https://github.com/tesseract-ocr)
- [OpenAI Whisper](https://github.com/openai/whisper)
- [FastAPI backend](https://fastapi.tiangolo.com/) (if used locally)

---

## 📦 How to Run Locally

```bash
git clone https://github.com/your-username/your-repo.git
cd your-repo
pip install -r requirements.txt
uvicorn main:app --reload     # FastAPI backend (keeps running; use a second terminal for the next command)
python app.py                 # Gradio frontend
```