File size: 1,252 Bytes
d7f8a47 b2204d1 d7f8a47 b2204d1 d7f8a47 b2204d1 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 | ---
title: AI Chatbot File Web Image Audio
emoji: π€
colorFrom: indigo
colorTo: pink
sdk: gradio
sdk_version: 5.34.2
app_file: app.py
pinned: false
license: mit
short_description: AI Chatbot with RAG β Ask from File, Web, Image, or Audio
---
# π§ AI Chatbot with File, Web, OCR & Audio (Gradio + Groq)
A multimodal AI assistant that can answer questions using content from:
- π Uploaded `.txt`, `.pdf`, `.docx`, `.csv` files
- π Any website URL (RAG)
- πΌοΈ Images (OCR with Tesseract)
- π§ Audio files (transcription with Whisper)
---
## π Features
- Chat with files (PDF, DOCX, TXT, CSV)
- Extract info from websites
- Perform OCR on images
- Transcribe audio to text
- Keeps file and URL-specific chat history
---
## π οΈ Tech Stack
- [Gradio UI](https://gradio.app)
- [Groq LLaMA 3](https://groq.com/)
- [Tesseract OCR](https://github.com/tesseract-ocr)
- [OpenAI Whisper](https://github.com/openai/whisper)
- [FastAPI backend](https://fastapi.tiangolo.com/) (if used locally)
---
## π¦ How to Run Locally
```bash
git clone https://github.com/your-username/your-repo.git
cd your-repo
pip install -r requirements.txt
uvicorn main:app --reload # FastAPI backend
python app.py # Gradio frontend
|