metadata
title: AI Chatbot File Web Image Audio
emoji: π€
colorFrom: indigo
colorTo: pink
sdk: gradio
sdk_version: 5.34.2
app_file: app.py
pinned: false
license: mit
short_description: AI Chatbot with RAG β Ask from File, Web, Image, or Audio
π§ AI Chatbot with File, Web, OCR & Audio (Gradio + Groq)
A multimodal AI assistant that can answer questions using content from:
- π Uploaded
.txt,.pdf,.docx,.csvfiles - π Any website URL (RAG)
- πΌοΈ Images (OCR with Tesseract)
- π§ Audio files (transcription with Whisper)
π Features
- Chat with files (PDF, DOCX, TXT, CSV)
- Extract info from websites
- Perform OCR on images
- Transcribe audio to text
- Keeps file and URL-specific chat history
π οΈ Tech Stack
- Gradio UI
- Groq LLaMA 3
- Tesseract OCR
- OpenAI Whisper
- FastAPI backend (if used locally)
π¦ How to Run Locally
git clone https://github.com/your-username/your-repo.git
cd your-repo
pip install -r requirements.txt
uvicorn main:app --reload # FastAPI backend
python app.py # Gradio frontend