---
title: AI Chatbot File Web Image Audio
emoji: 🤖
colorFrom: indigo
colorTo: pink
sdk: gradio
sdk_version: 5.34.2
app_file: app.py
pinned: false
license: mit
short_description: AI Chatbot with RAG - Ask from File, Web, Image, or Audio
---

# 🧠 AI Chatbot with File, Web, OCR & Audio (Gradio + Groq)

A multimodal AI assistant that answers questions using content from:
- Uploaded `.txt`, `.pdf`, `.docx`, `.csv` files
- Any website URL (RAG)
- Images (OCR with Tesseract)
- Audio files (transcription with Whisper)

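As a rough illustration of how the four input types listed above could be told apart before extraction, here is a minimal sketch. It is not taken from `app.py`: the function name `detect_source_kind` is hypothetical, and the image and audio extensions are assumptions, since only the document extensions are listed above.

```python
from pathlib import Path
from urllib.parse import urlparse

# Document extensions come from the list above; the image and audio
# extensions are assumptions chosen for illustration only.
DOCUMENT_EXTS = {".txt", ".pdf", ".docx", ".csv"}
IMAGE_EXTS = {".png", ".jpg", ".jpeg"}
AUDIO_EXTS = {".mp3", ".wav", ".m4a"}


def detect_source_kind(source: str) -> str:
    """Classify an input as 'web', 'document', 'image', or 'audio'."""
    # URLs are checked first so that a link ending in ".pdf" is still
    # treated as a web source in this sketch.
    if urlparse(source).scheme in ("http", "https"):
        return "web"
    ext = Path(source).suffix.lower()
    if ext in DOCUMENT_EXTS:
        return "document"
    if ext in IMAGE_EXTS:
        return "image"
    if ext in AUDIO_EXTS:
        return "audio"
    raise ValueError(f"Unsupported input: {source!r}")
```

Each kind would then be handed to its own extractor (text parsing, web scraping, Tesseract OCR, or Whisper transcription) before the text reaches the model.
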
---

## Features

- Chat with files (PDF, DOCX, TXT, CSV)
- Extract information from websites
- Perform OCR on images
- Transcribe audio to text
- Keep a separate chat history for each file and URL

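The per-file and per-URL chat history could be kept in a dictionary keyed by the source. The sketch below shows one possible way to do this and is not the app's actual implementation; the class name `SourceChatHistory` and its methods are hypothetical.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


class SourceChatHistory:
    """Stores (question, answer) turns separately for each file name or URL."""

    def __init__(self) -> None:
        self._turns: Dict[str, List[Tuple[str, str]]] = defaultdict(list)

    def add_turn(self, source_key: str, question: str, answer: str) -> None:
        # The key is whatever identifies the source, e.g. a file name or a URL.
        self._turns[source_key].append((question, answer))

    def get(self, source_key: str) -> List[Tuple[str, str]]:
        # Return a copy so callers cannot modify the stored history by accident.
        return list(self._turns.get(source_key, []))
```

Switching to another file or URL then only requires looking up a different key, so earlier conversations stay intact.
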
---

## 🛠️ Tech Stack

- [Gradio UI](https://gradio.app)
- [Groq LLaMA 3](https://groq.com/)
- [Tesseract OCR](https://github.com/tesseract-ocr)
- [OpenAI Whisper](https://github.com/openai/whisper)
- [FastAPI backend](https://fastapi.tiangolo.com/) (if used locally)

---

## 📦 How to Run Locally

```bash
git clone https://github.com/your-username/your-repo.git
cd your-repo
pip install -r requirements.txt
uvicorn main:app --reload  # FastAPI backend; keeps running, so use a second terminal for the next command
python app.py              # Gradio frontend