Spaces:

Al1Abdullah
/

AI_Chatbot_File_Web_Image_Audio

Sleeping

Ali Abdullah

Update README.md

b6dd802 verified 7 months ago

1.43 kB

	---
	title: AI Chatbot File Web Image Audio
	emoji: 🤖
	colorFrom: indigo
	colorTo: pink
	sdk: gradio
	sdk_version: 5.34.2
	app_file: app.py
	pinned: false
	license: mit
	short_description: AI Chatbot using RAG from Files, URLs, Images & Audio
	---

	# 🧠 AI Chatbot with File, Web, Image & Audio Support (Gradio + Groq)

	A multimodal AI assistant powered by Groq's LLaMA 3 that can answer questions using:

	- 📄 Uploaded documents (`.txt`, `.pdf`, `.docx`, `.csv`)
	- 🌐 Any public website URL (RAG retrieval)
	- 🖼️ Images via OCR (Tesseract)
	- 🎧 Audio files via transcription (Whisper)

	---

	## 🚀 Features

	- Chat with files (PDF, DOCX, TXT, CSV)
	- Question answering from website content
	- OCR-based text extraction from images
	- Speech-to-text from audio recordings
	- Maintains separate history for File & URL chat sessions

	---

	## 🛠️ Tech Stack

	- [Gradio](https://gradio.app) — User Interface
	- [FastAPI](https://fastapi.tiangolo.com/) — API Backend
	- [Groq API](https://groq.com/) — LLaMA 3 inference
	- [Tesseract OCR](https://github.com/tesseract-ocr) — Image text extraction
	- [Whisper](https://github.com/openai/whisper) — Audio transcription

	---

	## 📦 How to Run Locally

	```bash
	git clone https://github.com/your-username/your-repo.git
	cd your-repo

	# Install dependencies
	pip install -r requirements.txt

	# Start FastAPI backend
	uvicorn main:app --reload

	# Run Gradio frontend
	python app.py