Spaces:

Al1Abdullah
/

AI_Chatbot_File_Web_Image_Audio

Sleeping

App Files Files Community

AI_Chatbot_File_Web_Image_Audio / README.md

Ali Abdullah

Update README.md

b6dd802 verified 7 months ago

preview code

raw

history blame contribute delete

1.43 kB

A newer version of the Gradio SDK is available: 6.4.0

Upgrade

metadata

title: AI Chatbot File Web Image Audio
emoji: 🤖
colorFrom: indigo
colorTo: pink
sdk: gradio
sdk_version: 5.34.2
app_file: app.py
pinned: false
license: mit
short_description: AI Chatbot using RAG from Files, URLs, Images & Audio

🧠 AI Chatbot with File, Web, Image & Audio Support (Gradio + Groq)

A multimodal AI assistant powered by Groq's LLaMA 3 that can answer questions using:

📄 Uploaded documents (.txt, .pdf, .docx, .csv)
🌐 Any public website URL (RAG retrieval)
🖼️ Images via OCR (Tesseract)
🎧 Audio files via transcription (Whisper)

🚀 Features

Chat with files (PDF, DOCX, TXT, CSV)
Question answering from website content
OCR-based text extraction from images
Speech-to-text from audio recordings
Maintains separate history for File & URL chat sessions

🛠️ Tech Stack

Gradio — User Interface
FastAPI — API Backend
Groq API — LLaMA 3 inference
Tesseract OCR — Image text extraction
Whisper — Audio transcription

📦 How to Run Locally

git clone https://github.com/your-username/your-repo.git
cd your-repo

# Install dependencies
pip install -r requirements.txt

# Start FastAPI backend
uvicorn main:app --reload

# Run Gradio frontend
python app.py