Spaces: Al1Abdullah/AI_Chatbot_File_Web_Image_Audio
Ali Abdullah committed on Jun 26, 2025

Commit b6dd802 · verified · 1 Parent(s): 0812f0d

Update README.md

Files changed (1)

README.md CHANGED

@@ -8,36 +8,37 @@ sdk_version: 5.34.2
 app_file: app.py
 pinned: false
 license: mit
-short_description: AI Chatbot with RAG — Ask from File, Web, Image, or Audio
 ---
-# 🧠 AI Chatbot with File, Web, OCR & Audio (Gradio + Groq)
-A multimodal AI assistant that can answer questions using content from:
-- 📄 Uploaded `.txt`, `.pdf`, `.docx`, `.csv` files
-- 🌐 Any website URL (RAG)
-- 🖼️ Images (OCR with Tesseract)
-- 🎧 Audio files (transcription with Whisper)
 ---
 ## 🚀 Features
 - Chat with files (PDF, DOCX, TXT, CSV)
-- Extract info from websites
-- Perform OCR on images
-- Transcribe audio to text
-- Keeps file and URL-specific chat history
 ---
 ## 🛠️ Tech Stack
-- [Gradio UI](https://gradio.app)
-- [Groq LLaMA 3](https://groq.com/)
-- [Tesseract OCR](https://github.com/tesseract-ocr)
-- [OpenAI Whisper](https://github.com/openai/whisper)
-- [FastAPI backend](https://fastapi.tiangolo.com/) (if used locally)
 ---
@@ -46,6 +47,12 @@ A multimodal AI assistant that can answer questions using content from:
 ```bash
 git clone https://github.com/your-username/your-repo.git
 cd your-repo
 pip install -r requirements.txt
-uvicorn main:app --reload     # FastAPI backend
-python app.py                 # Gradio frontend

 app_file: app.py
 pinned: false
 license: mit
+short_description: AI Chatbot using RAG from Files, URLs, Images & Audio
 ---
+# 🧠 AI Chatbot with File, Web, Image & Audio Support (Gradio + Groq)
+A multimodal AI assistant powered by Groq's LLaMA 3 that can answer questions using:
+- 📄 Uploaded documents (`.txt`, `.pdf`, `.docx`, `.csv`)
+- 🌐 Any public website URL (RAG retrieval)
+- 🖼️ Images via OCR (Tesseract)
+- 🎧 Audio files via transcription (Whisper)
 ---
 ## 🚀 Features
 - Chat with files (PDF, DOCX, TXT, CSV)
+- Question answering from website content
+- OCR-based text extraction from images
+- Speech-to-text from audio recordings
+- Maintains separate history for File & URL chat sessions
 ---
 ## 🛠️ Tech Stack
+- [Gradio](https://gradio.app) — User Interface
+- [FastAPI](https://fastapi.tiangolo.com/) — API Backend
+- [Groq API](https://groq.com/) — LLaMA 3 inference
+- [Tesseract OCR](https://github.com/tesseract-ocr) — Image text extraction
+- [Whisper](https://github.com/openai/whisper) — Audio transcription
 ---
 ```bash
 git clone https://github.com/your-username/your-repo.git
 cd your-repo
+# Install dependencies
 pip install -r requirements.txt
+# Start FastAPI backend
+uvicorn main:app --reload
+# Run Gradio frontend
+python app.py