Spaces:

muddasser
/

Webscrapping_Playwright

Running

muddasser commited on Aug 27, 2025

Commit

6b33a86

verified ·

1 Parent(s): 03ce648

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -1,41 +1,48 @@
 ---
-title: WebScraping with Selenium + RAG
 emoji: 🕷️
 colorFrom: red
 colorTo: red
-sdk: streamlit
 app_file: app.py
 app_port: 8501
 tags:
-- streamlit
-- selenium
-- rag
-- flan-t5
-- web-scraping
 pinned: true
-short_description: A Streamlit RAG Selenium + FLAN-T5-small.
 ---
-# 🕷️ Web Scraping + RAG Chatbot 🚀
-This project combines **Selenium web scraping** with **Retrieval-Augmented Generation (RAG)** to build an intelligent chatbot.
-It can scrape live websites, index the content into a **FAISS vector store**, and let you ask natural language questions.
-### 🔹 Features
-- 🌐 Scrape dynamic websites using **Selenium**
-- 📚 Store & retrieve content with **FAISS embeddings**
-- 🧠 Answer questions using **FLAN-T5-small** (runs on CPU)
-- 🎨 Simple **Streamlit UI** for interaction
-### 🚀 Usage
-1. Enter a URL to scrape.
-2. The system extracts + indexes the text.
-3. Ask questions — the chatbot answers using RAG.
----
-👩‍💻 **Tech Stack**: `Streamlit`, `Selenium`, `LangChain`, `FAISS`, `Transformers`
-📖 Check the [docs](https://docs.streamlit.io) for customizing your Streamlit app.

 ---
+title: Web Scraping with Selenium + RAG
 emoji: 🕷️
 colorFrom: red
 colorTo: red
+sdk: docker
 app_file: app.py
 app_port: 8501
 tags:
+  - streamlit
+  - selenium
+  - rag
+  - flan-t5
+  - web-scraping
 pinned: true
+short_description: Selenium RAG using FLAN-T5-small
 ---
+# 🕷️ Web Scraping + RAG Chatbot
+This project combines **Selenium web scraping** with **Retrieval-Augmented Generation (RAG)** to build an intelligent chatbot that can extract information from websites and answer questions about the content.
+![Demo](https://img.shields.io/badge/Demo-Live%20Demo-blue)
+![Python](https://img.shields.io/badge/Python-3.10%2B-blue)
+![License](https://img.shields.io/badge/License-MIT-green)
+## ✨ Features
+- 🌐 **Web Scraping**: Extract content from dynamic websites using Selenium
+- 📚 **Vector Storage**: Index and retrieve content using FAISS embeddings
+- 🧠 **Question Answering**: Generate answers using FLAN-T5-small model
+- 🎨 **User-Friendly Interface**: Simple Streamlit UI for interaction
+- 🐳 **Dockerized**: Ready for deployment on Hugging Face Spaces
+## 🚀 Quick Start
+### Prerequisites
+- Python 3.10+
+- Docker (for containerized deployment)
+- Hugging Face account (for deployment)
+### Local Installation
+1. Clone the repository:
+```bash
+git clone https://huggingface.co/spaces/your-username/your-space-name
+cd your-space-name