Spaces:
Paused
Paused
| title: Smart Web Monitor | |
| emoji: π | |
| colorFrom: blue | |
| colorTo: purple | |
| sdk: docker | |
| pinned: false | |
| app_port: 7860 | |
| # π Smart Web Monitor | |
| **AI-Powered Website Change Detection System** | |
| Monitor websites for changes automatically with AI-driven content analysis. Perfect for tracking competitor updates, news sites, or any web content you care about. | |
| ## β¨ Features | |
| - π **Automated Monitoring**: Check websites every 5 minutes automatically | |
| - πΈ **Content Hash Detection**: Track changes via MD5 hash comparison | |
| - π€ **AI Sentiment Analysis**: Powered by DistilBERT model from HuggingFace | |
| - π **Manual Checks**: Instant verification anytime | |
| - π **History Tracking**: Review all past checks | |
| - π― **Multi-URL Support**: Monitor unlimited websites | |
| ## π€ AI Technology | |
| This project uses **real HuggingFace Transformers**: | |
| - Model: `distilbert-base-uncased-finetuned-sst-2-english` | |
| - Task: Sentiment Analysis (POSITIVE/NEGATIVE classification) | |
| - Purpose: Detect tone changes in web content over time | |
| ## π Quick Start | |
| 1. **Add URLs**: Go to "Monitor Management" tab and add websites | |
| 2. **Auto-Check**: System automatically checks every 5 minutes | |
| 3. **Manual Check**: Use "Manual Check" tab for instant verification | |
| 4. **View History**: Check "History" tab to see all results | |
| ## π Use Cases | |
| - π° News monitoring | |
| - π’ Competitor tracking | |
| - π° Price change alerts | |
| - π Content update detection | |
| - π Government notice tracking | |
| ## π οΈ Technology Stack | |
| - **Frontend**: Gradio 4.x | |
| - **Backend**: Python 3.11 | |
| - **Browser Engine**: Chromium (for advanced scraping) | |
| - **Deployment**: HuggingFace Spaces (Docker SDK) | |
| ## βοΈ Configuration | |
| Set these environment variables in HuggingFace Spaces settings: | |
| ```bash | |
| # Optional: Custom check interval (default: 5 minutes) | |
| CHECK_INTERVAL=300 | |
| # Optional: Maximum URLs to monitor (default: 50) | |
| MAX_URLS=50 | |
| ``` | |
| ## π How It Works | |
| 1. **Hash-Based Detection**: Each check computes MD5 hash of page content | |
| 2. **Background Worker**: Daemon thread runs checks every 5 minutes | |
| 3. **History Persistence**: Recent 100 checks stored in memory | |
| 4. **Smart Scheduling**: Distributed checks to avoid rate limiting | |
| ## π Privacy | |
| - No data is stored permanently | |
| - All checks run in-memory | |
| - No external logging or tracking | |
| - Your URLs stay private | |
| ## π License | |
| MIT License - Free for personal and commercial use | |
| ## π€ Contributing | |
| Found a bug? Have a feature request? Open an issue on GitHub! | |
| --- | |
| **Made with β€οΈ by the HuggingFace Community** | |