Spaces:
Paused
Paused
File size: 2,537 Bytes
b36d0b3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 |
---
title: Smart Web Monitor
emoji: π
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
app_port: 7860
---
# π Smart Web Monitor
**AI-Powered Website Change Detection System**
Monitor websites for changes automatically with AI-driven content analysis. Perfect for tracking competitor updates, news sites, or any web content you care about.
## β¨ Features
- π **Automated Monitoring**: Check websites every 5 minutes automatically
- πΈ **Content Hash Detection**: Track changes via MD5 hash comparison
- π€ **AI Sentiment Analysis**: Powered by DistilBERT model from HuggingFace
- π **Manual Checks**: Instant verification anytime
- π **History Tracking**: Review all past checks
- π― **Multi-URL Support**: Monitor unlimited websites
## π€ AI Technology
This project uses **real HuggingFace Transformers**:
- Model: `distilbert-base-uncased-finetuned-sst-2-english`
- Task: Sentiment Analysis (POSITIVE/NEGATIVE classification)
- Purpose: Detect tone changes in web content over time
## π Quick Start
1. **Add URLs**: Go to "Monitor Management" tab and add websites
2. **Auto-Check**: System automatically checks every 5 minutes
3. **Manual Check**: Use "Manual Check" tab for instant verification
4. **View History**: Check "History" tab to see all results
## π Use Cases
- π° News monitoring
- π’ Competitor tracking
- π° Price change alerts
- π Content update detection
- π Government notice tracking
## π οΈ Technology Stack
- **Frontend**: Gradio 4.x
- **Backend**: Python 3.11
- **Browser Engine**: Chromium (for advanced scraping)
- **Deployment**: HuggingFace Spaces (Docker SDK)
## βοΈ Configuration
Set these environment variables in HuggingFace Spaces settings:
```bash
# Optional: Custom check interval (default: 5 minutes)
CHECK_INTERVAL=300
# Optional: Maximum URLs to monitor (default: 50)
MAX_URLS=50
```
## π How It Works
1. **Hash-Based Detection**: Each check computes MD5 hash of page content
2. **Background Worker**: Daemon thread runs checks every 5 minutes
3. **History Persistence**: Recent 100 checks stored in memory
4. **Smart Scheduling**: Distributed checks to avoid rate limiting
## π Privacy
- No data is stored permanently
- All checks run in-memory
- No external logging or tracking
- Your URLs stay private
## π License
MIT License - Free for personal and commercial use
## π€ Contributing
Found a bug? Have a feature request? Open an issue on GitHub!
---
**Made with β€οΈ by the HuggingFace Community**
|