File size: 2,537 Bytes
b36d0b3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
---
title: Smart Web Monitor
emoji: πŸ”
colorFrom: blue
colorTo: purple
sdk: docker
pinned: false
app_port: 7860
---

# πŸ” Smart Web Monitor

**AI-Powered Website Change Detection System**

Monitor websites for changes automatically with AI-driven content analysis. Perfect for tracking competitor updates, news sites, or any web content you care about.

## ✨ Features

- πŸ• **Automated Monitoring**: Check websites every 5 minutes automatically
- πŸ“Έ **Content Hash Detection**: Track changes via MD5 hash comparison
- πŸ€– **AI Sentiment Analysis**: Powered by DistilBERT model from HuggingFace
- πŸ” **Manual Checks**: Instant verification anytime
- πŸ“Š **History Tracking**: Review all past checks
- 🎯 **Multi-URL Support**: Monitor unlimited websites

## πŸ€– AI Technology

This project uses **real HuggingFace Transformers**:
- Model: `distilbert-base-uncased-finetuned-sst-2-english`
- Task: Sentiment Analysis (POSITIVE/NEGATIVE classification)
- Purpose: Detect tone changes in web content over time

## πŸš€ Quick Start

1. **Add URLs**: Go to "Monitor Management" tab and add websites
2. **Auto-Check**: System automatically checks every 5 minutes
3. **Manual Check**: Use "Manual Check" tab for instant verification
4. **View History**: Check "History" tab to see all results

## πŸ“‹ Use Cases

- πŸ“° News monitoring
- 🏒 Competitor tracking  
- πŸ’° Price change alerts
- πŸ“ Content update detection
- πŸ”” Government notice tracking

## πŸ› οΈ Technology Stack

- **Frontend**: Gradio 4.x
- **Backend**: Python 3.11
- **Browser Engine**: Chromium (for advanced scraping)
- **Deployment**: HuggingFace Spaces (Docker SDK)

## βš™οΈ Configuration

Set these environment variables in HuggingFace Spaces settings:

```bash
# Optional: Custom check interval (default: 5 minutes)
CHECK_INTERVAL=300

# Optional: Maximum URLs to monitor (default: 50)
MAX_URLS=50
```

## πŸ“Š How It Works

1. **Hash-Based Detection**: Each check computes MD5 hash of page content
2. **Background Worker**: Daemon thread runs checks every 5 minutes
3. **History Persistence**: Recent 100 checks stored in memory
4. **Smart Scheduling**: Distributed checks to avoid rate limiting

## πŸ”’ Privacy

- No data is stored permanently
- All checks run in-memory
- No external logging or tracking
- Your URLs stay private

## πŸ“ License

MIT License - Free for personal and commercial use

## 🀝 Contributing

Found a bug? Have a feature request? Open an issue on GitHub!

---

**Made with ❀️ by the HuggingFace Community**