Spaces:
Sleeping
Sleeping
| title: Madrid Content Analyzer | |
| emoji: ποΈ | |
| colorFrom: blue | |
| colorTo: green | |
| sdk: gradio | |
| sdk_version: 4.44.0 | |
| app_file: app.py | |
| pinned: false | |
| license: mit | |
| # ποΈ Madrid Content Analyzer | |
| Automated analysis of language clarity in Madrid City Council communications. | |
| ## π What It Does | |
| This application: | |
| - π₯ **Fetches content** from Madrid City Council RSS feeds and Open Data Portal | |
| - π **Analyzes language clarity** using the Aclarador system | |
| - π **Tracks trends** over time in a DuckDB database | |
| - π **Visualizes results** in an interactive dashboard | |
| ## π― Features | |
| ### π Dashboard | |
| - Real-time statistics on analyzed content | |
| - Clarity score distribution charts | |
| - Timeline of content and scores | |
| - Category breakdown | |
| ### π Content Browser | |
| - Search and filter content by date, category, and clarity score | |
| - View detailed analysis for each item | |
| - Identify areas for improvement | |
| ### π Analytics | |
| - Find low-clarity items that need improvement | |
| - Track trends over time | |
| - Export data for further analysis | |
| ### βοΈ Settings | |
| - Manual content fetch trigger | |
| - Database statistics | |
| - Fetch logs and history | |
| ## π Automatic Updates | |
| Content is fetched and analyzed automatically every 6 hours. The app runs continuously on Hugging Face Spaces with persistent data storage. | |
| ## π οΈ Technology Stack | |
| - **Frontend**: Gradio 4.44 | |
| - **Database**: DuckDB (analytics-optimized, 16GB storage) | |
| - **Scheduler**: APScheduler (background tasks) | |
| - **Visualization**: Plotly | |
| - **Analysis**: Aclarador language clarity analyzer | |
| ## πΎ Data Storage | |
| Data is stored in `/data/madrid.duckdb` which persists across Space restarts. The database can hold millions of content items within the 16GB Space storage limit. | |
| ## π Usage | |
| Simply visit the Space URL and explore the tabs: | |
| 1. **Dashboard** - See overall statistics and trends | |
| 2. **Browse Content** - Search and filter analyzed content | |
| 3. **Analytics** - Find improvement opportunities | |
| 4. **Settings** - Trigger manual updates, view logs | |
| ## π Data Sources | |
| - **RSS Feed**: https://diario.madrid.es/feed | |
| - **Open Data Portal**: https://datos.madrid.es/portal/site/egob | |
| ## π Privacy | |
| This Space analyzes publicly available content from Madrid City Council. No personal data is collected or stored. | |
| ## π License | |
| MIT License - Feel free to fork and adapt! | |
| ## π€ Contributing | |
| Contributions are welcome! This project helps improve government communication clarity. | |
| --- | |
| **Built with** π€ Hugging Face Spaces | **Free forever** π° | |