File size: 3,419 Bytes
41a3533
102e87a
66b9109
 
 
41a3533
 
07910ac
66b9109
41a3533
 
102e87a
41a3533
102e87a
 
 
 
 
 
 
 
41a3533
66b9109
 
 
102e87a
 
 
 
 
 
 
 
 
 
 
66b9109
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
---
title: Advanced Sentiment Analytics Dashboard
emoji: πŸ“Š
colorFrom: blue
colorTo: green
sdk: docker
pinned: false
license: mit
app_port: 7860
---

# πŸš€ Advanced Sentiment Analytics Dashboard

A distributed sentiment analysis system with **PyABSA + HF Spaces backend** and **Streamlit Cloud frontend** for scalable, high-accuracy analysis of product reviews.

## πŸ—οΈ Architecture

### πŸ”§ Split Deployment Strategy
- **Backend**: HuggingFace Spaces (PyABSA + FastAPI) - `api_server.py`
- **Frontend**: Streamlit Cloud (Lightweight UI) - `frontend_light.py`
- **Benefits**: High accuracy models + fast, scalable frontend

## ✨ Features

### πŸ” Core Analytics
- **PyABSA Integration**: State-of-the-art aspect-based sentiment analysis
- **M2M100 Translation**: Facebook's multilingual translation model
- **Intent Classification**: Advanced customer intent detection
- **Real-time Processing**: API-based architecture for scalability

### πŸ“Š Advanced Dashboard
- **Multi-page Navigation**: Home, Analytics, History, Documentation
- **Interactive Visualizations**: Network graphs, heatmaps, timelines
- **Advanced Filtering**: Multi-dimensional data exploration
- **Session Management**: Save and restore analysis sessions
- **KPI Dashboard**: Real-time metrics and insights

### 🎯 Business Intelligence
- **Areas of Improvement**: AI-powered identification of problem areas
- **Strength Anchors**: Recognition of positive aspects to leverage
- **Alert System**: Automated sentiment spike detection
- **Impact Simulation**: What-if analysis for business decisions
- **Export Functionality**: PDF reports and Excel data export

## πŸ“ Data Format

Your CSV file should include these columns:
- `id`: Unique identifier for each review
- `reviews_title`: Title of the review
- `review`: The actual review text
- `date`: Review date (YYYY-MM-DD format)
- `user_id`: Identifier for the reviewer

## πŸš€ Usage

### For End Users:
1. **Upload your CSV file** with review data (see format below)
2. **Process the data** using our advanced NLP pipeline (~2-3 minutes for 1000 reviews)
3. **Explore insights** through interactive visualizations
4. **Export results** as PDF reports or Excel files

### For Developers:
#### Docker Deployment (Hugging Face Spaces)
This app is optimized for Docker deployment with:
- `Dockerfile`: Production-ready container setup
- `requirements-docker.txt`: Optimized dependencies with version pinning
- `.dockerignore`: Efficient build context
- Health checks and proper port configuration (7860)

#### Local Development
```bash
git clone <your-repo>
cd insights
pip install -r requirements.txt
streamlit run app_enhanced.py
```

## πŸ› οΈ Technology Stack

- **Frontend**: Streamlit with interactive components
- **NLP**: pyABSA for aspect-based sentiment analysis
- **Translation**: Facebook M2M100 for multilingual support
- **Visualization**: Plotly for interactive charts and graphs
- **Network Analysis**: NetworkX for aspect relationship graphs

## πŸ“Š Sample Output

The dashboard provides:
- Comprehensive sentiment analysis
- Aspect extraction and sentiment mapping
- Intent classification with confidence scores
- Interactive network graphs of aspect relationships
- Time-series analysis of sentiment trends
- Exportable business intelligence reports

---

**Status**: βœ… **Production Ready** - Enterprise-level sentiment analysis with advanced NLP capabilities.