File size: 1,904 Bytes
ef695f6
24e61fb
 
ef695f6
 
 
 
 
 
0e4e8f5
ef695f6
 
da04448
 
 
 
 
 
 
 
 
 
ef695f6
 
da04448
 
 
 
 
 
 
ef695f6
 
 
da04448
 
 
 
ef695f6
 
da04448
 
ef695f6
da04448
 
 
 
 
 
 
ef695f6
 
 
 
 
 
da04448
 
ef695f6
da04448
 
ef695f6
 
da04448
ef695f6
 
da04448
 
 
 
 
 
 
 
 
 
 
d3d019c
da04448
 
 
ef695f6
 
 
da04448
 
 
ef695f6
 
 
 
 
 
da04448
d3d019c
 
da04448
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
---
title: Group08 UrbanMobilityApp
emoji: πŸš—
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: "4.31.0"
app_file: app.py
pinned: false
python_version: "3.10"
---

# πŸš€ Urban Mobility Pricing & Satisfaction App

## πŸ“Š Project Overview
This project analyzes a European urban mobility startup operating in:
- Paris
- Berlin
- Madrid
- Warsaw
- Turin

🎯 Goal:
Optimize pricing strategy and user satisfaction using:
- Ride data (quantitative)
- App reviews (qualitative)

---

## 🧠 Pipeline

### πŸ”Ή Notebook 1 – Data Processing
- Synthetic ride dataset (2,000 rides)
- Review dataset (500 reviews)
- Data cleaning & preprocessing
- VADER sentiment analysis
- Output: `merged_summary.csv`

### πŸ”Ή Notebook 2 – Predictive Modelling
- Random Forest β†’ predict user satisfaction
- Feature importance β†’ price is key driver
- ARIMA β†’ revenue forecasting
- Outputs:
  - `rf_model.pkl`
  - encoder files

---

## πŸ’» Hugging Face App

### πŸ“Š Dashboard
- KPI overview:
  - Average price
  - Rating
  - Sentiment
  - Cancellation rate
- Interactive charts by city and vehicle type

### πŸ€– Prediction
- Input ride parameters
- Output:
  - Satisfaction probability
  - Predicted label (High / Low)

### πŸ’‘ Recommendation
- Pricing recommendation based on segment

---

## βš™οΈ Technologies Used
- Python (pandas, numpy)
- scikit-learn (Random Forest)
- statsmodels (ARIMA)
- VADER Sentiment Analysis
- Gradio (UI)
- Hugging Face Spaces

---

## πŸ“ˆ Key Insights
- Final price is the strongest driver of satisfaction
- E-scooters β†’ highest usage but lower sentiment
- Discounts β†’ improve ratings
- Revenue β†’ stabilizing in mature markets

---

## πŸ“¦ Files
- `app.py` β†’ application
- `merged_summary.csv` β†’ data
- `rf_model.pkl` β†’ ML model
- encoders β†’ feature transformation
- notebooks β†’ full pipeline

---

## πŸ‘₯ Team
Group 08 – AI for Big Data Management