matteobardelle's picture
Update README.md
0e4e8f5 verified
---
title: Group08 UrbanMobilityApp
emoji: πŸš—
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: "4.31.0"
app_file: app.py
pinned: false
python_version: "3.10"
---
# πŸš€ Urban Mobility Pricing & Satisfaction App
## πŸ“Š Project Overview
This project analyzes a European urban mobility startup operating in:
- Paris
- Berlin
- Madrid
- Warsaw
- Turin
🎯 Goal:
Optimize pricing strategy and user satisfaction using:
- Ride data (quantitative)
- App reviews (qualitative)
---
## 🧠 Pipeline
### πŸ”Ή Notebook 1 – Data Processing
- Synthetic ride dataset (2,000 rides)
- Review dataset (500 reviews)
- Data cleaning & preprocessing
- VADER sentiment analysis
- Output: `merged_summary.csv`
### πŸ”Ή Notebook 2 – Predictive Modelling
- Random Forest β†’ predict user satisfaction
- Feature importance β†’ price is key driver
- ARIMA β†’ revenue forecasting
- Outputs:
- `rf_model.pkl`
- encoder files
---
## πŸ’» Hugging Face App
### πŸ“Š Dashboard
- KPI overview:
- Average price
- Rating
- Sentiment
- Cancellation rate
- Interactive charts by city and vehicle type
### πŸ€– Prediction
- Input ride parameters
- Output:
- Satisfaction probability
- Predicted label (High / Low)
### πŸ’‘ Recommendation
- Pricing recommendation based on segment
---
## βš™οΈ Technologies Used
- Python (pandas, numpy)
- scikit-learn (Random Forest)
- statsmodels (ARIMA)
- VADER Sentiment Analysis
- Gradio (UI)
- Hugging Face Spaces
---
## πŸ“ˆ Key Insights
- Final price is the strongest driver of satisfaction
- E-scooters β†’ highest usage but lower sentiment
- Discounts β†’ improve ratings
- Revenue β†’ stabilizing in mature markets
---
## πŸ“¦ Files
- `app.py` β†’ application
- `merged_summary.csv` β†’ data
- `rf_model.pkl` β†’ ML model
- encoders β†’ feature transformation
- notebooks β†’ full pipeline
---
## πŸ‘₯ Team
Group 08 – AI for Big Data Management