BentoUniAcc committed on
Commit d9c3f9a · verified · 1 Parent(s): a73cdfa

Update README.md

Files changed (1)
  1. README.md +7 -1
README.md CHANGED
@@ -27,6 +27,12 @@ base_model: None
27
 
28
  ---
29
 
 
 
 
 
 
 
30
  ## Overview
31
 
32
  This project uses the **Stack Overflow Annual Developer Survey 2025** (49,123 responses, 170 features) to predict a software developer's annual salary. The pipeline covers end-to-end data science: exploratory analysis, feature engineering, unsupervised clustering, regression, and multi-class classification.
@@ -198,7 +204,7 @@ Silhouette scores are low across all values of k, confirming that natural cluste
198
  | Algorithm | k / params | Silhouette |
199
  |-----------|-----------|------------|
200
  | KMeans | k=4 | 0.0109 |
201
- | DBSCAN | eps=5, min_samples=25 | N/A (1 cluster) |
202
  | Agglomerative Ward | k=4 | **0.0224** |
203
 
204
  The data's high dimensionality (249 features) makes density-based clustering (DBSCAN) impractical — inter-point distances are too large for meaningful core-point detection.
 
27
 
28
  ---
29
 
30
+ ## Video
31
+
32
+ # <video src="https://youtu.be/3QKJx76NT64" controls="controls" style="max-width: 720px;"></video>
33
+
34
+ ---
35
+
36
  ## Overview
37
 
38
  This project uses the **Stack Overflow Annual Developer Survey 2025** (49,123 responses, 170 features) to predict a software developer's annual salary. The pipeline covers end-to-end data science: exploratory analysis, feature engineering, unsupervised clustering, regression, and multi-class classification.
 
204
  | Algorithm | k / params | Silhouette |
205
  |-----------|-----------|------------|
206
  | KMeans | k=4 | 0.0109 |
207
+ | DBSCAN | eps=5, min_samples=25 | 0.0912 (7 clusters) |
208
  | Agglomerative Ward | k=4 | **0.0224** |
209
 
210
  The data's high dimensionality (249 features) makes density-based clustering (DBSCAN) impractical — inter-point distances are too large for meaningful core-point detection.
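
A minimal sketch of how the silhouette comparison in the table above could be reproduced. The README's actual feature matrix and preprocessing are not shown in this diff, so the sketch runs on synthetic stand-in data from `make_blobs`; the helper names `silhouette_or_none` and `compare_clusterers` are hypothetical. Excluding DBSCAN noise points (label `-1`) before scoring is also an assumption, and the README's DBSCAN figure may have been computed differently.

```python
import numpy as np
from sklearn.cluster import KMeans, DBSCAN, AgglomerativeClustering
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler


def silhouette_or_none(X, labels):
    """Return the silhouette score, or None when it is undefined.

    Assumption: DBSCAN noise points (label -1) are dropped before scoring.
    """
    mask = labels != -1
    n_clusters = len(set(labels[mask]))
    # silhouette_score needs between 2 and n_samples - 1 distinct labels.
    if n_clusters < 2 or n_clusters >= mask.sum():
        return None
    return float(silhouette_score(X[mask], labels[mask]))


def compare_clusterers(X, k=4, eps=5.0, min_samples=25, seed=0):
    """Fit the three algorithms from the table and collect their scores."""
    models = {
        "KMeans": KMeans(n_clusters=k, n_init=10, random_state=seed),
        "DBSCAN": DBSCAN(eps=eps, min_samples=min_samples),
        "Agglomerative Ward": AgglomerativeClustering(n_clusters=k, linkage="ward"),
    }
    results = {}
    for name, model in models.items():
        labels = model.fit_predict(X)
        n_clusters = len(set(labels) - {-1})  # do not count noise as a cluster
        results[name] = (n_clusters, silhouette_or_none(X, labels))
    return results


# Synthetic stand-in for the (unavailable) encoded survey features.
X_raw, _ = make_blobs(n_samples=600, centers=4, n_features=20, random_state=0)
X = StandardScaler().fit_transform(X_raw)

results = compare_clusterers(X)
for name, (n_clusters, score) in results.items():
    shown = "N/A" if score is None else f"{score:.4f}"
    print(f"{name}: clusters={n_clusters}, silhouette={shown}")
```

Reporting the cluster count next to the score mirrors the DBSCAN row of the table, where a single-cluster result has no defined silhouette and is shown as N/A.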