File size: 1,497 Bytes
3c2a4f8
 
 
 
 
 
 
9f70330
 
1acc1b1
 
3c2a4f8
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
---
license: mit
---

# ๐Ÿ€ NBA Player Performance Predictor (Assignment #2)

## ๐ŸŽฅ Project Presentation
[## Click here to Watch the Video ](https://youtu.be/MlvFNjFf5ZI)




---

## ๐Ÿ“Œ Overview
This project analyzes NBA player data (1950-2017) to predict player performance based solely on physical attributes (Height, Weight, Age).
* **Goal 1 (Regression):** Predict exact points scored per season.
* **Goal 2 (Classification):** Classify players as "High Scorers" vs. "Low Scorers."

## ๐Ÿ”ง Feature Engineering
We engineered several features to improve model performance:
* **Clustering:** Used K-Means to identify 5 distinct body types (e.g., "Small Guard" vs. "Heavy Center").
* **Scaling:** Standardized Height and Weight to compare players fairly.
* **Encoding:** Converted player positions into numeric data.

## ๐Ÿ† Model Results

### Part 1: Regression (Predicting Points)
* **Winner:** Gradient Boosting Regressor
* **Rยฒ Score:** 0.075
* **Insight:** Physical attributes alone are weak predictors of exact scoring numbers.

### Part 2: Classification (High vs. Low Scorer)
* **Winner:** Support Vector Machine (SVM)
* **Recall:** 64%
* **Insight:** The SVM model was the best at identifying "Hidden Gems" (High Recall), minimizing the chance of missing out on talent.

## ๐Ÿ“‚ Files Included
* `winning_model.pkl`: The trained Gradient Boosting Regressor.
* `winning_classifier.pkl`: The trained SVM Classifier.
* `My_Notebook.ipynb`: The complete Python code for this analysis.