File size: 2,102 Bytes
66de98a
 
 
 
 
 
 
 
 
 
 
 
 
 
734956f
 
9477bfa
 
 
734956f
 
1962a7b
9477bfa
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
734956f
 
 
 
 
 
 
9477bfa
734956f
95aab7f
734956f
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
---
library_name: autogluon
pipeline_tag: tabular-classification
license: mit
tags:
  - automl
  - tabular
  - autogluon
  - sklearn
model_name: Football Elite Classifier  AutoML (AutoGluon Tabular)
language:
  - en
---

# Football Elite Classifier — AutoML (AutoGluon Tabular)

## Purpose
This model was developed as part of a class assignment on designing and deploying AI/ML systems.  
It demonstrates the use of AutoML (AutoGluon Tabular) to build a binary classifier on football receiver stats.

## Dataset
- **Source:** https://huggingface.co/datasets/james-kramer/receiverstats
- **Split:** Stratified Train/Test = 80/20 on the **original** split.  
- **Features:** ['Tgt', 'Rec', 'Yds', 'YBC_per_R', 'YAC_per_R', 'ADOT', 'Drop_pct', 'Rat']  
- **Target:** `Elite` (0/1)  
- **Preprocessing:** Identifier columns dropped (e.g., `Player`). Numeric coercion applied; rows with NA removed.

## Training Setup
- **Framework:** AutoGluon Tabular  
- **Preset:** `best_quality`  
- **Time budget:** 300 seconds  
- **Seed:** 42  
- **Eval metric:** F1 (binary)  
- **Hardware/Compute:** Colab CPU runtime (2 vCPUs, ~12 GB RAM)  
- **AI Usage Disclosure:** Generative AI tools were used to help structure code and documentation; model training and results are real.

## Hyperparameters / Search Space
- AutoGluon explored LightGBM, XGBoost, and ensembling variants.  
- Random state set for reproducibility.  
- Auto-stacking and bagging enabled under `best_quality`.  
- Internal hyperparameter tuning handled automatically by AutoGluon.

## Results (Held-out Test)
```json
{
  "accuracy": 0.8333333333333334,
  "f1": 0.8
}
```


## Limitations & Ethics
- Correlations do not imply causation; labels may reflect selection bias.
- Out-of-distribution players/contexts may reduce performance.
- Intended for coursework, not for real personnel decisions.

## License
- Code & weights: <MIT/Apache-2.0 or course-required license>

## Acknowledgments
AutoML with [AutoGluon Tabular].
Trained in Google Colab.
GenAI tools assisted with boilerplate and doc structure.
James Kramers hugging face dataset