Iris314 commited on
Commit
c5205a6
·
verified ·
1 Parent(s): 3c4d2c6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +56 -31
README.md CHANGED
@@ -29,37 +29,62 @@ model-index:
29
  value: 0.96
30
  ---
31
 
32
- # Lego Brick Classification (Classical AutoML)
33
 
34
- This model was trained using **AutoGluon Tabular (Classical AutoML)** on the dataset [aedupuga/lego-sizes](https://huggingface.co/datasets/aedupuga/lego-sizes).
35
- The task is to classify LEGO bricks into three categories: **Standard, Flat, Sloped**, given their measured dimensions (length, height, width, studs).
 
 
36
 
37
  ## Model Details
38
- - **Framework**: AutoGluon (TabularPredictor)
39
- - **Algorithms searched**: Random Forest, Gradient Boosting Trees, XGBoost, LightGBM, CatBoost
40
- - **Best model**: LightGBM (selected by AutoML)
41
- - **Training data**: 300 augmented + 30 original samples
42
- - **Evaluation metric**: Accuracy, Weighted F1
43
-
44
- ## Results
45
- - Accuracy: **0.97**
46
- - Weighted F1: **0.96**
47
-
48
- ## Dataset
49
- - Name: [aedupuga/lego-sizes](https://huggingface.co/datasets/aedupuga/lego-sizes)
50
- - Original: 30 manually measured LEGO bricks
51
- - Augmented: 300 synthetically generated samples (jittered dimensions)
52
- - Features: `Length`, `Height`, `Width`, `Studs`
53
- - Target: `Type (Standard / Flat / Sloped)`
54
-
55
- ## Intended Use
56
- This model is intended for **educational practice** in AutoML and tabular classification.
57
- It is **not suitable for industrial use**, given the small sample size and synthetic data.
58
-
59
- ## Limitations
60
- - Very small real dataset (30 samples)
61
- - Synthetic augmentation may not capture all real-world variations
62
-
63
- ## Contact
64
- Dataset by: Anuhya Edupuganti (CMU)
65
- Model by: Xinxuan Tang (CMU) - xinxuant@andrew.cmu.edu
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
29
  value: 0.96
30
  ---
31
 
32
+ # Model Card for Lego Brick Classification (Classical AutoML)
33
 
34
+ This model classifies LEGO pieces into three types — **Standard**, **Flat**, and **Sloped** using their dimensions (Length, Height, Width, Studs).
35
+ It was trained with **AutoGluon Tabular AutoML**, selecting the best-performing algorithm (LightGBM).
36
+
37
+ ---
38
 
39
  ## Model Details
40
+
41
+ ### Model Description
42
+ - **Developed by:** Xinxuan Tang (CMU)
43
+ - **Dataset curated by:** Anuhya Edupuganti (CMU)
44
+ - **Model type:** AutoML ensemble (best model = LightGBM)
45
+ - **Language(s):** N/A (tabular data)
46
+ - **License:** MIT
47
+ - **Finetuned from:** Not applicable
48
+
49
+ ### Model Sources
50
+ - **Repository:** [Hugging Face Model Repo](https://huggingface.co/)
51
+ - **Dataset:** [aedupuga/lego-sizes](https://huggingface.co/datasets/aedupuga/lego-sizes)
52
+
53
+ ---
54
+
55
+ ## Uses
56
+
57
+ ### Direct Use
58
+ - Educational practice in **tabular classification**.
59
+ - Experimenting with AutoML search and hyperparameter tuning.
60
+
61
+ ### Downstream Use
62
+ - Could be used as a **teaching example** for AutoML pipelines on small tabular datasets.
63
+
64
+ ### Out-of-Scope Use
65
+ - **Not suitable for industrial LEGO quality control**, since dataset is synthetic and small.
66
+
67
+ ---
68
+
69
+ ## Bias, Risks, and Limitations
70
+
71
+ - **Small dataset**: only 30 original bricks, augmented to 300 synthetic samples.
72
+ - **Synthetic data bias**: jitter augmentation may not reflect real-world LEGO variations.
73
+
74
+ ### Recommendations
75
+ Users should treat results as **proof-of-concept** and not deploy in production.
76
+
77
+ ---
78
+
79
+ ## How to Get Started with the Model
80
+
81
+ ```python
82
+ from autogluon.tabular import TabularPredictor
83
+ import pandas as pd
84
+
85
+ # Load trained predictor
86
+ predictor = TabularPredictor.load("autogluon_model/")
87
+
88
+ # Run inference
89
+ test_data = pd.DataFrame([{"Length": 4, "Height": 1.2, "Width": 2, "Studs": 4}])
90
+ print(predictor.predict(test_data))