Iris314 commited on
Commit
3c4d2c6
·
verified ·
1 Parent(s): 853c181

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +57 -51
README.md CHANGED
@@ -1,59 +1,65 @@
1
- # Model Card for Lego Brick Classification (Classical AutoML)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
 
3
- ## Model Details
4
- - **Model type**: Classical AutoML ensemble (AutoGluon Tabular)
5
- - **Framework**: [AutoGluon](https://auto.gluon.ai/stable/index.html)
6
- - **Task**: Multiclass Classification
7
- - **Classes**:
8
- - Standard
9
- - Flat
10
- - Sloped
11
 
12
- The model was trained using AutoGluon Tabular with automated feature engineering, model selection, and hyperparameter tuning under a fixed budget.
 
13
 
14
- ## Intended Use
15
- - **Direct use**: Classify LEGO bricks into `Standard`, `Flat`, or `Sloped` types based on their dimensions.
16
- - **Educational use**: Demonstrate classical AutoML techniques on a small synthetic dataset.
17
- - **Not intended for**: Industrial-scale LEGO piece classification or manufacturing quality control.
 
 
 
 
 
 
18
 
19
  ## Dataset
20
- - **Dataset name**: [aedupuga/lego-sizes](https://huggingface.co/datasets/aedupuga/lego-sizes)
21
- - **Description**: Contains dimensions and stud count of LEGO bricks, with labels indicating piece type.
22
- - **Splits**:
23
- - Original: 30 measured samples
24
- - Augmented: 300 synthetically generated samples
25
- - **Features**:
26
- - Max Length (cm)
27
- - Max Height (cm)
28
- - Width (cm)
29
- - Studs
30
- - Type (Standard/Flat/Sloped)
31
-
32
- ## Training Procedure
33
- - **Train/Val/Test split**: 80/10/10 (stratified) on augmented dataset
34
- - **External validation**: Performed on original dataset
35
- - **AutoML settings**:
36
- - Preset: `best_quality`
37
- - Metric: Accuracy
38
- - Time budget: ~30 minutes
39
-
40
- ## Evaluation Results
41
- - **Augmented Test Set**:
42
- - Accuracy: 0.97
43
- - Weighted F1: 0.96
44
- - **Original External Validation Set**:
45
- - Accuracy: 0.93
46
- - Weighted F1: 0.92
47
 
48
- ## Limitations
49
- - The dataset is very small (only 30 original samples).
50
- - Synthetic augmentation may not capture full real-world variability.
51
- - Model may not generalize beyond this toy dataset.
52
 
53
- ## Recommendations
54
- - Use primarily for **demonstration and coursework**.
55
- - Do not deploy in real-world LEGO production environments.
56
 
57
- ## Model Card Contact
58
- Author: *Xinxuan Tang (Carnegie Mellon University)* – xinxuant@andrew.cmu.edu
59
- Dataset curator: Anuhya Edupuganti (CMU) aedupuga@andrew.cmu.edu
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: mit
5
+ tags:
6
+ - automl
7
+ - tabular
8
+ - classification
9
+ - sklearn
10
+ datasets:
11
+ - aedupuga/lego-sizes
12
+ metrics:
13
+ - accuracy
14
+ - f1
15
+ model-index:
16
+ - name: Lego Brick Classification (Classical AutoML)
17
+ results:
18
+ - task:
19
+ type: classification
20
+ name: Tabular Classification
21
+ dataset:
22
+ name: aedupuga/lego-sizes
23
+ type: classification
24
+ split: augmented
25
+ metrics:
26
+ - type: accuracy
27
+ value: 0.97
28
+ - type: f1
29
+ value: 0.96
30
+ ---
31
 
32
+ # Lego Brick Classification (Classical AutoML)
 
 
 
 
 
 
 
33
 
34
+ This model was trained using **AutoGluon Tabular (Classical AutoML)** on the dataset [aedupuga/lego-sizes](https://huggingface.co/datasets/aedupuga/lego-sizes).
35
+ The task is to classify LEGO bricks into three categories: **Standard, Flat, Sloped**, given their measured dimensions (length, height, width, studs).
36
 
37
+ ## Model Details
38
+ - **Framework**: AutoGluon (TabularPredictor)
39
+ - **Algorithms searched**: Random Forest, Gradient Boosting Trees, XGBoost, LightGBM, CatBoost
40
+ - **Best model**: LightGBM (selected by AutoML)
41
+ - **Training data**: 300 augmented + 30 original samples
42
+ - **Evaluation metric**: Accuracy, Weighted F1
43
+
44
+ ## Results
45
+ - Accuracy: **0.97**
46
+ - Weighted F1: **0.96**
47
 
48
  ## Dataset
49
+ - Name: [aedupuga/lego-sizes](https://huggingface.co/datasets/aedupuga/lego-sizes)
50
+ - Original: 30 manually measured LEGO bricks
51
+ - Augmented: 300 synthetically generated samples (jittered dimensions)
52
+ - Features: `Length`, `Height`, `Width`, `Studs`
53
+ - Target: `Type (Standard / Flat / Sloped)`
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
54
 
55
+ ## Intended Use
56
+ This model is intended for **educational practice** in AutoML and tabular classification.
57
+ It is **not suitable for industrial use**, given the small sample size and synthetic data.
 
58
 
59
+ ## Limitations
60
+ - Very small real dataset (30 samples)
61
+ - Synthetic augmentation may not capture all real-world variations
62
 
63
+ ## Contact
64
+ Dataset by: Anuhya Edupuganti (CMU)
65
+ Model by: Xinxuan Tang (CMU) - xinxuant@andrew.cmu.edu