Tian Wang commited on Feb 12

Commit

9843ba6

1 Parent(s): dadb116

Add detector and classifier models with training artifacts

Files changed (16) hide show

.gitattributes +2 -0
README.md +78 -0
classifier/classifier_best.pt +3 -0
classifier/training_results.json +13 -0
detector/args.yaml +109 -0
detector/plots/BoxF1_curve.png +3 -0
detector/plots/BoxPR_curve.png +3 -0
detector/plots/BoxP_curve.png +3 -0
detector/plots/BoxR_curve.png +3 -0
detector/plots/confusion_matrix.png +3 -0
detector/plots/confusion_matrix_normalized.png +3 -0
detector/plots/results.png +3 -0
detector/results.csv +11 -0
detector/weights/best.onnx +3 -0
detector/weights/best.pt +3 -0
example-detection.png +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+*.png filter=lfs diff=lfs merge=lfs -text
+*.jpg filter=lfs diff=lfs merge=lfs -text

README.md ADDED Viewed

	@@ -0,0 +1,78 @@

+---
+license: mit
+tags:
+  - object-detection
+  - image-classification
+  - yolo
+  - set-game
+  - card-game
+  - computer-vision
+---
+# Set Solver Models
+Trained models for the [Set card game](https://www.setgame.com/) solver.
+![Example detection](./example-detection.png)
+**Live demo**: [huggingface.co/spaces/wangtianthu/set-solver](https://huggingface.co/spaces/wangtianthu/set-solver)
+## Models
+### Detector — YOLOv11n
+Detects individual Set cards on a board image.
+| Metric | Value |
+|--------|-------|
+| mAP50 | 99.5% |
+| mAP50-95 | 97.4% |
+| Architecture | YOLOv11n |
+| Input size | 640x640 |
+| Epochs | 10 |
+| Training data | 4000 synthetic board images |
+**Files**: `detector/weights/best.pt` (PyTorch), `detector/weights/best.onnx` (ONNX)
+### Classifier — MobileNetV3
+Classifies each card's 4 attributes: shape, color, number, and fill.
+| Metric | Value |
+|--------|-------|
+| Overall accuracy | 99.9% |
+| Number accuracy | 100% |
+| Color accuracy | 100% |
+| Shape accuracy | 99.9% |
+| Fill accuracy | 99.8% |
+| Architecture | MobileNetV3-Small |
+| Input size | 224x224 |
+| Training data | ~9500 cropped card images (81 classes) |
+**File**: `classifier/classifier_best.pt`
+## Usage
+```python
+from ultralytics import YOLO
+from PIL import Image
+# Load detector
+detector = YOLO("detector/weights/best.pt")
+results = detector("board_photo.jpg", conf=0.25)
+# Load classifier
+import torch
+from src.train.classifier import SetCardClassifier
+classifier = SetCardClassifier(pretrained=False)
+checkpoint = torch.load("classifier/classifier_best.pt", map_location="cpu")
+classifier.load_state_dict(checkpoint["model_state_dict"])
+classifier.eval()
+```
+## Training
+Both models were trained on synthetic data generated by a custom board generator that produces realistic Set game layouts with varied backgrounds, perspective transforms, and noise objects.
+Source code: [github.com/wangtian24/set-solver](https://github.com/wangtian24/set-solver)

classifier/classifier_best.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a0c464367eccfcfd6599377c9af35f72cd23c524b01eda7e9a11ccb1e3ba3f6d
+size 11465795

classifier/training_results.json ADDED Viewed

	@@ -0,0 +1,13 @@

+{
+  "test_loss": 0.009692762911799945,
+  "test_accuracy": {
+    "number": 1.0,
+    "color": 1.0,
+    "shape": 0.9992082343626286,
+    "fill": 0.997624703087886
+  },
+  "avg_test_accuracy": 0.9992082343626286,
+  "train_size": 9479,
+  "val_size": 1895,
+  "test_size": 1263
+}

detector/args.yaml ADDED Viewed

	@@ -0,0 +1,109 @@

+task: detect
+mode: train
+model: yolo11n.pt
+data: /Users/wangtian/workspace/set-solver/data/synthetic/dataset.yaml
+epochs: 10
+time: null
+patience: 20
+batch: 16
+imgsz: 640
+save: true
+save_period: -1
+cache: false
+device: mps
+workers: 8
+project: /Users/wangtian/workspace/set-solver/weights
+name: detector
+exist_ok: true
+pretrained: true
+optimizer: auto
+verbose: true
+seed: 0
+deterministic: true
+single_cls: false
+rect: false
+cos_lr: false
+close_mosaic: 10
+resume: false
+amp: true
+fraction: 1.0
+profile: false
+freeze: null
+multi_scale: 0.0
+compile: false
+overlap_mask: true
+mask_ratio: 4
+dropout: 0.0
+val: true
+split: val
+save_json: false
+conf: null
+iou: 0.7
+max_det: 300
+half: false
+dnn: false
+plots: true
+end2end: null
+source: null
+vid_stride: 1
+stream_buffer: false
+visualize: false
+augment: false
+agnostic_nms: false
+classes: null
+retina_masks: false
+embed: null
+show: false
+save_frames: false
+save_txt: false
+save_conf: false
+save_crop: false
+show_labels: true
+show_conf: true
+show_boxes: true
+line_width: null
+format: torchscript
+keras: false
+optimize: false
+int8: false
+dynamic: false
+simplify: true
+opset: null
+workspace: null
+nms: false
+lr0: 0.01
+lrf: 0.01
+momentum: 0.937
+weight_decay: 0.0005
+warmup_epochs: 3.0
+warmup_momentum: 0.8
+warmup_bias_lr: 0.1
+box: 7.5
+cls: 0.5
+dfl: 1.5
+pose: 12.0
+kobj: 1.0
+rle: 1.0
+angle: 1.0
+nbs: 64
+hsv_h: 0.015
+hsv_s: 0.7
+hsv_v: 0.4
+degrees: 0.0
+translate: 0.1
+scale: 0.5
+shear: 0.0
+perspective: 0.0
+flipud: 0.0
+fliplr: 0.5
+bgr: 0.0
+mosaic: 1.0
+mixup: 0.0
+cutmix: 0.0
+copy_paste: 0.0
+copy_paste_mode: flip
+auto_augment: randaugment
+erasing: 0.4
+cfg: null
+tracker: botsort.yaml
+save_dir: /Users/wangtian/workspace/set-solver/weights/detector

detector/plots/BoxF1_curve.png ADDED Viewed

Git LFS Details

SHA256: 22ac76b663bfbc5c70c62254d0838e9551680613f046c10d9a3905712caa78a3
Pointer size: 130 Bytes
Size of remote file: 86.4 kB

detector/plots/BoxPR_curve.png ADDED Viewed

Git LFS Details

SHA256: df0dc0b651b5b88c715752280eaead0c02aad42774f657354029dd78d305161e
Pointer size: 130 Bytes
Size of remote file: 68.3 kB

detector/plots/BoxP_curve.png ADDED Viewed

Git LFS Details

SHA256: 7f3592367666105d034a2ce12f0ea69a7d1d3e8294fd4b322a2f58eabc30fef8
Pointer size: 130 Bytes
Size of remote file: 76.3 kB

detector/plots/BoxR_curve.png ADDED Viewed

Git LFS Details

SHA256: 4f631fabe01c94ae37e1f06bce7c88ce9b36f640e9d62a5123cd0bc56abee686
Pointer size: 130 Bytes
Size of remote file: 80.1 kB

detector/plots/confusion_matrix.png ADDED Viewed

Git LFS Details

SHA256: 8b72dfb16b3589e5f98b21b849bbe6eb7afda4d01aa4ef7dde0f2e661cee2a22
Pointer size: 130 Bytes
Size of remote file: 90.9 kB

detector/plots/confusion_matrix_normalized.png ADDED Viewed

Git LFS Details

SHA256: fb754284bd2eb4bbbaa38f906767e6f5c5bf17d40f4bb796162b9c0b982cb5b0
Pointer size: 130 Bytes
Size of remote file: 85 kB

detector/plots/results.png ADDED Viewed

Git LFS Details

SHA256: 5eaba2b057aa7791c33b23a93e9d82630c4dfee06eff0ee8e81154e482e6b058
Pointer size: 131 Bytes
Size of remote file: 273 kB

detector/results.csv ADDED Viewed

	@@ -0,0 +1,11 @@

+epoch,time,train/box_loss,train/cls_loss,train/dfl_loss,metrics/precision(B),metrics/recall(B),metrics/mAP50(B),metrics/mAP50-95(B),val/box_loss,val/cls_loss,val/dfl_loss,lr/pg0,lr/pg1,lr/pg2
+1,111.468,0.40144,1.19864,0.84562,0.99981,0.99984,0.995,0.9085,0.40196,1.20815,0.82106,0.000656085,0.000656085,0.000656085
+2,206.25,0.34115,0.4089,0.82012,1,1,0.995,0.95523,0.29123,0.4463,0.79571,0.0011918,0.0011918,0.0011918
+3,304.901,0.32374,0.34916,0.8127,0.99999,1,0.995,0.93135,0.35031,0.37025,0.79607,0.00159551,0.00159551,0.00159551
+4,395.233,0.31295,0.32333,0.81186,0.99997,1,0.995,0.96262,0.26761,0.2805,0.79735,0.001406,0.001406,0.001406
+5,492.083,0.26809,0.29046,0.80266,0.99989,1,0.995,0.97475,0.24272,0.25697,0.7803,0.001208,0.001208,0.001208
+6,606.75,0.23878,0.24702,0.79441,0.99999,1,0.995,0.98544,0.21188,0.22382,0.77178,0.00101,0.00101,0.00101
+7,741.542,0.22471,0.22465,0.79115,0.99999,0.99513,0.995,0.97671,0.21286,0.2033,0.77386,0.000812,0.000812,0.000812
+8,873.213,0.2143,0.21188,0.78911,0.99999,0.98882,0.98531,0.97465,0.17893,0.19046,0.76739,0.000614,0.000614,0.000614
+9,1041.5,0.19247,0.19322,0.78823,0.99999,0.97096,0.97537,0.9692,0.14963,0.16733,0.76335,0.000416,0.000416,0.000416
+10,1205.28,0.17649,0.17659,0.78384,0.99999,0.99914,0.995,0.99123,0.13999,0.14978,0.76308,0.000218,0.000218,0.000218

detector/weights/best.onnx ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ccd020c044339d0cd2e671a837f209a8adcdcef17982cbae90da2b962e160081
+size 10477995

detector/weights/best.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d65deae13124271df8739b700d2f893bca1eb7a7bc8ac870702e714b787ceee7
+size 5453594

example-detection.png ADDED Viewed

Git LFS Details

SHA256: c6bd78d069daceb3c9d0990163d75bdd0942a08c841196aa3bf8cf605652cdcd
Pointer size: 132 Bytes
Size of remote file: 2.8 MB