fix: complete models table with real param counts (9.33M/34.2M/25.86M/8.94M)
Browse files
README.md
CHANGED
|
@@ -156,19 +156,20 @@ results = yolo("path/to/print_image.png", conf=0.3)
|
|
| 156 |
|
| 157 |
| File | Description | AUROC | Params |
|
| 158 |
|---|---|---|---|
|
| 159 |
-
| `models/cdm_v3_yolo_bbox.pt` | **CDM λ=0.01,
|
| 160 |
-
| `models/cdm_v3_baseline.pt` | CDM λ=0
|
| 161 |
-
| `models/cdm_v3_test.pt` |
|
| 162 |
-
| `models/yolo_best.pt` | YOLOv8 feature detector | mAP@50=0.950 |
|
| 163 |
-
| `models/semantic_mismatch_angle_model.pt` | Per-feature: angle |
|
| 164 |
-
| `models/semantic_mismatch_dist1_model.pt` | Per-feature: dist1 |
|
| 165 |
-
| `models/semantic_mismatch_dots_model.pt` | Per-feature: dots |
|
| 166 |
|
| 167 |
### Which model should I use?
|
| 168 |
|
| 169 |
-
- **Quick evaluation:** `cdm_v3_yolo_bbox.pt` — the thesis "proposed CDM",
|
| 170 |
-
- **Reproducing the thesis CV result:**
|
| 171 |
-
- **YOLO feature detection only:** `yolo_best.pt`
|
|
|
|
| 172 |
|
| 173 |
> **Why does the baseline win on CV?** On this small dataset (~1330 samples), the 5-fold CV shows λ=0 and λ=0.01 are statistically indistinguishable (0.8673 vs 0.8628). The single-split evaluation clearly favors λ=0.01 (+2.8 pp AUROC, −19 pp FPR@95). The thesis reports the CV result as the primary finding since it is more reliable.
|
| 174 |
|
|
|
|
| 156 |
|
| 157 |
| File | Description | AUROC | Params |
|
| 158 |
|---|---|---|---|
|
| 159 |
+
| `models/cdm_v3_yolo_bbox.pt` | **CDM λ=0.01, base_ch=64 (proposed)** | 0.8603 single-split | **9.33 M** |
|
| 160 |
+
| `models/cdm_v3_baseline.pt` | CDM λ=0, base_ch=128 (thesis CV result) | 0.8673 ± 0.023 CV | **34.2 M** |
|
| 161 |
+
| `models/cdm_v3_test.pt` | CDM λ=0.01, base_ch=64 (dev checkpoint) | ≈0.85 | 9.33 M |
|
| 162 |
+
| `models/yolo_best.pt` | YOLOv8 feature detector (8 print features) | mAP@50=0.950 | **25.86 M** |
|
| 163 |
+
| `models/semantic_mismatch_angle_model.pt` | Per-feature CDM: angle (~9.67:1 imbalance) | ~0.82 | **8.94 M** |
|
| 164 |
+
| `models/semantic_mismatch_dist1_model.pt` | Per-feature CDM: dist1 | ~0.89 | **8.94 M** |
|
| 165 |
+
| `models/semantic_mismatch_dots_model.pt` | Per-feature CDM: dots (best feature) | ~0.96 | **8.94 M** |
|
| 166 |
|
| 167 |
### Which model should I use?
|
| 168 |
|
| 169 |
+
- **Quick evaluation:** `cdm_v3_yolo_bbox.pt` — the thesis "proposed CDM", λ=0.01, base_ch=64 (**9.33 M params**)
|
| 170 |
+
- **Reproducing the thesis CV result:** `cdm_v3_baseline.pt` — λ=0, base_ch=128 (**34.2 M params**); the 5-fold CV AUROC 0.8673 ± 0.023 comes from this wider model
|
| 171 |
+
- **YOLO feature detection only:** `yolo_best.pt` (25.86 M params, YOLOv8-based)
|
| 172 |
+
- **Per-feature analysis:** `semantic_mismatch_*.pt` — one model per feature, 8.94 M each, trained with a DiffGuard-style contrastive approach
|
| 173 |
|
| 174 |
> **Why does the baseline win on CV?** On this small dataset (~1330 samples), the 5-fold CV shows λ=0 and λ=0.01 are statistically indistinguishable (0.8673 vs 0.8628). The single-split evaluation clearly favors λ=0.01 (+2.8 pp AUROC, −19 pp FPR@95). The thesis reports the CV result as the primary finding since it is more reliable.
|
| 175 |
|