ahmed-3m commited on
Commit
378f0ad
·
verified ·
1 Parent(s): b7de8f4

fix: complete models table with real param counts (9.33M/34.2M/25.86M/8.94M)

Browse files
Files changed (1) hide show
  1. README.md +11 -10
README.md CHANGED
@@ -156,19 +156,20 @@ results = yolo("path/to/print_image.png", conf=0.3)
156
 
157
  | File | Description | AUROC | Params |
158
  |---|---|---|---|
159
- | `models/cdm_v3_yolo_bbox.pt` | **CDM λ=0.01, YOLO bbox** (thesis proposed) | 0.8603 single-split | 9.3 M |
160
- | `models/cdm_v3_baseline.pt` | CDM λ=0 baseline | 0.8673 ± 0.023 CV | 9.3 M |
161
- | `models/cdm_v3_test.pt` | Development checkpoint | | 9.3 M |
162
- | `models/yolo_best.pt` | YOLOv8 feature detector | mAP@50=0.950 | |
163
- | `models/semantic_mismatch_angle_model.pt` | Per-feature: angle | | |
164
- | `models/semantic_mismatch_dist1_model.pt` | Per-feature: dist1 | | |
165
- | `models/semantic_mismatch_dots_model.pt` | Per-feature: dots | | |
166
 
167
  ### Which model should I use?
168
 
169
- - **Quick evaluation:** `cdm_v3_yolo_bbox.pt` — the thesis "proposed CDM", trained with separation loss λ=0.01
170
- - **Reproducing the thesis CV result:** Use `cdm_v3_baseline.pt` (λ=0) the 5-fold CV AUROC 0.8673 ± 0.023 comes from the baseline
171
- - **YOLO feature detection only:** `yolo_best.pt`
 
172
 
173
  > **Why does the baseline win on CV?** On this small dataset (~1330 samples), the 5-fold CV shows λ=0 and λ=0.01 are statistically indistinguishable (0.8673 vs 0.8628). The single-split evaluation clearly favors λ=0.01 (+2.8 pp AUROC, −19 pp FPR@95). The thesis reports the CV result as the primary finding since it is more reliable.
174
 
 
156
 
157
  | File | Description | AUROC | Params |
158
  |---|---|---|---|
159
+ | `models/cdm_v3_yolo_bbox.pt` | **CDM λ=0.01, base_ch=64 (proposed)** | 0.8603 single-split | **9.33 M** |
160
+ | `models/cdm_v3_baseline.pt` | CDM λ=0, base_ch=128 (thesis CV result) | 0.8673 ± 0.023 CV | **34.2 M** |
161
+ | `models/cdm_v3_test.pt` | CDM λ=0.01, base_ch=64 (dev checkpoint) | ≈0.85 | 9.33 M |
162
+ | `models/yolo_best.pt` | YOLOv8 feature detector (8 print features) | mAP@50=0.950 | **25.86 M** |
163
+ | `models/semantic_mismatch_angle_model.pt` | Per-feature CDM: angle (~9.67:1 imbalance) | ~0.82 | **8.94 M** |
164
+ | `models/semantic_mismatch_dist1_model.pt` | Per-feature CDM: dist1 | ~0.89 | **8.94 M** |
165
+ | `models/semantic_mismatch_dots_model.pt` | Per-feature CDM: dots (best feature) | ~0.96 | **8.94 M** |
166
 
167
  ### Which model should I use?
168
 
169
+ - **Quick evaluation:** `cdm_v3_yolo_bbox.pt` — the thesis "proposed CDM", λ=0.01, base_ch=64 (**9.33 M params**)
170
+ - **Reproducing the thesis CV result:** `cdm_v3_baseline.pt` λ=0, base_ch=128 (**34.2 M params**); the 5-fold CV AUROC 0.8673 ± 0.023 comes from this wider model
171
+ - **YOLO feature detection only:** `yolo_best.pt` (25.86 M params, YOLOv8-based)
172
+ - **Per-feature analysis:** `semantic_mismatch_*.pt` — one model per feature, 8.94 M each, trained with a DiffGuard-style contrastive approach
173
 
174
  > **Why does the baseline win on CV?** On this small dataset (~1330 samples), the 5-fold CV shows λ=0 and λ=0.01 are statistically indistinguishable (0.8673 vs 0.8628). The single-split evaluation clearly favors λ=0.01 (+2.8 pp AUROC, −19 pp FPR@95). The thesis reports the CV result as the primary finding since it is more reliable.
175