Chucks90
/

covtoken

medical-imaging

self-supervised-learning

vision-transformer

negative-results

Model card Files Files and versions

covtoken / README.md

Chucks90's picture

add model-card frontmatter

dab03bb verified 14 days ago

|

History Blame Contribute Delete

3.18 kB

	---
	license: mit
	tags:
	- medical-imaging
	- token-pruning
	- self-supervised-learning
	- label-free
	- vision-transformer
	- negative-results
	library_name: pytorch
	---

	# covtoken — Label-Free Lesion-Subspace Token Economy for Medical Imaging

	Code, gated evaluation, and a working paper draft for a label-free token-pruning method on
	frozen self-supervised medical vision transformers.

	> Reframed contribution (see `gate_reports/SUMMARY.md`). The load-bearing result is a
	> label-free lesion subspace — a mid-layer geometry that localizes lesions WITHOUT labels
	> across anatomy, modality (CT + ultrasound), and backbone (MedDINOv3 + DINOv2) — together with
	> membership pruning (beats saliency pruning on small-lesion miss-rate), a **conformal
	> retention certificate, and lesion-routed depth** (1.6× FLOPs). The original
	> coverage-constrained optimization with an interpretable dual is reported as a **clean
	> negative result** with a transferable mechanism (rank-based coverage rewards diverse spanning,
	> rare pathology needs concentration). See `gate_reports/NEGATIVE_RESULT.md`.

	## Headline numbers

	\| \| \|
	\|---\|---\|
	\| Mid-layer finding \| lesion AUROC final-layer 0.565 → block-3 0.871 \|
	\| Cross-modality \| density-A: lung CT 0.87, kidney 0.82, breast US 0.73 (DINOv2; attention 0.49) \|
	\| Membership > saliency pruning \| LIDC +27.6/+15.8, KiTS23 +7.4, BUSI +13.8/+19.0 pts \|
	\| Conformal retention cert. \| empirical coverage 0.978 ≥ nominal 0.90 \|
	\| Lesion-routed depth \| 1.6× FLOPs at 98% small-lesion sensitivity \|
	\| Negative result \| coverage floor 0.22 vs membership 0.82 (small-lesion recall) \|

	## Layout

	```
	subspace/ label-free lesion subspace: density (A) + residual (B) constructions
	coverage/ rankme / coding-rate / energy functionals (the FALSIFIED coverage objective)
	gate/ constrained pruner (Gumbel mask + dual) + per-image certificate [negative result]
	arch/ conformal_head, routed_depth, volumetric (Phase-6 components)
	backbone/ frozen MedDINOv3 ViT-B/16 loader (DINOv2 used for ultrasound, in jobs/)
	data/ CT token-bank builder, eval-only mask loading + label-leak guard
	eval/ DeLong / paired-bootstrap / Spearman stats; gate runners
	jobs/ all experiments as Hugging Face Jobs (materialize, banks, gates, ablations)
	gate_reports/ machine-readable per-gate decision records + SUMMARY + NEGATIVE_RESULT
	configs/ thresholds.lock.json (Phase-1b calibrated, locked)
	paper/ working_draft.md + figures/ + venue_notes.md
	tests/ CI label-leak test (no label may touch subspace construction)
	```

	## Reproducibility

	All experiments ran as Hugging Face Jobs. Backbones: `ricklisz123/MedDINOv3-ViTB-16-CT-3M` (CT),
	`facebook/dinov2-base` (ultrasound). Datasets: LIDC-IDRI, KiTS23, MSD Task03 Liver, MSD Task07
	Pancreas, BUSI. Masks are evaluation-only; `tests/test_label_leak.py` fails the build if a label
	reaches subspace construction. The frozen DINOv3 model code is installed from
	`github.com/facebookresearch/dinov3` at job time (not vendored here).

	Decision records: `gate_reports/`. Locked thresholds: `configs/thresholds.lock.json`.