OCTFlow Path-1 code + stripped weights (Stage A* + v3a + v1/v2)

Browse files

Files changed (6) hide show

README.md +70 -0
octflow-raev2-code.tar.gz +3 -0
weights/sd3_oct_stageA_v3_step20000.pt +3 -0
weights/sd3_vb_stageC_v1_step20000.pt +3 -0
weights/sd3_vb_stageC_v2_step20000.pt +3 -0
weights/sd3_vb_stageC_v3a_step30000.pt +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,70 @@

+---
+license: other
+tags:
+  - oct
+  - ophthalmology
+  - segmentation
+  - stable-diffusion-3
+  - instruction-tuning
+  - medical-imaging
+---
+# OCTFlow — Path 1 (SD3 backbone) code + weights
+Reusable code and checkpoints for the OCTFlow pilot: an ophthalmic multimodal
+generative model that does **prompt-controlled OCT retinal-layer segmentation**
+(Vision-Banana-style instruction tuning on a Stable Diffusion 3 medium backbone).
+This repo is for **continuing the work on a new machine** — the dataset is hosted
+separately. Optimizer state has been stripped from the checkpoints (warm-start and
+inference only need `model` weights).
+## Contents
+| File | What |
+|---|---|
+| `octflow-raev2-code.tar.gz` | Full RAEv2 working tree (src/ engine + pilot/path1/ Path-1 code, configs, scripts). Excludes results/, .git/, pretrained_models/, data/. |
+| `weights/sd3_oct_stageA_v3_step20000.pt` | **Stage A\*** — SD3 medium fine-tuned on Topcon OCT (T2I domain adaptation). The warm-start base for all Stage C runs. |
+| `weights/sd3_vb_stageC_v3a_step30000.pt` | **v3a (best)** — multi-prompt instruction tuning. Follows prompts for 9/5/3-layer + arbitrary colors + single-layer selection; zero-shot adapts to new layer schemes. |
+| `weights/sd3_vb_stageC_v1_step20000.pt` | (optional) v1 specialist, prob_seg=0.3, single fixed 10-color prompt. |
+| `weights/sd3_vb_stageC_v2_step20000.pt` | (optional) v2 specialist, prob_seg=0.5. |
+Each `.pt` holds `{step, model, ema, config}` (no optimizer). `model` is a
+`SD3Transformer2DModel` with `pos_embed.proj` expanded 16→32 input channels
+(channel-concat image conditioning).
+## Key results (v3a)
+- **Instruction following**: prompt 9/5/3 layers → outputs 6.95/4.36/2.85 layers; shuffled-color prompt mIoU 0.456 ≈ canonical 0.461 (the model reads the prompt's color map).
+- **Cross-device zero-shot (OCTA500, native 5-layer prompt)**: binary retina IoU **0.538 → 0.897** vs the single-prompt pilot.
+- **per-scheme mIoU (incl bg, N=150)**: 9-layer 0.461 / 5-layer 0.526 / 3-layer 0.610.
+- vs OCT-RAE backbone: 10-class strict mIoU 0.023 → 0.507 (22×).
+## Restore on a new server
+```bash
+# 1. download this repo
+hf download <this-repo-id> --repo-type model --local-dir octflow_restore
+# 2. unpack code
+mkdir RAEv2 && tar xzf octflow_restore/octflow-raev2-code.tar.gz -C RAEv2
+cd RAEv2
+# 3. env (uv) + put weights back where run.sh expects them
+uv sync   # or: conda env + pip install diffusers transformers torch ...
+mkdir -p pilot/path1/results/sd3_oct_stageA_v3/checkpoints
+mkdir -p pilot/path1/results/sd3_vb_stageC_v3a/checkpoints
+cp octflow_restore/weights/sd3_oct_stageA_v3_step20000.pt  pilot/path1/results/sd3_oct_stageA_v3/checkpoints/step-0020000.pt
+cp octflow_restore/weights/sd3_vb_stageC_v3a_step30000.pt  pilot/path1/results/sd3_vb_stageC_v3a/checkpoints/step-0030000.pt
+# 4. point configs/scripts at the new dataset root, then see pilot/path1/run.sh
+```
+SD3 medium base weights (`stabilityai/stable-diffusion-3-medium-diffusers`) are
+downloaded from HF at runtime, not bundled here.
+## Reproduce / next step
+The full pipeline is `pilot/path1/run.sh`. Next planned step is **v3b**:
+decoded-space loss (palette CE + soft Dice + thin-layer weighting) to fix the
+generalist tax and weak thin layers (RPE/GCL). Clinical scope is the macula.

octflow-raev2-code.tar.gz ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f1f4703eb902ed2849f291e8f96d14433c70db8a311063f8808dadbff2c57305
+size 1072660574

weights/sd3_oct_stageA_v3_step20000.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:794136b2ccfe297adf566969366ddaab9c6fe0f9e7eb50aecc1c018dd34dc440
+size 8340371276

weights/sd3_vb_stageC_v1_step20000.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:26b4d8c76a59f7622e5641fe6200831cfe32f3bbe0cb2d17e2ccc9e925b2f97a
+size 8340762482

weights/sd3_vb_stageC_v2_step20000.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fd2999818ef21280b808cb2f2404c86652a2d461eae8a09a56cc94ea82367874
+size 8340762482

weights/sd3_vb_stageC_v3a_step30000.pt ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e34057970598b7b2f89180c5f7461a146a8c2da2037122a8fd5037506234cf71
+size 8340763852