whilethis
/

NEMESIS

+---
+license: mit
+tags:
+  - medical-imaging
+  - self-supervised-learning
+  - masked-autoencoder
+  - 3d-ct
+  - pretraining
+---
+# NEMESIS
+**Superpatch-based 3D Medical Image Self-Supervised Pretraining via Noise-Enhanced Dual-Masking**
+> IEEE AICAS 2026
+## Overview
+NEMESIS is a self-supervised pretraining framework for 3D CT volumes using:
+- **Superpatch processing** (128³ sub-volumes) — memory-efficient ViT pretraining
+- **Dual-masking (MATB)** — plane-wise (xy) + axis-wise (z) masking, exploiting CT anisotropy
+- **NEMESIS Tokens (NTs)** — learnable tokens summarising visible patches via cross-attention
+- **Noise-enhanced reconstruction** — Gaussian noise injection for regularisation
+### Key result (BTCV organ classification, frozen linear probe)
+| Method | AUROC |
+|---|---|
+| **NEMESIS (frozen)** | **0.9633** |
+| SuPreM (fine-tuned) | 0.9493 |
+| VoCo (fine-tuned) | 0.9387 |
+## Checkpoints
+| File | embed_dim | depth | mask_ratio |
+|---|---|---|---|
+| `MAE_768_0.5.pt` | 768 | 6 | 0.5 | ← **main model (paper)** |
+| `MAE_768_0.25.pt` | 768 | 6 | 0.25 | ablation |
+| `MAE_768_0.75.pt` | 768 | 6 | 0.75 | ablation |
+| `MAE_576_0.5.pt` | 576 | 6 | 0.5 | ablation |
+| `MAE_384_0.5.pt` | 384 | 6 | 0.5 | ablation |
+| (others) | | | | embed_dim × mask_ratio ablations |
+## Usage
+```bash
+pip install huggingface_hub
+huggingface-cli download whilethis/NEMESIS MAE_768_0.5.pt --local-dir pretrained/
+```
+```python
+import torch
+from nemesis.models.mae import MAEgic3DMAE
+ckpt = torch.load("pretrained/MAE_768_0.5.pt", map_location="cpu")
+model = MAEgic3DMAE(
+    embed_dim=768, depth=6, num_heads=8,
+    decoder_embed_dim=128, decoder_depth=3,
+    num_maegic_tokens=8,
+)
+model.load_state_dict(ckpt["model_state_dict"])
+encoder = model.encoder
+```
+## Code
+[https://github.com/whilethis00/NEMESIS-public](https://github.com/whilethis00/NEMESIS-public)
+## Citation
+```bibtex
+@inproceedings{jung2026nemesis,
+  title     = {{NEMESIS}: Superpatch-based 3{D} Medical Image Self-Supervised Pretraining via Noise-Enhanced Dual-Masking},
+  author    = {Jung, Hyeonseok and others},
+  booktitle = {IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS)},
+  year      = {2026},
+}
+```