NEMESIS / README.md
whilethis's picture
Upload README.md with huggingface_hub
1c09125 verified
---
license: mit
tags:
- medical-imaging
- self-supervised-learning
- masked-autoencoder
- 3d-ct
- pretraining
---
# NEMESIS
**Superpatch-based 3D Medical Image Self-Supervised Pretraining via Noise-Enhanced Dual-Masking**
> IEEE AICAS 2026
## Overview
NEMESIS is a self-supervised pretraining framework for 3D CT volumes using:
- **Superpatch processing** (128³ sub-volumes) — memory-efficient ViT pretraining
- **Dual-masking (MATB)** — plane-wise (xy) + axis-wise (z) masking, exploiting CT anisotropy
- **NEMESIS Tokens (NTs)** — learnable tokens summarising visible patches via cross-attention
- **Noise-enhanced reconstruction** — Gaussian noise injection for regularisation
### Key result (BTCV organ classification, frozen linear probe)
| Method | AUROC |
|---|---|
| **NEMESIS (frozen)** | **0.9633** |
| SuPreM (fine-tuned) | 0.9493 |
| VoCo (fine-tuned) | 0.9387 |
## Checkpoints
| File | embed_dim | depth | mask_ratio |
|---|---|---|---|
| `MAE_768_0.5.pt` | 768 | 6 | 0.5 | ← **main model (paper)** |
| `MAE_768_0.25.pt` | 768 | 6 | 0.25 | ablation |
| `MAE_768_0.75.pt` | 768 | 6 | 0.75 | ablation |
| `MAE_576_0.5.pt` | 576 | 6 | 0.5 | ablation |
| `MAE_384_0.5.pt` | 384 | 6 | 0.5 | ablation |
| (others) | | | | embed_dim × mask_ratio ablations |
## Usage
```bash
pip install huggingface_hub
huggingface-cli download whilethis/NEMESIS MAE_768_0.5.pt --local-dir pretrained/
```
```python
import torch
from nemesis.models.mae import MAEgic3DMAE
ckpt = torch.load("pretrained/MAE_768_0.5.pt", map_location="cpu")
model = MAEgic3DMAE(
embed_dim=768, depth=6, num_heads=8,
decoder_embed_dim=128, decoder_depth=3,
num_maegic_tokens=8,
)
model.load_state_dict(ckpt["model_state_dict"])
encoder = model.encoder
```
## Code
[https://github.com/whilethis00/NEMESIS-public](https://github.com/whilethis00/NEMESIS-public)
## Citation
```bibtex
@inproceedings{jung2026nemesis,
title = {{NEMESIS}: Superpatch-based 3{D} Medical Image Self-Supervised Pretraining via Noise-Enhanced Dual-Masking},
author = {Jung, Hyeonseok and others},
booktitle = {IEEE International Conference on Artificial Intelligence Circuits and Systems (AICAS)},
year = {2026},
}
```