MambaMIM / README.md

Update README.md

af08da0 verified 9 months ago

6.55 kB

	---
	license: apache-2.0
	pipeline_tag: image-segmentation
	---


	## [MIA'25] MambaMIM: Pre-training Mamba with State Space Token Interpolation and its Application to Medical Image Segmentation

	<p align="center" width="100%">
	<!---->
	</p>

	![MambaMIM](img/TOKI.png)



	<div align="center">
	<span class="author-block">
	<a href="https://scholar.google.com/citations?user=x1pODsMAAAAJ&hl=en" target="_blank">Fenghe Tang</a><sup>1,2</sup>,</span>
	<span class="author-block">
	<a target="_blank">Bingkun Nian</a><sup>3</sup>,</span>
	<span class="author-block">
	<a href="https://scholar.google.com/citations?user=ocAtNkkAAAAJ&hl=en" target="_blank">Yingtai Li</a><sup>1,2</sup>,</span>
	<span class="author-block">
	<a href="https://scholar.google.com/citations?user=Wo8tMSMAAAAJ&hl=en" target="_blank">Zihang Jiang</a><sup>1,2</sup>,</span>
	<span class="author-block">
	<a href="https://scholar.google.com/citations?user=tmx7tu8AAAAJ&hl=en" target="_blank">Jie Yang</a><sup>3</sup>,</span>
	<span class="author-block">
	<a href="https://scholar.google.com/citations?user=Vbb5EGIAAAAJ&hl=en" target="_blank"> Liu Wei</a><sup>3</sup>,</span>
	<span class="author-block">
	<a href="https://scholar.google.com/citations?user=8eNm2GMAAAAJ&hl=en" target="_blank">S.Kevin Zhou</a><sup>1,2</sup>
	</span>
	</div>


	<br>

	<div align="center">
	<sup>1</sup>
	<a href='https://en.ustc.edu.cn/' target='_blank'>School of Biomedical Engineering, University of Science and Technology of China</a>&emsp;
	<br>
	<sup>2</sup> <a href='http://english.ict.cas.cn/' target='_blank'>Suzhou Institute for Advanced Research, University of Science and Technology of China</a>&emsp;
	<br>
	<sup>3</sup> <a href='http://www.pami.sjtu.edu.cn/En/Home' target='_blank'>Department of Automation, Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University</a>
	<br>
	</div>

	<br>
	<br>

	[![arXiv](https://img.shields.io/badge/arxiv-2408.08070-b31b1b)](https://arxiv.org/pdf/2408.08070.pdf) [![github](https://img.shields.io/badge/github-MambaMIM-purple)](https://github.com/FengheTan9/MambaMIM) <a href="#LICENSE--citation"><img alt="License: Apache2.0" src="https://img.shields.io/badge/LICENSE-Apache%202.0-blue.svg"/></a>



	## News

	- MambaMIM accepted by Medical Image Analyses (MIA'25) ! 🥰
	- Weights released ! 😎
	- Code released ! 😘
	- Code and weights will be released soon ! 😘
	- [2024/08/16] Paper released !



	## TODOs

	- [x] Paper released
	- [x] Code released
	- [x] Weight released



	## Getting Started

	### Download weights

	\| Name \| Resolution \| Intensities \| Spacing \| Weights \|
	\| :------: \| :----------: \| :-----------: \| :----------------: \| :----------------------------------------------------------: \|
	\| MambaMIM \| 96 x 96 x 96 \| [-175, - 250] \| 1.5 x 1.5 x 1.5 mm \| [Google Drive (87MB)](https://drive.google.com/file/d/1B3j5aRPxkDJqf8UPGKDiAjg2X85a3Kwx/view?usp=sharing) \|



	### Prepare Environments

	```
	conda create -n mambamim python=3.9
	conda activate mambamim
	pip install torch==1.13.0 torchvision==0.14.0 torchaudio==0.13.0 --extra-index-url https://download.pytorch.org/whl/cu117
	pip install packaging timm==0.5.4
	pip install transformers==4.34.1 typed-argument-parser
	pip install numpy==1.21.2 opencv-python==4.5.5.64 opencv-python-headless==4.5.5.64
	pip install 'monai[all]'
	pip install monai==1.2.0
	pip install causal_conv1d-1.2.0.post2+cu118torch1.13cxx11abiTRUE-cp38-cp38-linux_x86_64.whl
	pip install mamba_ssm-1.2.0.post1+cu118torch1.13cxx11abiFALSE-cp38-cp38-linux_x86_64.whl
	```



	### Prepare Datasets

	We recommend that you convert the dataset into the [nnUNet](https://github.com/MIC-DKFZ/nnUNet/blob/master/documentation/dataset_format.md) format.

	```
	└── MambaMIM
	├── data
	├── Dataset060_TotalSegmentator
	└── imagesTr
	├── xxx_0000.nii.gz
	├── ...
	├── Dataset006_FLARE2022
	└── imagesTr
	├── xxx_0000.nii.gz
	├── ...
	└── Other_dataset
	└── imagesTr
	├── xxx_0000.nii.gz
	├── ...
	```

	An example ```dataset.json``` will be generated in ```./data```

	The content should be like below:

	```json
	{
	"training": [
	{
	"image": "./Dataset060_TotalSegmentator/imagesTr/xxx_0000.nii.gz"
	},
	{
	"image": "./Dataset006_FLARE2022/imagesTr/xxx_0000.nii.gz"
	},
	]
	}

	```



	## Start Training

	![MambaMIM](img/masking_consistency.png)



	Run training on multi-GPU :

	```sh
	# An example of training on 4 GPUs with DDP
	torchrun --nproc_per_node=4 --nnodes=1 --node_rank=0 --master_addr=localhost --master_port=12351 main.py --exp_name=debug --data_path=./data --model=mambamim --bs=16 --exp_dir=debug_mambamim_ddp_4
	```

	Run training on the single-GPU :

	```sh
	# An example of training on the single GPU
	python main.py --exp_name=debug --data_path=./data --model=mambamim --bs=4 --exp_dir=debug_mambamim
	```



	## Fine-tuning

	Load pre-training weights :

	```python
	# An example of Fine-tuning on BTCV (num_classes=14)
	from models.network.hymamba import build_hybird

	model = build_hybird(in_channel=1, n_classes=14, img_size=96).cuda()

	model_dict = torch.load("mambamim_mask75.pth")

	if model.load_state_dict(model_dict, strict=False):
	print("MambaMIM use pretrained weights successfully !")
	```

	Downstream pipeline can be referred to [UNETR]([research-contributions/UNETR/BTCV at main · Project-MONAI/research-contributions (github.com)](https://github.com/Project-MONAI/research-contributions/tree/main/UNETR/BTCV)).



	## Acknowledgements:

	This code uses helper functions from [SparK](https://github.com/keyu-tian/SparK) and [HySparK](https://github.com/FengheTan9/HySparK).



	## Citation

	If the code, paper and weights help your research, please cite:

	```
	@article{tang2024mambamim,
	title={MambaMIM: Pre-training Mamba with State Space Token-interpolation},
	author={Tang, Fenghe and Nian, Bingkun and Li, Yingtai and Yang, Jie and Wei, Liu and Zhou, S Kevin},
	journal={arXiv preprint arXiv:2408.08070},
	year={2024}
	}
	```



	## License

	This project is released under the Apache 2.0 license. Please see the [LICENSE](LICENSE) file for more information.