MedVAE / README.md

MitakaKuma

Update README.md

3896d4a verified 10 days ago

preview code

raw

history blame contribute delete

2.14 kB

metadata

license: mit
language:
  - en
base_model:
  - stable-diffusion-v1-5/stable-diffusion-v1-5
pipeline_tag: image-to-image
tags:
  - medical
  - CT
  - autoencoders

MedVAE

Variational Autoencoders (VAEs) are widely used in imaging tasks such as image generation, reconstruction, and representation learning. However, most open-source VAEs are trained on natural images and are not suitable for 3D medical CT data. This creates a gap for researchers who need reliable VAE models that can handle real clinical CT volumes.

To address this problem, we benchmark MedVAE, a VAE pre-trained on large-scale 3D medical CT data. The goal is to evaluate whether MedVAE can better preserve anatomical structures and intensity distributions compared to general-purpose VAEs.

Benchmarks

Name	VAE Type	# Patients
stable-diffusion-v1-5	KL-VAE	0
stable-diffusion-3.5-large	KL-VAE	0
2D-MedVAE_KL-smooth-tianyu-2K	KL-VAE	2k
2D-MedVAE_KL-smooth-kuma-10K	KL-VAE	10k
2D-MedVAE_KL-sharp-kuma-10K	KL-VAE	10k
2D-MedVAE_KL-sharp-kuma-20K	KL-VAE	20k
2D-MedVAE_KL-sharp-kuma-100K	KL-VAE	100k
2D-MedVAE_KL-sharp-kuma-300K	KL-VAE	300k
3D-MedVAE_MAISI	KL-VAE	40K
3D-MedVAE_KL-sharp-kuma-100K (ongoing)	KL-VAE	100K
3D-MedVAE_KL-sharp-kuma-1M (ongoing)	KL-VAE	1M
3D-MedVAE_VQ	VQ-VAE	100K
3D-MedVAE_VQ-kuma-100K (ongoing)	VQ-VAE	100K

Citation

@article{liu2025see,
  title={See More, Change Less: Anatomy-Aware Diffusion for Contrast Enhancement},
  author={Liu, Junqi and Wu, Zejun and Bassi, Pedro RAS and Zhou, Xinze and Li, Wenxuan and Hamamci, Ibrahim E and Er, Sezgin and Lin, Tianyu and Luo, Yi and Płotka, Szymon and others},
  journal={arXiv preprint arXiv:https://www.arxiv.org/abs/2512.07251},
  year={2025},
  url={https://github.com/MrGiovanni/SMILE}
}