File size: 922 Bytes
4826fb1 |
1 2 3 4 5 6 7 8 9 10 11 |
# ProgEmu: Towards Interpretable Counterfactual Generation via Multimodal Autoregression
This repository contains the model weights for the MICCAI'25 paper: Towards Interpretable Counterfactual Generation via Multimodal Autoregression ([arxiv](https://arxiv.org/abs/2503.23149), [homepage](https://progemu.github.io/), [model](https://huggingface.co/massaki75/progemu/tree/main)). Supported by Shanghai Innovation Institute (SII).
## Highlights 💡
- **Interpretable Counterfactual Generation (ICG)**: Jointly produces a counterfactual CXR image and a concise interpretation text that pinpoints progression-induced visual changes.
- **ICG-CXR Dataset**: Over 10k longitudinal CXR quadruples (prior image, prompt, subsequent image, interpretation) that supports ICG task.
- **ProgEmu Framework**: A single multimodal autoregressive transformer that generates visual and textual counterfactuals in one forward pass.
|