Any-to-Any
Bagel
Safetensors
English
medical
medical-imaging
brain-mri
multimodal
image-to-image
image-text-to-text
modality-imputation
Instructions to use Astrostellar/UniBrain with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Bagel
How to use Astrostellar/UniBrain with Bagel:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
zhiyuns commited on
Commit ·
55263a9
1
Parent(s): eae8616
update readme
Browse files
README.md
CHANGED
|
@@ -1,3 +1,97 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 3 |
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
+
base_model:
|
| 4 |
+
- ByteDance-Seed/BAGEL-7B-MoT
|
| 5 |
+
pipeline_tag: any-to-any
|
| 6 |
+
library_name: bagel-mot
|
| 7 |
+
language:
|
| 8 |
+
- en
|
| 9 |
+
datasets:
|
| 10 |
+
- Astrostellar/RadGenome-Brain_MRI_parquet
|
| 11 |
+
tags:
|
| 12 |
+
- medical
|
| 13 |
+
- medical-imaging
|
| 14 |
+
- brain-mri
|
| 15 |
+
- multimodal
|
| 16 |
+
- image-to-image
|
| 17 |
+
- image-text-to-text
|
| 18 |
+
- modality-imputation
|
| 19 |
---
|
| 20 |
+
|
| 21 |
+
# UniBrain: Unified Multimodal Model for Brain MRI Imputation and Understanding
|
| 22 |
+
|
| 23 |
+
<p align="left">
|
| 24 |
+
<a href="https://medicalumm.github.io/unibrain.github.io/"><img src="https://img.shields.io/badge/UniBrain-Project_Page-0A66C2?logo=safari&logoColor=white" alt="UniBrain project page"></a>
|
| 25 |
+
<a href="https://arxiv.org/abs/2606.16484"><img src="https://img.shields.io/badge/UniBrain-Paper-red?logo=arxiv&logoColor=white" alt="UniBrain paper"></a>
|
| 26 |
+
<a href="https://github.com/zhiyuns/UniBrain"><img src="https://img.shields.io/badge/UniBrain-Code-536af5?logo=github&logoColor=white" alt="UniBrain code"></a>
|
| 27 |
+
</p>
|
| 28 |
+
|
| 29 |
+
> **UniBrain** is a unified multimodal model for brain MRI analysis. In one autoregressive context, it can impute missing MRI sequences, interpret the available and generated images, and produce a disease diagnosis. This repository hosts the UniBrain model checkpoints.
|
| 30 |
+
|
| 31 |
+
For installation, training, evaluation, and usage instructions, please visit the [official GitHub repository](https://github.com/zhiyuns/UniBrain).
|
| 32 |
+
|
| 33 |
+
<p align="center">
|
| 34 |
+
<img src="https://github.com/zhiyuns/UniBrain/raw/main/assets/main_figure.png" alt="Overview of the UniBrain framework" width="95%">
|
| 35 |
+
</p>
|
| 36 |
+
|
| 37 |
+
UniBrain is initialized from [BAGEL-7B-MoT](https://huggingface.co/ByteDance-Seed/BAGEL-7B-MoT), a Mixture-of-Transformer-Experts (MoT) model for multimodal understanding and generation. It adapts BAGEL to brain MRI using an interleaved, description-enriched training flow and three main ideas:
|
| 38 |
+
|
| 39 |
+
- **Unified MRI generation and understanding:** missing-sequence imputation and downstream interpretation share one autoregressive context.
|
| 40 |
+
- **Self-alignment:** medical image reconstruction provides dense supervision for fine-grained anatomical representation learning without requiring detailed captions for every image.
|
| 41 |
+
- **Dynamic hidden states:** training conditions the model on its own generated visual context to reduce exposure bias during long multimodal sequences.
|
| 42 |
+
|
| 43 |
+
## Model details
|
| 44 |
+
|
| 45 |
+
| Item | Description |
|
| 46 |
+
| --- | --- |
|
| 47 |
+
| Base model | [ByteDance-Seed/BAGEL-7B-MoT](https://huggingface.co/ByteDance-Seed/BAGEL-7B-MoT) |
|
| 48 |
+
| Architecture | Unified MoT architecture |
|
| 49 |
+
| Domain | 2D axial brain MRI slices |
|
| 50 |
+
| Tasks | MRI modality imputation, brain MRI understanding/diagnosis |
|
| 51 |
+
| Training data | [RadGenome-Brain_MRI](https://huggingface.co/datasets/JiayuLei/RadGenome-Brain_MRI), using the [UniBrain preprocessed release](https://huggingface.co/datasets/Astrostellar/RadGenome-Brain_MRI_parquet) |
|
| 52 |
+
| Inference precision | BF16 |
|
| 53 |
+
|
| 54 |
+
|
| 55 |
+
## Reported results
|
| 56 |
+
|
| 57 |
+
The following results are reported on the RadGenome-Brain MRI evaluation split in the paper and [project page](https://medicalumm.github.io/unibrain.github.io/).
|
| 58 |
+
|
| 59 |
+
### MRI diagnosis and report generation
|
| 60 |
+
|
| 61 |
+
| Available modalities | Top-1 diagnosis accuracy (%) | ROUGE |
|
| 62 |
+
| --- | ---: | ---: |
|
| 63 |
+
| T1w only | 74.47 | 36.93 |
|
| 64 |
+
| T1w + T2w | 76.60 | 38.23 |
|
| 65 |
+
| T1w + T2w + T2-FLAIR | 78.01 | 38.68 |
|
| 66 |
+
| Complete data | 82.06 | 38.94 |
|
| 67 |
+
|
| 68 |
+
### MRI modality imputation
|
| 69 |
+
|
| 70 |
+
| Imputation sequence | PSNR | Downstream Top-1 accuracy (%) |
|
| 71 |
+
| --- | ---: | ---: |
|
| 72 |
+
| T1w → T2w | 22.23 | 68.09 |
|
| 73 |
+
| T1w, T2w → T2-FLAIR | 22.58 | 67.38 |
|
| 74 |
+
| T1w, T2w, T2-FLAIR → T1c | 22.26 | 74.47 |
|
| 75 |
+
|
| 76 |
+
|
| 77 |
+
## License
|
| 78 |
+
|
| 79 |
+
The UniBrain model weights are released under the Apache License 2.0. UniBrain builds on [BAGEL](https://github.com/ByteDance-Seed/Bagel) and [AutoRG-Brain](https://github.com/ljy19970415/AutoRG-Brain); the code, base model, incorporated components, and datasets retain their respective licenses and terms.
|
| 80 |
+
|
| 81 |
+
## Acknowledgements
|
| 82 |
+
|
| 83 |
+
The implementation is adapted from [BAGEL](https://github.com/ByteDance-Seed/Bagel), a unified multimodal foundation model for natural images. The training and evaluation data are based on [RadGenome-Brain_MRI](https://huggingface.co/datasets/JiayuLei/RadGenome-Brain_MRI) from the [AutoRG-Brain](https://github.com/ljy19970415/AutoRG-Brain) project.
|
| 84 |
+
|
| 85 |
+
## Citation
|
| 86 |
+
|
| 87 |
+
If you find UniBrain useful, please cite:
|
| 88 |
+
|
| 89 |
+
```bibtex
|
| 90 |
+
@article{unibrain2026,
|
| 91 |
+
title = {Unified Multimodal Model for Brain MRI Imputation and Understanding},
|
| 92 |
+
author = {Zhiyun Song, Che Liu, Tian Xia, Avinash Kori, Wenjia Bai},
|
| 93 |
+
journal = {arXiv preprint arXiv:2606.16484},
|
| 94 |
+
year = {2026}
|
| 95 |
+
}
|
| 96 |
+
```
|
| 97 |
+
|