zhiyuns commited on
Commit
55263a9
·
1 Parent(s): eae8616

update readme

Browse files
Files changed (1) hide show
  1. README.md +94 -0
README.md CHANGED
@@ -1,3 +1,97 @@
1
  ---
2
  license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ base_model:
4
+ - ByteDance-Seed/BAGEL-7B-MoT
5
+ pipeline_tag: any-to-any
6
+ library_name: bagel-mot
7
+ language:
8
+ - en
9
+ datasets:
10
+ - Astrostellar/RadGenome-Brain_MRI_parquet
11
+ tags:
12
+ - medical
13
+ - medical-imaging
14
+ - brain-mri
15
+ - multimodal
16
+ - image-to-image
17
+ - image-text-to-text
18
+ - modality-imputation
19
  ---
20
+
21
+ # UniBrain: Unified Multimodal Model for Brain MRI Imputation and Understanding
22
+
23
+ <p align="left">
24
+ <a href="https://medicalumm.github.io/unibrain.github.io/"><img src="https://img.shields.io/badge/UniBrain-Project_Page-0A66C2?logo=safari&logoColor=white" alt="UniBrain project page"></a>
25
+ <a href="https://arxiv.org/abs/2606.16484"><img src="https://img.shields.io/badge/UniBrain-Paper-red?logo=arxiv&logoColor=white" alt="UniBrain paper"></a>
26
+ <a href="https://github.com/zhiyuns/UniBrain"><img src="https://img.shields.io/badge/UniBrain-Code-536af5?logo=github&logoColor=white" alt="UniBrain code"></a>
27
+ </p>
28
+
29
+ > **UniBrain** is a unified multimodal model for brain MRI analysis. In one autoregressive context, it can impute missing MRI sequences, interpret the available and generated images, and produce a disease diagnosis. This repository hosts the UniBrain model checkpoints.
30
+
31
+ For installation, training, evaluation, and usage instructions, please visit the [official GitHub repository](https://github.com/zhiyuns/UniBrain).
32
+
33
+ <p align="center">
34
+ <img src="https://github.com/zhiyuns/UniBrain/raw/main/assets/main_figure.png" alt="Overview of the UniBrain framework" width="95%">
35
+ </p>
36
+
37
+ UniBrain is initialized from [BAGEL-7B-MoT](https://huggingface.co/ByteDance-Seed/BAGEL-7B-MoT), a Mixture-of-Transformer-Experts (MoT) model for multimodal understanding and generation. It adapts BAGEL to brain MRI using an interleaved, description-enriched training flow and three main ideas:
38
+
39
+ - **Unified MRI generation and understanding:** missing-sequence imputation and downstream interpretation share one autoregressive context.
40
+ - **Self-alignment:** medical image reconstruction provides dense supervision for fine-grained anatomical representation learning without requiring detailed captions for every image.
41
+ - **Dynamic hidden states:** training conditions the model on its own generated visual context to reduce exposure bias during long multimodal sequences.
42
+
43
+ ## Model details
44
+
45
+ | Item | Description |
46
+ | --- | --- |
47
+ | Base model | [ByteDance-Seed/BAGEL-7B-MoT](https://huggingface.co/ByteDance-Seed/BAGEL-7B-MoT) |
48
+ | Architecture | Unified MoT architecture |
49
+ | Domain | 2D axial brain MRI slices |
50
+ | Tasks | MRI modality imputation, brain MRI understanding/diagnosis |
51
+ | Training data | [RadGenome-Brain_MRI](https://huggingface.co/datasets/JiayuLei/RadGenome-Brain_MRI), using the [UniBrain preprocessed release](https://huggingface.co/datasets/Astrostellar/RadGenome-Brain_MRI_parquet) |
52
+ | Inference precision | BF16 |
53
+
54
+
55
+ ## Reported results
56
+
57
+ The following results are reported on the RadGenome-Brain MRI evaluation split in the paper and [project page](https://medicalumm.github.io/unibrain.github.io/).
58
+
59
+ ### MRI diagnosis and report generation
60
+
61
+ | Available modalities | Top-1 diagnosis accuracy (%) | ROUGE |
62
+ | --- | ---: | ---: |
63
+ | T1w only | 74.47 | 36.93 |
64
+ | T1w + T2w | 76.60 | 38.23 |
65
+ | T1w + T2w + T2-FLAIR | 78.01 | 38.68 |
66
+ | Complete data | 82.06 | 38.94 |
67
+
68
+ ### MRI modality imputation
69
+
70
+ | Imputation sequence | PSNR | Downstream Top-1 accuracy (%) |
71
+ | --- | ---: | ---: |
72
+ | T1w → T2w | 22.23 | 68.09 |
73
+ | T1w, T2w → T2-FLAIR | 22.58 | 67.38 |
74
+ | T1w, T2w, T2-FLAIR → T1c | 22.26 | 74.47 |
75
+
76
+
77
+ ## License
78
+
79
+ The UniBrain model weights are released under the Apache License 2.0. UniBrain builds on [BAGEL](https://github.com/ByteDance-Seed/Bagel) and [AutoRG-Brain](https://github.com/ljy19970415/AutoRG-Brain); the code, base model, incorporated components, and datasets retain their respective licenses and terms.
80
+
81
+ ## Acknowledgements
82
+
83
+ The implementation is adapted from [BAGEL](https://github.com/ByteDance-Seed/Bagel), a unified multimodal foundation model for natural images. The training and evaluation data are based on [RadGenome-Brain_MRI](https://huggingface.co/datasets/JiayuLei/RadGenome-Brain_MRI) from the [AutoRG-Brain](https://github.com/ljy19970415/AutoRG-Brain) project.
84
+
85
+ ## Citation
86
+
87
+ If you find UniBrain useful, please cite:
88
+
89
+ ```bibtex
90
+ @article{unibrain2026,
91
+ title = {Unified Multimodal Model for Brain MRI Imputation and Understanding},
92
+ author = {Zhiyun Song, Che Liu, Tian Xia, Avinash Kori, Wenjia Bai},
93
+ journal = {arXiv preprint arXiv:2606.16484},
94
+ year = {2026}
95
+ }
96
+ ```
97
+