nielsr HF Staff commited on
Commit
b04bba8
·
verified ·
1 Parent(s): ed4e07f

Add model card for UniX

Browse files

This PR adds a comprehensive model card for UniX, a unified medical foundation model for chest X-ray understanding and generation. It includes:
- Metadata with the `any-to-any` pipeline tag.
- A summary of the model's decoupled dual-branch architecture.
- Performance highlights and key features from the paper.
- A quick start guide for environment setup and running the Gradio demo.
- Links to the arXiv paper and official GitHub repository.

Files changed (1) hide show
  1. README.md +49 -3
README.md CHANGED
@@ -1,3 +1,49 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ pipeline_tag: any-to-any
4
+ ---
5
+
6
+ # UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation
7
+
8
+ UniX is a next-generation unified medical foundation model specifically designed for chest X-ray understanding and generation. It addresses the inherent conflict between semantic abstraction (understanding) and pixel-level reconstruction (generation) by decoupling the tasks into an autoregressive branch and a diffusion branch, coordinated via a cross-modal self-attention mechanism.
9
+
10
+ - **Paper:** [UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation](https://huggingface.co/papers/2601.11522)
11
+ - **Repository:** [https://github.com/ZrH42/UniX](https://github.com/ZrH42/UniX)
12
+
13
+ ## Highlights
14
+
15
+ - **Decoupled Dual-Branch Architecture:** Unifies autoregressive understanding and diffusion-based generation to fundamentally resolve intrinsic task conflicts and feature interference.
16
+ - **Superior Efficiency & Quality:** Achieves state-of-the-art performance with **only 25% of the parameters** compared to previous unified models like LLM-CXR, while boosting understanding (Micro-F1) by 46.1% and generation quality (FD-RadDino) by 24.2%.
17
+ - **Cross-Modal Synergy:** Introduces a novel self-attention mechanism that enables dynamic semantic guidance for high-fidelity, continuous medical image synthesis.
18
+
19
+ ## Quick Start
20
+
21
+ To set up the environment and run the model, follow the instructions from the official repository:
22
+
23
+ 1. **Install Environment**
24
+
25
+ ```bash
26
+ git clone https://github.com/ZrH42/UniX.git
27
+ cd UniX
28
+ conda create -n unix python=3.10 -y
29
+ conda activate unix
30
+ bash install.sh
31
+ ```
32
+
33
+ 2. **Launch Gradio Demo**
34
+ ```bash
35
+ python demo_gradio.py
36
+ ```
37
+
38
+ ## Citation
39
+
40
+ If you find UniX useful in your research, please cite:
41
+
42
+ ```bibtex
43
+ @article{zhang2026unix,
44
+ title={UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation},
45
+ author={Zhang, Ruiheng and Yao, Jingfeng and Zhao, Huangxuan and Yan, Hao and He, Xiao and Chen, Lei and Wei, Zhou and Luo, Yong and Wang, Zengmao and Zhang, Lefei and Tao, Dacheng and Du, Bo},
46
+ journal={arXiv preprint arXiv:2601.11522},
47
+ year={2026}
48
+ }
49
+ ```