nielsr HF Staff commited on
Commit
0e619b0
·
verified ·
1 Parent(s): 9c99bba

Improve model card metadata and content

Browse files

This PR improves the model card by:
- Adding `library_name: transformers` to the metadata based on the repository's configuration.
- Refining the structure of the README for better readability on the Hub.
- Retaining the essential installation and inference code snippets found in the official documentation.
- Including the performance benchmarks to showcase the model's competitive results.

Files changed (1) hide show
  1. README.md +22 -216
README.md CHANGED
@@ -1,11 +1,12 @@
1
  ---
2
- license: apache-2.0
 
3
  language:
4
  - en
5
  - zh
6
- base_model:
7
- - shallowdream204/BitDance-14B-16x
8
  pipeline_tag: text-to-image
 
9
  ---
10
 
11
  # BitDance: Scaling Autoregressive Generative Models with Binary Tokens
@@ -17,7 +18,7 @@ pipeline_tag: text-to-image
17
  alt="Project Page"
18
  />
19
  </a>
20
- <a href="https://arxiv.org/abs/2602.14041">
21
  <img
22
  src="https://img.shields.io/badge/arXiv paper-2602.14041-red?logo=arxiv&logoColor=red"
23
  alt="BitDance Paper on arXiv"
@@ -45,19 +46,15 @@ pipeline_tag: text-to-image
45
 
46
  <p align="center"><img src="https://github.com/shallowdream204/BitDance/raw/main/assets/speed.webp" width=90%"></p>
47
 
48
-
49
  > [Yuang Ai*](https://shallowdream204.github.io/), [Jiaming Han*](https://csuhan.com/), [Shaobin Zhuang*](https://scholar.google.com/citations?user=PGaDirMAAAAJ), [Weijia Mao](https://scholar.google.com/citations?user=S7bGBmkyNtEC), [Xuefeng Hu](https://xuefenghu.me/), [Ziyan Yang](https://ziyanyang.github.io/), [Zhenheng Yang](https://zhenheny.github.io/), [Huaibo Huang†](https://hhb072.github.io/), [Xiangyu Yue†](https://xyue.io/), [Hao Chen*†‡](https://haochen-rye.github.io/)
50
- >
51
- > <sup>*</sup> Equal Contribution&nbsp;&nbsp;<sup>†</sup> Corresponding Author&nbsp;&nbsp;<sup>‡</sup> Project Lead
52
- >
53
- > For visual generation, discrete autoregressive models often struggle with poor tokenizer reconstruction, difficulties in sampling from large vocabularies, and slow token-by-token generation speeds. We present **BitDance**, which addresses these challenges via a large-vocabulary binary tokenizer, a binary diffusion head for sampling in large discrete space, and a next-patch diffusion paradigm that enables efficient multitoken prediction. BitDance is an open-source discrete autoregressive foundation model with 14B parameters, trained on large-scale multimodal tokens. While maintaining the standard language modeling paradigm for text tokens, BitDance employs a next-patch diffusion paradigm for visual tokens to predict multiple tokens in parallel—up to 64 per step. This unified multimodal framework is simple, scalable, and capable of efficiently generating high-resolution, photorealistic images.
54
 
55
- <p align="center"><img src="https://github.com/shallowdream204/BitDance/raw/main/assets/teaser.webp" width="90%"></p>
56
 
 
57
 
58
  ## ⚡ Quick Start
59
 
60
- 1️⃣ Create Conda Environment and Install Package
61
  ```bash
62
  git clone https://github.com/shallowdream204/BitDance.git
63
  cd BitDance
@@ -67,62 +64,30 @@ pip install -r requirements.txt
67
  pip install flash_attn==2.8.2 --no-build-isolation
68
  ```
69
 
70
- 2️⃣ Download Model Weights
71
-
72
  We offer two models, BitDance-14B-64x and BitDance-14B-16x, which can predict 64 and 16 tokens in parallel at each step, respectively.
73
- | Model | #Token per Step | Step (1024px) | Supported Size | Huggingface |
74
- |:-------:|:----:|:----:|:-----------:|:----:|
75
- | BitDance-14B-64x| 64 | 64 |1024px | [BitDance-14B-64x](https://huggingface.co/shallowdream204/BitDance-14B-64x) |
76
- | BitDance-14B-16x| 16 | 256 |512&1024px | [BitDance-14B-16x](https://huggingface.co/shallowdream204/BitDance-14B-16x) |
77
 
 
 
 
 
78
 
 
79
  ```python
80
- from huggingface_hub import snapshot_download
81
-
82
- save_dir = "models/BitDance-14B-64x"
83
- repo_id = "shallowdream204/BitDance-14B-64x"
84
- cache_dir = save_dir + "/cache"
85
-
86
- snapshot_download(cache_dir=cache_dir,
87
- local_dir=save_dir,
88
- repo_id=repo_id,
89
- local_dir_use_symlinks=False,
90
- resume_download=True,
91
- allow_patterns=["*.json", "*.safetensors", "*.bin", "*.py", "*.md", "*.txt"],
92
- )
93
-
94
- save_dir = "models/BitDance-14B-16x"
95
- repo_id = "shallowdream204/BitDance-14B-16x"
96
- cache_dir = save_dir + "/cache"
97
-
98
- snapshot_download(cache_dir=cache_dir,
99
- local_dir=save_dir,
100
- repo_id=repo_id,
101
- local_dir_use_symlinks=False,
102
- resume_download=True,
103
- allow_patterns=["*.json", "*.safetensors", "*.bin", "*.py", "*.md", "*.txt"],
104
- )
105
-
106
- ```
107
-
108
- 3️⃣ T2I Inference (check [here](https://github.com/shallowdream204/BitDance/blob/main/modeling/t2i_pipeline.py#L21) for the supported image resolution)
109
- ```python
110
- # example_t2i.py
111
  from modeling.t2i_pipeline import BitDanceT2IPipeline
112
 
113
- model_path = 'models/BitDance-14B-64x'
114
- # model_path = 'models/BitDance-14B-16x'
115
  device = 'cuda'
116
 
117
  pipe = BitDanceT2IPipeline(model_path=model_path, device=device)
118
 
119
- prompt = "A close-up portrait in a cinematic photography style, capturing a girl-next-door look on a sunny daytime urban street. She wears a khaki sweater, with long, flowing hair gently draped over her shoulders. Her head is turned slightly, revealing soft facial features illuminated by realistic, delicate sunlight coming from the left. The sunlight subtly highlights individual strands of her hair. The image has a Canon film-like color tone, evoking a warm nostalgic atmosphere."
120
 
121
  image = pipe.generate(
122
  prompt=prompt,
123
  height=1024,
124
  width=1024,
125
- num_sampling_steps=50, # may adjust to 25 steps for faster inference, but may slightly reduce quality
126
  guidance_scale=7.5,
127
  num_images=1,
128
  seed=42
@@ -131,178 +96,19 @@ image = pipe.generate(
131
  image.save("example.png")
132
  ```
133
 
134
- ## 🤗 Demo
135
-
136
- 🔥 Try the Huggingface Space demo to start playing with BitDance: [BitDance-Demo](https://huggingface.co/spaces/shallowdream204/BitDance-14B-64x)
137
-
138
- You can also run the demo locally:
139
- ```bash
140
- python app.py
141
- ```
142
-
143
  ## 📊 Model Performance
144
- <div style="overflow-x: auto; margin-bottom: 16px;">
145
- <table style="border-collapse: collapse; width: 100%;">
146
- <thead>
147
- <tr>
148
- <th style="white-space: nowrap; padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa;" rowspan="2">Model</th>
149
- <th style="white-space: nowrap; padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa;" rowspan="2">Open Source</th>
150
- <!-- DPG-Bench 移动到这里 -->
151
- <th style="white-space: nowrap; padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa;" rowspan="2">DPG-Bench</th>
152
- <!-- 新增 GenEval 列 -->
153
- <th style="white-space: nowrap; padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa;" rowspan="2">GenEval</th>
154
- <th style="padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa; text-align: center;" colspan="2">OneIG-Bench</th>
155
- <th style="padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa; text-align: center;" colspan="2">TIIF-Bench</th>
156
- </tr>
157
- <tr>
158
- <th style="white-space: nowrap; padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa; text-align: center;">EN</th>
159
- <th style="white-space: nowrap; padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa; text-align: center;">ZH</th>
160
- <th style="white-space: nowrap; padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa; text-align: center;">short</th>
161
- <th style="white-space: nowrap; padding: 8px; border: 1px solid #d0d7de; background-color: #f6f8fa; text-align: center;">long</th>
162
- </tr>
163
- </thead>
164
- <tbody>
165
- <tr>
166
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">GPT Image 1</td>
167
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✗</td>
168
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">85.15</td>
169
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.84</td>
170
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.533</td>
171
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.474</td>
172
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">89.15</td>
173
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">88.29</td>
174
- </tr>
175
- <tr>
176
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">Seedream 3.0</td>
177
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✗</td>
178
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">88.27</td>
179
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.84</td>
180
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.530</td>
181
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.528</td>
182
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">86.02</td>
183
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">84.31</td>
184
- </tr>
185
- <tr>
186
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">Qwen-Image</td>
187
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
188
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">88.32</td>
189
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.87</td>
190
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.539</td>
191
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.548</td>
192
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">86.14</td>
193
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">86.83</td>
194
- </tr>
195
- <tr>
196
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">Z-Image</td>
197
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
198
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">88.14</td>
199
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.84</td>
200
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.546</td>
201
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.535</td>
202
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">80.20</td>
203
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">83.01</td>
204
- </tr>
205
- <tr>
206
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">Z-Image-Turbo</td>
207
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
208
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">84.86</td>
209
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.82</td>
210
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.528</td>
211
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.507</td>
212
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">77.73</td>
213
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">80.05</td>
214
- </tr>
215
- <tr>
216
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">FLUX.1 [Dev]</td>
217
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
218
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">83.84</td>
219
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.66</td>
220
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.434</td>
221
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">-</td>
222
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">71.09</td>
223
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">71.78</td>
224
- </tr>
225
- <tr>
226
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">BAGEL</td>
227
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
228
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">85.07</td>
229
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.88</td>
230
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.361</td>
231
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.370</td>
232
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">71.50</td>
233
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">71.70</td>
234
- </tr>
235
- <tr>
236
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">Infinity</td>
237
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
238
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">83.46</td>
239
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.73</td>
240
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">-</td>
241
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">-</td>
242
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">62.07</td>
243
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">62.32</td>
244
- </tr>
245
- <tr>
246
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">Janus-Pro</td>
247
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
248
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">84.19</td>
249
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.80</td>
250
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.267</td>
251
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.240</td>
252
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">66.50</td>
253
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">65.01</td>
254
- </tr>
255
- <tr>
256
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">Show-o2</td>
257
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
258
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">86.14</td>
259
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.76</td>
260
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.308</td>
261
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">-</td>
262
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">59.72</td>
263
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">58.86</td>
264
- </tr>
265
- <tr>
266
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">NextStep-1</td>
267
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
268
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">85.28</td>
269
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.73</td>
270
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.418</td>
271
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">-</td>
272
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">-</td>
273
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">-</td>
274
- </tr>
275
- <tr>
276
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;">GLM-Image</td>
277
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
278
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">84.78</td>
279
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">-</td>
280
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.528</td>
281
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.511</td>
282
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">81.01</td>
283
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">81.02</td>
284
- </tr>
285
- <tr>
286
- <td style="padding: 8px; border: 1px solid #d0d7de; white-space:nowrap;font-weight:bold;">BitDance</td>
287
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">✓</td>
288
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">88.28</td>
289
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.86</td>
290
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.532</td>
291
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">0.512</td>
292
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">79.64</td>
293
- <td style="padding: 8px; border: 1px solid #d0d7de; text-align: center;">78.12</td>
294
- </tr>
295
- </tbody>
296
- </table>
297
- </div>
298
 
 
 
 
 
 
299
 
300
  ## 🪪 License
301
 
302
  BitDance is licensed under the Apache 2.0 license.
303
 
304
  ## 📖 Citation
305
- If you find our work useful for your research, please consider citing our paper:
306
  ```bibtex
307
  @article{ai2026bitdance,
308
  title = {BitDance: Scaling Autoregressive Generative Models with Binary Tokens},
 
1
  ---
2
+ base_model:
3
+ - shallowdream204/BitDance-14B-16x
4
  language:
5
  - en
6
  - zh
7
+ license: apache-2.0
 
8
  pipeline_tag: text-to-image
9
+ library_name: transformers
10
  ---
11
 
12
  # BitDance: Scaling Autoregressive Generative Models with Binary Tokens
 
18
  alt="Project Page"
19
  />
20
  </a>
21
+ <a href="https://huggingface.co/papers/2602.14041">
22
  <img
23
  src="https://img.shields.io/badge/arXiv paper-2602.14041-red?logo=arxiv&logoColor=red"
24
  alt="BitDance Paper on arXiv"
 
46
 
47
  <p align="center"><img src="https://github.com/shallowdream204/BitDance/raw/main/assets/speed.webp" width=90%"></p>
48
 
 
49
  > [Yuang Ai*](https://shallowdream204.github.io/), [Jiaming Han*](https://csuhan.com/), [Shaobin Zhuang*](https://scholar.google.com/citations?user=PGaDirMAAAAJ), [Weijia Mao](https://scholar.google.com/citations?user=S7bGBmkyNtEC), [Xuefeng Hu](https://xuefenghu.me/), [Ziyan Yang](https://ziyanyang.github.io/), [Zhenheng Yang](https://zhenheny.github.io/), [Huaibo Huang†](https://hhb072.github.io/), [Xiangyu Yue†](https://xyue.io/), [Hao Chen*†‡](https://haochen-rye.github.io/)
 
 
 
 
50
 
51
+ For visual generation, discrete autoregressive models often struggle with poor tokenizer reconstruction, difficulties in sampling from large vocabularies, and slow token-by-token generation speeds. We present **BitDance**, which addresses these challenges via a large-vocabulary binary tokenizer, a binary diffusion head for sampling in large discrete space, and a next-patch diffusion paradigm that enables efficient multitoken prediction. BitDance is an open-source discrete autoregressive foundation model with 14B parameters, trained on large-scale multimodal tokens.
52
 
53
+ <p align="center"><img src="https://github.com/shallowdream204/BitDance/raw/main/assets/teaser.webp" width="90%"></p>
54
 
55
  ## ⚡ Quick Start
56
 
57
+ ### 1. Installation
58
  ```bash
59
  git clone https://github.com/shallowdream204/BitDance.git
60
  cd BitDance
 
64
  pip install flash_attn==2.8.2 --no-build-isolation
65
  ```
66
 
67
+ ### 2. Download Model Weights
 
68
  We offer two models, BitDance-14B-64x and BitDance-14B-16x, which can predict 64 and 16 tokens in parallel at each step, respectively.
 
 
 
 
69
 
70
+ | Model | #Token per Step | Step (1024px) | Supported Size |
71
+ |:-------:|:----:|:----:|:-----------:|
72
+ | BitDance-14B-64x| 64 | 64 |1024px |
73
+ | BitDance-14B-16x| 16 | 256 |512&1024px |
74
 
75
+ ### 3. Text-to-Image Inference
76
  ```python
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
77
  from modeling.t2i_pipeline import BitDanceT2IPipeline
78
 
79
+ model_path = 'shallowdream204/BitDance-14B-16x'
 
80
  device = 'cuda'
81
 
82
  pipe = BitDanceT2IPipeline(model_path=model_path, device=device)
83
 
84
+ prompt = "A close-up portrait in a cinematic photography style, capturing a girl-next-door look on a sunny daytime urban street. She wears a khaki sweater, with long, flowing hair gently draped over her shoulders. Her head is turned slightly, revealing soft facial features illuminated by realistic, delicate sunlight coming from the left."
85
 
86
  image = pipe.generate(
87
  prompt=prompt,
88
  height=1024,
89
  width=1024,
90
+ num_sampling_steps=50,
91
  guidance_scale=7.5,
92
  num_images=1,
93
  seed=42
 
96
  image.save("example.png")
97
  ```
98
 
 
 
 
 
 
 
 
 
 
99
  ## 📊 Model Performance
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
100
 
101
+ | Model | Open Source | DPG-Bench | GenEval | OneIG-Bench (EN) | TIIF-Bench (short) |
102
+ |:---:|:---:|:---:|:---:|:---:|:---:|
103
+ | Qwen-Image | ✓ | 88.32 | 0.87 | 0.539 | 86.14 |
104
+ | FLUX.1 [Dev] | ✓ | 83.84 | 0.66 | 0.434 | 71.09 |
105
+ | **BitDance** | **✓** | **88.28** | **0.86** | **0.532** | **79.64** |
106
 
107
  ## 🪪 License
108
 
109
  BitDance is licensed under the Apache 2.0 license.
110
 
111
  ## 📖 Citation
 
112
  ```bibtex
113
  @article{ai2026bitdance,
114
  title = {BitDance: Scaling Autoregressive Generative Models with Binary Tokens},