shallowdream204 commited on
Commit
a970064
·
verified ·
1 Parent(s): 9164cab

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +59 -3
README.md CHANGED
@@ -1,3 +1,59 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ ---
4
+
5
+ # BitDance: Scaling Autoregressive Generative Models with Binary Tokens
6
+
7
+ <p align="center">
8
+ <a href="TBD">
9
+ <img
10
+ src="https://img.shields.io/badge/Project-Page-0A66C2?logo=chromewebstore&logoColor=0A66C2"
11
+ alt="Project Page"
12
+ />
13
+ </a>
14
+ <a href="TBD">
15
+ <img
16
+ src="https://img.shields.io/badge/arXiv paper-TBD-red?logo=arxiv&logoColor=red"
17
+ alt="BitDance Paper on arXiv"
18
+ />
19
+ </a>
20
+ <a href="https://huggingface.co/collections/shallowdream204/bitdance">
21
+ <img
22
+ src="https://img.shields.io/badge/Weights-BitDance-yellow?logo=huggingface&logoColor=yellow"
23
+ alt="BitDance Model"
24
+ />
25
+ </a>
26
+ <a href="https://huggingface.co/spaces/shallowdream204/BitDance-14B-64x">
27
+ <img
28
+ src="https://img.shields.io/badge/HF Space-Demo-orange?logo=huggingface&logoColor=yellow"
29
+ alt="BitDance Demo"
30
+ />
31
+ </a>
32
+ </p>
33
+
34
+ <p align="center"><img src="https://github.com/shallowdream204/BitDance/raw/main/assets/speed.webp" width=90%"></p>
35
+
36
+
37
+ > [Yuang Ai*](https://shallowdream204.github.io/), [Jiaming Han*](https://csuhan.com/), [Shaobin Zhuang*](https://scholar.google.com/citations?user=PGaDirMAAAAJ), [Weijia Mao](https://scholar.google.com/citations?user=S7bGBmkyNtEC), [Xuefeng Hu](https://xuefenghu.me/), [Ziyan Yang](https://ziyanyang.github.io/), [Zhenheng Yang](https://zhenheny.github.io/), [Huaibo Huang†](https://hhb072.github.io/), [Xiangyu Yue†](https://xyue.io/), [Hao Chen*†‡](https://haochen-rye.github.io/)
38
+ >
39
+ > <sup>*</sup> Equal Contribution&nbsp;&nbsp;<sup>†</sup> Corresponding Author&nbsp;&nbsp;<sup>‡</sup> Project Lead
40
+ >
41
+ > For visual generation, discrete autoregressive models often struggle with poor tokenizer reconstruction, difficulties in sampling from large vocabularies, and slow token-by-token generation speeds. We present **BitDance**, which addresses these challenges via a large-vocabulary binary tokenizer, a binary diffusion head for sampling in large discrete space, and a next-patch diffusion paradigm that enables efficient multitoken prediction. BitDance is an open-source discrete autoregressive foundation model with 14B parameters, trained on large-scale multimodal tokens. While maintaining the standard language modeling paradigm for text tokens, BitDance employs a next-patch diffusion paradigm for visual tokens to predict multiple tokens in parallel—up to 64 per step. This unified multimodal framework is simple, scalable, and capable of efficiently generating high-resolution, photorealistic images.
42
+
43
+ This repository hosts the BitDance model weights for ImageNet Generation. For detailed instructions, please visit our [GitHub Repository](https://github.com/shallowdream204/BitDance).
44
+
45
+
46
+ ## 🪪 License
47
+
48
+ BitDance is licensed under the Apache 2.0 license.
49
+
50
+ ## 📖 Citation
51
+ If you find our work useful for your research, please consider citing our paper:
52
+ ```bibtex
53
+ @article{ai2026bitdance,
54
+ title = {BitDance: Scaling Autoregressive Generative Models with Binary Tokens},
55
+ author = {Ai, Yuang and Han, Jiaming and Zhuang, Shaobin and Hu, Xuefeng and Yang, Ziyan and Yang, Zhenheng and Huang, Huaibo and Yue, Xiangyu and Chen, Hao},
56
+ journal = {TBD},
57
+ year = {2026}
58
+ }
59
+ ```