--- license: mit datasets: - liuhaotian/LLaVA-Instruct-150K language: - en base_model: - lmsys/vicuna-7b-v1.5 --- # Transfusion Paper: [Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model](https://arxiv.org/abs/2408.11039) Code: [lavinal712/Transfusion](https://github.com/lavinal712/Transfusion)