| license: mit | |
| datasets: | |
| - liuhaotian/LLaVA-Instruct-150K | |
| language: | |
| - en | |
| base_model: | |
| - lmsys/vicuna-7b-v1.5 | |
| # Transfusion | |
| Paper: [Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model](https://arxiv.org/abs/2408.11039) | |
| Code: [lavinal712/Transfusion](https://github.com/lavinal712/Transfusion) |