| license: mit | |
| library_name: diffusers | |
| pipeline_tag: text-to-image | |
| Model checkpoint of paper [DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation](https://arxiv.org/abs/2412.07589) | |
| Please see [GitHub repo](https://github.com/jianzongwu/DiffSensei) to get the usage | |
| Project page: https://jianzongwu.github.io/projects/diffsensei | |
| This repo has 8bit quantized versions of Clip image generator and MLLM |