DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Paper: arXiv:2412.07589
import torch
from diffusers import DiffusionPipeline
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("jianzongwu/DiffSensei", dtype=torch.bfloat16, device_map="cuda")
prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]

Model checkpoint for the paper "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation".
Please see the GitHub repository for usage instructions.
Project page: https://jianzongwu.github.io/projects/diffsensei
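The loading snippet above hard-codes "cuda" and leaves the switch to "mps" as a manual edit. As a minimal sketch, that choice could be made at runtime instead. The helper name `pick_device` and the CPU fallback are assumptions for illustration and are not part of the DiffSensei code; the function takes plain booleans so it can be checked without a GPU.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Return a device string, preferring CUDA, then Apple MPS, then CPU.

    Hypothetical helper: the order of preference is an assumption,
    not something the model card specifies.
    """
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


# With PyTorch installed, the flags would typically come from:
#   import torch
#   device = pick_device(torch.cuda.is_available(),
#                        torch.backends.mps.is_available())
# and `device` would then replace the hard-coded "cuda" string.
print(pick_device(False, True))
```

Whether the checkpoint loads through the generic pipeline call at all is not confirmed here; the GitHub repository remains the reference for actual usage.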