Unconditional Image Generation
Diffusers
Safetensors
English
lightningdit
image-generation
class-conditional
imagenet
flow-matching
Instructions to use BiliSakura/LightningDiT-diffusers with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use BiliSakura/LightningDiT-diffusers with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("BiliSakura/LightningDiT-diffusers", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Notebooks
- Google Colab
- Kaggle
metadata
library_name: diffusers
pipeline_tag: unconditional-image-generation
tags:
- diffusers
- lightningdit
- image-generation
- class-conditional
- imagenet
- flow-matching
license: mit
inference: true
widget:
- output:
url: LightningDit-XL-1-256/demo.png
language:
- en
LightningDiT-diffusers
Diffusers-ready checkpoints for LightningDiT (VA-VAE–aligned latent diffusion with flow matching), converted from hustvl/lightningdit-xl-imagenet256-800ep for local/offline use.
This root folder is a model collection that contains:
LightningDit-XL-1-256
Each subfolder is a self-contained Diffusers model repo with:
pipeline.py(LightningDiTPipeline)transformer/transformer_lightningdit.pyand weightsscheduler/scheduler_config.json(FlowMatchHeunDiscreteScheduler,shift=0.3)vae/(REPA-E/vavae-hf)
Each variant embeds English id2label in model_index.json, so class labels can be passed as ImageNet ids or English synonym strings.
Demo
Class-conditional sample (ImageNet class 207, golden retriever), LightningDiT-XL/1 at 256×256, 250 steps, CFG 6.7, cfg_interval_start=0.125, timestep_shift=0.3, seed 0.
Model Paths
| Model | Resolution | Local path |
|---|---|---|
| LightningDiT-XL/1 | 256×256 | ./LightningDit-XL-1-256 |
Inference
import torch
from diffusers import DiffusionPipeline
pipe = DiffusionPipeline.from_pretrained(
"./LightningDit-XL-1-256",
trust_remote_code=True,
torch_dtype=torch.bfloat16,
).to("cuda")
class_id = pipe.get_label_ids("golden retriever")[0]
image = pipe(
class_labels=class_id,
num_inference_steps=250,
guidance_scale=6.7,
cfg_interval_start=0.125,
timestep_shift=0.3,
generator=torch.Generator(device="cuda").manual_seed(0),
).images[0]
Citation
@inproceedings{yao2025reconstruction,
title={Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models},
author={Yao, Jingfeng and Yang, Bin and Wang, Xinggang},
booktitle={CVPR},
year={2025}
}
