Visual Generation Models
Collection
6 items β’ Updated β’ 1
How to use BiliSakura/PixelFlow-diffusers with Diffusers:
pip install -U diffusers transformers accelerate
import torch
from diffusers import DiffusionPipeline
# switch to "mps" for apple devices
pipe = DiffusionPipeline.from_pretrained("BiliSakura/PixelFlow-diffusers", dtype=torch.bfloat16, device_map="cuda")
prompt = "golden retriever"
image = pipe(prompt).images[0]Self-contained PixelFlow checkpoints for Hugging Face diffusers. Each variant folder ships its own pipeline.py, component modules, and weights.
| Subfolder | Pipeline | Task | Resolution | Params |
|---|---|---|---|---|
PixelFlow-256/ |
PixelFlowPipeline |
class-to-image | 256Γ256 | 677M |
PixelFlow-T2I/ |
PixelFlowT2IPipeline |
text-to-image | 1024Γ1024 | 882M |
BiliSakura/PixelFlow-diffusers/
βββ README.md
βββ PixelFlow-256/
β βββ pipeline.py
β βββ model_index.json
β βββ scheduler/scheduler_config.json
β βββ transformer/
βββ PixelFlow-T2I/
βββ pipeline.py
βββ model_index.json
βββ scheduler/scheduler_config.json
βββ text_encoder/
βββ tokenizer/
βββ transformer/
Each variant is self-contained. The scheduler/ folder contains scheduler_config.json and scheduling_pixelflow.py with PixelFlowScheduler.
No shared helper modules at inference time; only PyPI diffusers plus the local variant directory.
For class-conditional PixelFlow-256/, id2label is embedded in PixelFlow-256/model_index.json (DiT-style).
pipe.id2label β inspect id β English label correspondencepipe.labels β reverse map (English synonym β id)pipe.get_label_ids("golden retriever")pipe(class_labels="golden retriever", ...) β string labels resolved automaticallyClass 207 β golden retriever, 256Γ256, 40 steps ([10, 10, 10, 10]).
Class-to-image:
python demo_inference_c2i.py
Text-to-image:
python demo_inference_t2i.py
PixelFlow-256)
from pathlib import Path
import torch
from diffusers import DiffusionPipeline
model_dir = Path("./PixelFlow-256").resolve()
pipe = DiffusionPipeline.from_pretrained(
str(model_dir),
local_files_only=True,
custom_pipeline=str(model_dir / "pipeline.py"),
trust_remote_code=True,
torch_dtype=torch.bfloat16,
)
pipe.to("cuda")
print(pipe.id2label[207])
print(pipe.get_label_ids("golden retriever"))
generator = torch.Generator(device="cuda").manual_seed(42)
image = pipe(
class_labels="golden retriever",
height=256,
width=256,
num_inference_steps=[10, 10, 10, 10],
guidance_scale=4.0,
generator=generator,
).images[0]
image.save("demo.png")
PixelFlow-T2I)
Uses google/flan-t5-xl when text_encoder/ is not bundled.
from pathlib import Path
import torch
from diffusers import DiffusionPipeline
model_dir = Path("./PixelFlow-T2I").resolve()
pipe = DiffusionPipeline.from_pretrained(
str(model_dir),
local_files_only=True,
custom_pipeline=str(model_dir / "pipeline.py"),
trust_remote_code=True,
torch_dtype=torch.bfloat16,
)
pipe.to("cuda")
generator = torch.Generator(device="cuda").manual_seed(42)
image = pipe(
prompt="A golden retriever playing in a sunny garden",
height=1024,
width=1024,
num_inference_steps=[10, 10, 10, 10],
guidance_scale=4.0,
generator=generator,
).images[0]
image.save("demo.png")