Remote Sensing Visual Generative Models
Collection
diffusers implementation • 24 items • Updated
• 1
we do not have a full checkpoint conversion validation, if you encounter pipeline loading failure and unsidered output, please contact me via bili_sakura@zju.edu.cn
Diffusers-style wrapped CDM (DiT) checkpoint for BRCA, converted from ZoomLDM cdm_dit training outputs.
(B, 512, 65))pipeline.py)Use this model to sample BRCA conditioning embeddings that can be consumed by downstream ZoomLDM workflows.
pipeline.py: custom DiffusionPipeline implementation (CDMDiTPipeline)model_index.json: diffusers metadatacdm/: active model weights/config used by pipelinescheduler/: DDIM scheduler configmodel_raw.safetensors: non-EMA training weights (optional)optimizer.pt: optimizer state (optional)config.json: conversion metadataimport torch
from diffusers import DiffusionPipeline
pipe = DiffusionPipeline.from_pretrained(
"BiliSakura/ZoomLDM-CDM-brca",
custom_pipeline="pipeline.py",
trust_remote_code=True,
).to("cuda")
out = pipe(
batch_size=2,
magnification=torch.tensor([0, 0], device="cuda"), # class labels 0..7
num_inference_steps=50,
guidance_scale=1.0,
)
samples = out.samples # (B, 512, 65)
@InProceedings{Yellapragada_2025_CVPR,
author = {Yellapragada, Srikar and Graikos, Alexandros and Triaridis, Kostas and Prasanna, Prateek and Gupta, Rajarsi and Saltz, Joel and Samaras, Dimitris},
title = {ZoomLDM: Latent Diffusion Model for Multi-scale Image Generation},
booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR)},
month = {June},
year = {2025},
pages = {23453-23463}
}
@inproceedings{Peebles2023DiT,
title={Scalable Diffusion Models with Transformers},
author={Peebles, William and Xie, Saining},
booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
year={2023}
}