BiliSakura
/

DiffusionSat

DiffusionSatPipeline

Model card Files Files and versions

DiffusionSat / README.md

BiliSakura's picture

Upload folder using huggingface_hub

146a3df verified 2 days ago

|

history blame contribute delete

3.09 kB

	---
	language: en
	library_name: diffusers
	pipeline_tag: text-to-image
	tags:
	- satellite
	- controlnet
	- diffusers
	- text-to-image
	---

	# DiffusionSat Custom Pipelines

	Custom community pipelines for loading DiffusionSat checkpoints directly with `diffusers.DiffusionPipeline.from_pretrained()`.

	> See [Diffusers Community Pipeline Documentation](https://huggingface.co/docs/diffusers/using-diffusers/custom_pipeline_overview)

	## Model Index

	`model_index.json` is set to the default text-to-image pipeline (`DiffusionSatPipeline`) so `DiffusionPipeline.from_pretrained()` works out of the box. The ControlNet variant is loaded via `custom_pipeline` plus the `controlnet` subfolder, as shown below.

	## Available Pipelines

	This directory contains two custom pipelines:

	1. `pipeline_diffusionsat.py`: Standard text-to-image pipeline with DiffusionSat metadata support.
	2. `pipeline_diffusionsat_controlnet.py`: ControlNet pipeline with DiffusionSat metadata and conditional metadata support.

	## Setup

	The checkpoint folder (`ckpt/diffusionsat/`) should contain the standard diffusers components (unet, vae, scheduler, etc.). You can reference these pipeline files directly from this directory or copy them to your checkpoint folder.

	## Usage

	### 1. Text-to-Image Pipeline

	Use `pipeline_diffusionsat.py` for standard generation.

	```python
	import torch
	from diffusers import DiffusionPipeline

	# Load pipeline
	pipe = DiffusionPipeline.from_pretrained(
	"path/to/ckpt/diffusionsat",
	custom_pipeline="./custom_pipelines/pipeline_diffusionsat.py", # Path to this file
	torch_dtype=torch.float16,
	trust_remote_code=True,
	)
	pipe = pipe.to("cuda")

	# Optional: Metadata (normalized lat, lon, timestamp, GSD, etc.)
	# metadata = [0.5, -0.3, 0.7, 0.2, 0.1, 0.0, 0.5]

	# Generate
	image = pipe(
	"satellite image of farmland",
	metadata=None, # Optional
	num_inference_steps=30,
	).images[0]
	```

	### 2. ControlNet Pipeline

	Use `pipeline_diffusionsat_controlnet.py` for ControlNet generation.

	```python
	import torch
	from diffusers import DiffusionPipeline, ControlNetModel
	from diffusers.utils import load_image

	# 1. Load ControlNet
	controlnet = ControlNetModel.from_pretrained(
	"path/to/ckpt/diffusionsat/controlnet",
	torch_dtype=torch.float16
	)

	# 2. Load Pipeline with ControlNet
	pipe = DiffusionPipeline.from_pretrained(
	"path/to/ckpt/diffusionsat",
	controlnet=controlnet,
	custom_pipeline="./custom_pipelines/pipeline_diffusionsat_controlnet.py", # Path to this file
	torch_dtype=torch.float16,
	trust_remote_code=True,
	)
	pipe = pipe.to("cuda")

	# 3. Prepare Control Image
	control_image = load_image("path/to/conditioning_image.png")

	# 4. Generate
	# metadata: Target image metadata (optional)
	# cond_metadata: Conditioning image metadata (optional)

	image = pipe(
	"satellite image of farmland",
	image=control_image,
	metadata=None,
	cond_metadata=None,
	num_inference_steps=30,
	).images[0]
	```