JTTJ
/

GradSPO

+# GradSPO: A Gradient Guidance Perspective on Stepwise Preference Optimization for Diffusion Models
+This repository provides **public LoRA checkpoints trained with GradSPO** for **Stable Diffusion v1.5** and **SDXL**.
+**GradSPO** reframes **stepwise preference optimization (SPO)** as learning from **noisy reward signals**, explicitly reducing this noise through **gradient guidance**. This results in **stronger reward signals** and achieves **improved preference alignment**.
+All released checkpoints are **LoRA weights only** and must be loaded on top of their corresponding base models.
+The official training code is available at:
+https://github.com/JoshuaTTJ/GradSPO
+---
+## Usage
+### SDXL (LoRA)
+```python
+from diffusers import StableDiffusionXLPipeline
+import torch
+pipe = StableDiffusionXLPipeline.from_pretrained(
+    "stabilityai/stable-diffusion-xl-base-1.0",
+    torch_dtype=torch.float16,
+)
+pipe.load_lora_weights("./sd1_5")
+pipe = pipe.to("cuda")
+prompt = "A cat holding a sign that says hello world"
+generator = torch.Generator(device="cuda").manual_seed(42)
+image = pipe(
+    prompt=prompt,
+    guidance_scale=5.0,
+    num_inference_steps=20,
+    generator=generator,
+    output_type="pil",
+).images[0]
+image.save("img_sdxl.png")
+```
+---
+### Stable Diffusion v1.5 (LoRA)
+```python
+from diffusers import StableDiffusionPipeline
+import torch
+pipe = StableDiffusionPipeline.from_pretrained(
+    "sd-legacy/stable-diffusion-v1-5",
+    torch_dtype=torch.float16,
+)
+pipe.load_lora_weights("./sdxl")
+pipe = pipe.to("cuda")
+prompt = "a photo of a cat"
+generator = torch.Generator(device="cuda").manual_seed(42)
+image = pipe(
+    prompt=prompt,
+    guidance_scale=5.0,
+    num_inference_steps=20,
+    generator=generator,
+    output_type="pil",
+).images[0]
+image.save("img_sd15.png")
+```
+---
+## Citation
+If you find GradSPO useful in your research, please consider citing our work:
+```bibtex
+@inproceedings{
+tee2025a,
+title={A Gradient Guidance Perspective on Stepwise Preference Optimization for Diffusion Models},
+author={Joshua Tian Jin Tee and Hee Suk Yoon and Abu Hanif Muhammad Syarubany and Eunseop Yoon and Chang D. Yoo},
+booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
+year={2025},
+url={https://openreview.net/forum?id=d6lIOnvOX2}
+}
+```