Training Diffusion Models with Reinforcement Learning
Paper
• 2305.13301 • Published
• 5
This repository contains checkpoints for Stable Diffusion v1.5 fine-tuned using Denoising Diffusion Policy Optimization (DDPO).
We provide checkpoints every 10 epochs plus the final epoch (98). BRISQUE is a no-reference image quality metric where lower values typically indicate higher spatial quality/less distortion.