|
|
--- |
|
|
license: mit |
|
|
library_name: diffusers |
|
|
pipeline_tag: text-to-image |
|
|
datasets: |
|
|
- pickapic-anonymous/pickapic_v1 |
|
|
--- |
|
|
|
|
|
This repository contains the [Latent Reward Models (LRM)](https://github.com/Kwai-Kolors/LPO) based on SD1.5 and SDXL. The corresponding github repository is [https://github.com/Kwai-Kolors/LPO](https://github.com/Kwai-Kolors/LPO). |
|
|
|
|
|
## ❤️ Citation |
|
|
If you find this repository helpful, please consider giving it a like ❤️ and citing: |
|
|
```bibtex |
|
|
@article{zhang2025diffusion, |
|
|
title={Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization}, |
|
|
author={Zhang, Tao and Da, Cheng and Ding, Kun and Jin, Kun and Li, Yan and Gao, Tingting and Zhang, Di and Xiang, Shiming and Pan, Chunhong}, |
|
|
journal={arXiv preprint arXiv:2502.01051}, |
|
|
year={2025} |
|
|
} |
|
|
``` |