LRM / README.md

casiatao

Update README.md

dbda79c verified 19 days ago

preview code

raw

history blame contribute delete

810 Bytes

metadata

license: mit
library_name: diffusers
pipeline_tag: text-to-image
datasets:
  - pickapic-anonymous/pickapic_v1

This repository contains the Latent Reward Models (LRM) based on SD1.5 and SDXL. The corresponding github repository is https://github.com/Kwai-Kolors/LPO.

❤️ Citation

If you find this repository helpful, please consider giving it a like ❤️ and citing:

@article{zhang2025diffusion,
  title={Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization},
  author={Zhang, Tao and Da, Cheng and Ding, Kun and Jin, Kun and Li, Yan and Gao, Tingting and Zhang, Di and Xiang, Shiming and Pan, Chunhong},
  journal={arXiv preprint arXiv:2502.01051},
  year={2025}
}