casiatao
/

LRM

Model card Files Files and versions

LRM / README.md

casiatao's picture

Update README.md

dbda79c verified 26 days ago

|

history blame contribute delete

810 Bytes

	---
	license: mit
	library_name: diffusers
	pipeline_tag: text-to-image
	datasets:
	- pickapic-anonymous/pickapic_v1
	---

	This repository contains the [Latent Reward Models (LRM)](https://github.com/Kwai-Kolors/LPO) based on SD1.5 and SDXL. The corresponding github repository is [https://github.com/Kwai-Kolors/LPO](https://github.com/Kwai-Kolors/LPO).

	## ❤️ Citation
	If you find this repository helpful, please consider giving it a like ❤️ and citing:
	```bibtex
	@article{zhang2025diffusion,
	title={Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization},
	author={Zhang, Tao and Da, Cheng and Ding, Kun and Jin, Kun and Li, Yan and Gao, Tingting and Zhang, Di and Xiang, Shiming and Pan, Chunhong},
	journal={arXiv preprint arXiv:2502.01051},
	year={2025}
	}
	```