casiatao/LRM

casiatao committed on Dec 15, 2025

Commit dbda79c · verified · 1 Parent(s): b9c4a25

Update README.md

Files changed (1)

README.md CHANGED

@@ -8,12 +8,13 @@ datasets:
 This repository contains the [Latent Reward Models (LRM)](https://github.com/Kwai-Kolors/LPO) based on SD1.5 and SDXL. The corresponding github repository is [https://github.com/Kwai-Kolors/LPO](https://github.com/Kwai-Kolors/LPO).
-❤️ Citation
+## ❤️ Citation
 If you find this repository helpful, please consider giving it a like ❤️ and citing:
+```bibtex
 @article{zhang2025diffusion,
   title={Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization},
   author={Zhang, Tao and Da, Cheng and Ding, Kun and Jin, Kun and Li, Yan and Gao, Tingting and Zhang, Di and Xiang, Shiming and Pan, Chunhong},
   journal={arXiv preprint arXiv:2502.01051},
   year={2025}
-}
+}
+```