Update README.md
Browse files
README.md
CHANGED
|
@@ -8,12 +8,13 @@ datasets:
|
|
| 8 |
|
| 9 |
This repository contains the [Latent Reward Models (LRM)](https://github.com/Kwai-Kolors/LPO) based on SD1.5 and SDXL. The corresponding github repository is [https://github.com/Kwai-Kolors/LPO](https://github.com/Kwai-Kolors/LPO).
|
| 10 |
|
| 11 |
-
❤️ Citation
|
| 12 |
If you find this repository helpful, please consider giving it a like ❤️ and citing:
|
| 13 |
-
|
| 14 |
@article{zhang2025diffusion,
|
| 15 |
title={Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization},
|
| 16 |
author={Zhang, Tao and Da, Cheng and Ding, Kun and Jin, Kun and Li, Yan and Gao, Tingting and Zhang, Di and Xiang, Shiming and Pan, Chunhong},
|
| 17 |
journal={arXiv preprint arXiv:2502.01051},
|
| 18 |
year={2025}
|
| 19 |
-
}
|
|
|
|
|
|
| 8 |
|
| 9 |
This repository contains the [Latent Reward Models (LRM)](https://github.com/Kwai-Kolors/LPO) based on SD1.5 and SDXL. The corresponding github repository is [https://github.com/Kwai-Kolors/LPO](https://github.com/Kwai-Kolors/LPO).
|
| 10 |
|
| 11 |
+
## ❤️ Citation
|
| 12 |
If you find this repository helpful, please consider giving it a like ❤️ and citing:
|
| 13 |
+
```bibtex
|
| 14 |
@article{zhang2025diffusion,
|
| 15 |
title={Diffusion Model as a Noise-Aware Latent Reward Model for Step-Level Preference Optimization},
|
| 16 |
author={Zhang, Tao and Da, Cheng and Ding, Kun and Jin, Kun and Li, Yan and Gao, Tingting and Zhang, Di and Xiang, Shiming and Pan, Chunhong},
|
| 17 |
journal={arXiv preprint arXiv:2502.01051},
|
| 18 |
year={2025}
|
| 19 |
+
}
|
| 20 |
+
```
|