openbmb
/

RLHF-V

@@ -48,10 +48,10 @@ More resistant to over-generalization, even compared to GPT-4V:
 If you find RLHF-V is useful in your work, please cite it with:
 ```
-@article{2023rlhf-v,
-  author      = {Tianyu Yu and Yuan Yao and Haoye Zhang and Taiwen He and Yifeng Han and Ganqu Cui and Jinyi Hu and Zhiyuan Liu and Hai-Tao Zheng and Maosong Sun and Tat-Seng Chua},
-  title       = {RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback},
-  journal      = {arxiv},
-  year         = {2023},
 }
 ```

 If you find RLHF-V is useful in your work, please cite it with:
 ```
+@article{yu2023rlhf,
+  title={Rlhf-v: Towards trustworthy mllms via behavior alignment from fine-grained correctional human feedback},
+  author={Yu, Tianyu and Yao, Yuan and Zhang, Haoye and He, Taiwen and Han, Yifeng and Cui, Ganqu and Hu, Jinyi and Liu, Zhiyuan and Zheng, Hai-Tao and Sun, Maosong and others},
+  journal={arXiv preprint arXiv:2312.00849},
+  year={2023}
 }
 ```