Update README.md
README.md
CHANGED
@@ -22,8 +22,8 @@ This repository provides trained checkpoints for reward modeling and user-level
 
 ## Links
 
-- 📄 **arXiv Paper**: https://arxiv.org/abs/
-- 🤗 **Hugging Face Paper**: https://huggingface.co/papers/
+- 📄 **arXiv Paper**: https://arxiv.org/abs/2601.18731
+- 🤗 **Hugging Face Paper**: https://huggingface.co/papers/2601.18731
 - 💻 **GitHub Code**: https://github.com/ModalityDance/MRM
 - 📦 **Hugging Face Collection**: https://huggingface.co/collections/ModalityDance/mrm
 
@@ -155,11 +155,14 @@ print("reward(rejected)=", s_rj.tolist())
 If you use this model or code in your research, please cite:
 
 ```bibtex
-@
-
-
-
-
+@misc{cai2026adaptsanymetareward,
+title={One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment},
+author={Hongru Cai and Yongqi Li and Tiezheng Yu and Fengbin Zhu and Wenjie Wang and Fuli Feng and Wenjie Li},
+year={2026},
+eprint={2601.18731},
+archivePrefix={arXiv},
+primaryClass={cs.CL},
+url={https://arxiv.org/abs/2601.18731},
 }
 ```
 