Text Classification
HongruCai commited on
Commit
077cbcb
·
verified ·
1 Parent(s): f15549f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -7
README.md CHANGED
@@ -22,8 +22,8 @@ This repository provides trained checkpoints for reward modeling and user-level
22
 
23
  ## Links
24
 
25
- - 📄 **arXiv Paper**: https://arxiv.org/abs/XXXX.XXXXX
26
- - 🤗 **Hugging Face Paper**: https://huggingface.co/papers/XXXX.XXXXX
27
  - 💻 **GitHub Code**: https://github.com/ModalityDance/MRM
28
  - 📦 **Hugging Face Collection**: https://huggingface.co/collections/ModalityDance/mrm
29
 
@@ -155,11 +155,14 @@ print("reward(rejected)=", s_rj.tolist())
155
  If you use this model or code in your research, please cite:
156
 
157
  ```bibtex
158
- @article{mrm2025,
159
- title = {Meta Reward Modeling for Personalized Alignment},
160
- author = {Author Names},
161
- journal = {arXiv preprint arXiv:XXXX.XXXXX},
162
- year = {2025}
 
 
 
163
  }
164
  ```
165
 
 
22
 
23
  ## Links
24
 
25
+ - 📄 **arXiv Paper**: https://arxiv.org/abs/2601.18731
26
+ - 🤗 **Hugging Face Paper**: https://huggingface.co/papers/2601.18731
27
  - 💻 **GitHub Code**: https://github.com/ModalityDance/MRM
28
  - 📦 **Hugging Face Collection**: https://huggingface.co/collections/ModalityDance/mrm
29
 
 
155
  If you use this model or code in your research, please cite:
156
 
157
  ```bibtex
158
+ @misc{cai2026adaptsanymetareward,
159
+ title={One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment},
160
+ author={Hongru Cai and Yongqi Li and Tiezheng Yu and Fengbin Zhu and Wenjie Wang and Fuli Feng and Wenjie Li},
161
+ year={2026},
162
+ eprint={2601.18731},
163
+ archivePrefix={arXiv},
164
+ primaryClass={cs.CL},
165
+ url={https://arxiv.org/abs/2601.18731},
166
  }
167
  ```
168