Text Classification
Transformers
HongruCai commited on
Commit
765a9f5
·
verified ·
1 Parent(s): 3bb8e46

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -7
README.md CHANGED
@@ -23,8 +23,8 @@ This repository provides trained checkpoints for reward modeling and user-level
23
 
24
  ## Links
25
 
26
- - 📄 **arXiv Paper**: https://arxiv.org/abs/XXXX.XXXXX
27
- - 🤗 **Hugging Face Paper**: https://huggingface.co/papers/XXXX.XXXXX
28
  - 💻 **GitHub Code**: https://github.com/ModalityDance/MRM
29
  - 📦 **Hugging Face Collection**: https://huggingface.co/collections/ModalityDance/mrm
30
 
@@ -157,11 +157,14 @@ print("reward(rejected)=", s_rj.tolist())
157
  If you use this model or code in your research, please cite:
158
 
159
  ```bibtex
160
- @article{mrm2025,
161
- title = {Meta Reward Modeling for Personalized Alignment},
162
- author = {Author Names},
163
- journal = {arXiv preprint arXiv:XXXX.XXXXX},
164
- year = {2025}
 
 
 
165
  }
166
  ```
167
 
 
23
 
24
  ## Links
25
 
26
+ - 📄 **arXiv Paper**: https://arxiv.org/abs/2601.18731
27
+ - 🤗 **Hugging Face Paper**: https://huggingface.co/papers/2601.18731
28
  - 💻 **GitHub Code**: https://github.com/ModalityDance/MRM
29
  - 📦 **Hugging Face Collection**: https://huggingface.co/collections/ModalityDance/mrm
30
 
 
157
  If you use this model or code in your research, please cite:
158
 
159
  ```bibtex
160
+ @misc{cai2026adaptsanymetareward,
161
+ title={One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment},
162
+ author={Hongru Cai and Yongqi Li and Tiezheng Yu and Fengbin Zhu and Wenjie Wang and Fuli Feng and Wenjie Li},
163
+ year={2026},
164
+ eprint={2601.18731},
165
+ archivePrefix={arXiv},
166
+ primaryClass={cs.CL},
167
+ url={https://arxiv.org/abs/2601.18731},
168
  }
169
  ```
170