HongruCai committed
Commit c900e8e · verified · 1 Parent(s): 77fa4b8

Update README.md

Files changed (1)
  1. README.md +10 -7
README.md CHANGED
@@ -25,8 +25,8 @@ This repository provides trained checkpoints for reward modeling and user-level
 
  ## Links
 
- - 📄 **arXiv Paper**: https://arxiv.org/abs/XXXX.XXXXX
- - 🤗 **Hugging Face Paper**: https://huggingface.co/papers/XXXX.XXXXX
+ - 📄 **arXiv Paper**: https://www.arxiv.org/abs/2601.18731
+ - 🤗 **Hugging Face Paper**: https://huggingface.co/papers/2601.18731
  - 💻 **GitHub Code**: https://github.com/ModalityDance/MRM
  - 📦 **Hugging Face Collection**: https://huggingface.co/collections/ModalityDance/mrm
 
@@ -159,11 +159,14 @@ print("reward(rejected)=", s_rj.tolist())
  If you use this model or code in your research, please cite:
 
  ```bibtex
- @article{mrm2025,
- title = {Meta Reward Modeling for Personalized Alignment},
- author = {Author Names},
- journal = {arXiv preprint arXiv:XXXX.XXXXX},
- year = {2025}
+ @misc{cai2026adaptsanymetareward,
+ title={One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment},
+ author={Hongru Cai and Yongqi Li and Tiezheng Yu and Fengbin Zhu and Wenjie Wang and Fuli Feng and Wenjie Li},
+ year={2026},
+ eprint={2601.18731},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL},
+ url={https://arxiv.org/abs/2601.18731},
  }
  ```
 