Text Generation
Safetensors
qwen2
conversational
lintang commited on
Commit
11956d2
·
verified ·
1 Parent(s): 6267ccb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -141,5 +141,13 @@ SP3F-7B is a multilingual model trained with Self-Play with Privileged Pairwise
141
  If you find this work helpful please use the following to cite our work.
142
 
143
  ```
144
-
 
 
 
 
 
 
 
 
145
  ```
 
141
  If you find this work helpful please use the following to cite our work.
142
 
143
  ```
144
+ @misc{sutawika2026gainedtranslationprivilegedpairwise,
145
+ title={Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning},
146
+ author={Lintang Sutawika and Gokul Swamy and Zhiwei Steven Wu and Graham Neubig},
147
+ year={2026},
148
+ eprint={2601.18722},
149
+ archivePrefix={arXiv},
150
+ primaryClass={cs.CL},
151
+ url={https://arxiv.org/abs/2601.18722},
152
+ }
153
  ```