neulab/SP3F-7B

@@ -141,5 +141,13 @@ SP3F-7B is a multilingual model trained with Self-Play with Privileged Pairwise
 If you find this work helpful please use the following to cite our work.
 ```
+@misc{sutawika2026gainedtranslationprivilegedpairwise,
+      title={Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning},
+      author={Lintang Sutawika and Gokul Swamy and Zhiwei Steven Wu and Graham Neubig},
+      year={2026},
+      eprint={2601.18722},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2601.18722},
+}
 ```