Update README.md
README.md
CHANGED
@@ -141,5 +141,13 @@ SP3F-7B is a multilingual model trained with Self-Play with Privileged Pairwise
 If you find this work helpful please use the following to cite our work.
 
 ```
-
+@misc{sutawika2026gainedtranslationprivilegedpairwise,
+title={Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning},
+author={Lintang Sutawika and Gokul Swamy and Zhiwei Steven Wu and Graham Neubig},
+year={2026},
+eprint={2601.18722},
+archivePrefix={arXiv},
+primaryClass={cs.CL},
+url={https://arxiv.org/abs/2601.18722},
+}
 ```