Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -14,14 +14,5 @@ Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
|
|
| 14 |
https://arxiv.org/abs/2505.11595
|
| 15 |
|
| 16 |
Please cite
|
| 17 |
-
|
| 18 |
-
@inproceedings{
|
| 19 |
-
chen2025spectral,
|
| 20 |
-
title={Spectral Policy Optimization: Coloring your Incorrect Reasoning in {GRPO}},
|
| 21 |
-
author={Peter Chen and Xiaopeng Li and Ziniu Li and Xi Chen and Tianyi Lin},
|
| 22 |
-
booktitle={2nd AI for Math Workshop @ ICML 2025},
|
| 23 |
-
year={2025},
|
| 24 |
-
url={https://openreview.net/forum?id=IIBDElbi7s}
|
| 25 |
-
}
|
| 26 |
-
'''
|
| 27 |
-PLC
|
|
|
|
| 14 |
https://arxiv.org/abs/2505.11595
|
| 15 |
|
| 16 |
Please cite
|
| 17 |
+
<pre lang="bibtex"><code>@inproceedings{chen2025spectral, title = {Spectral Policy Optimization: Coloring your Incorrect Reasoning in {GRPO}}, author = {Peter Chen and Xiaopeng Li and Ziniu Li and Xi Chen and Tianyi Lin}, booktitle = {2nd AI for Math Workshop @ ICML 2025}, year = {2025}, url = {https://openreview.net/forum?id=IIBDElbi7s} }</code></pre>
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
-PLC
|