Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
|
@@ -13,8 +13,14 @@ Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
|
|
| 13 |
|
| 14 |
https://arxiv.org/abs/2505.11595
|
| 15 |
|
| 16 |
-
Please cite
|
| 17 |
-
|
| 18 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 19 |
|
| 20 |
-PLC
|
|
|
|
| 13 |
|
| 14 |
https://arxiv.org/abs/2505.11595
|
| 15 |
|
| 16 |
+
Please cite
|
| 17 |
+
```bibtex
|
| 18 |
+
@inproceedings{chen2025spectral,
|
| 19 |
+
title = {Spectral Policy Optimization: Coloring your Incorrect Reasoning in {GRPO}},
|
| 20 |
+
author = {Peter Chen and Xiaopeng Li and Ziniu Li and Xi Chen and Tianyi Lin},
|
| 21 |
+
booktitle = {2nd AI for Math Workshop @ ICML 2025},
|
| 22 |
+
year = {2025},
|
| 23 |
+
url = {https://openreview.net/forum?id=IIBDElbi7s}
|
| 24 |
+
}
|
| 25 |
|
| 26 |
-PLC
|