PeterLauLukCh commited on
Commit
c12507e
·
verified ·
1 Parent(s): 2bed3fa

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -11,5 +11,6 @@ This repo contains all the models for paper -
11
 
12
  Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
13
 
 
14
 
15
  -PLC
 
11
 
12
  Spectral Policy Optimization: Coloring your Incorrect Reasoning in GRPO
13
 
14
+ https://arxiv.org/abs/2505.11595
15
 
16
  -PLC