zhoujun commited on
Commit
8868213
·
verified ·
1 Parent(s): 73ccb4d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -4,7 +4,11 @@ pipeline_tag: text-generation
4
  license: cc-by-nc-4.0
5
  ---
6
 
7
- This repository contains the Guru model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
8
 
9
- Project page: https://yanqval.github.io/PAE/
10
- Code: https://github.com/Reasoning360/Reasoning360
 
 
 
 
 
4
  license: cc-by-nc-4.0
5
  ---
6
 
7
+ This repository contains the Guru-32B (base Qwen2.5-32B) model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
8
 
9
+ The score we evaluate with temperature=1.0, top_p=0.7.
10
+
11
+ ![Leaderboard](./figures/leaderboard.png)
12
+
13
+
14
+ Please refer to the paper for more details.