LLM360
/

guru-7B

@@ -4,7 +4,11 @@ pipeline_tag: text-generation
 license: cc-by-nc-4.0
 ---
-This repository contains the Guru model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
-Project page: https://yanqval.github.io/PAE/
-Code: https://github.com/Reasoning360/Reasoning360

 license: cc-by-nc-4.0
 ---
+This repository contains the Guru-32B (base Qwen2.5-32B) model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
+The score we evaluate with temperature=1.0, top_p=0.7.
+![Leaderboard](./figures/leaderboard.png)
+Please refer to the paper for more details.