Update README.md
Browse files
README.md
CHANGED
|
@@ -4,7 +4,11 @@ pipeline_tag: text-generation
|
|
| 4 |
license: cc-by-nc-4.0
|
| 5 |
---
|
| 6 |
|
| 7 |
-
This repository contains the Guru model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
|
| 8 |
|
| 9 |
-
|
| 10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 4 |
license: cc-by-nc-4.0
|
| 5 |
---
|
| 6 |
|
| 7 |
+
This repository contains the Guru-32B (base Qwen2.5-32B) model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
|
| 8 |
|
| 9 |
+
The score we evaluate with temperature=1.0, top_p=0.7.
|
| 10 |
+
|
| 11 |
+

|
| 12 |
+
|
| 13 |
+
|
| 14 |
+
Please refer to the paper for more details.
|