LLM360
/

guru-32B

@@ -1,7 +1,7 @@
 ---
 library_name: transformers
-pipeline_tag: text-generation
 license: cc-by-nc-4.0
 ---
 This repository contains the Guru-32B (base Qwen2.5-32B) model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
@@ -29,7 +29,7 @@ The leaderboard is evaluated with our evaluation [code](https://github.com/LLM36
 |                | LiveBench                     |       18.57 |                   19.76  |       12.64 |          15.20 |        34.30 |         28.78 |            28.33 |
 |                | **Average Score**              | **43.29**   | **33.76**               | **35.42**   | **33.97**      | **54.24**    | **47.53**     | **46.25**        |
 Example usage:
 ```python
@@ -45,4 +45,4 @@ outputs = model.generate(prompt, max_new_tokens=256, temperature=1.0, top_p=0.7)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
-Please refer to the [paper](https://arxiv.org/abs/2506.14965) for more details.

 ---
 library_name: transformers
 license: cc-by-nc-4.0
+pipeline_tag: text-generation
 ---
 This repository contains the Guru-32B (base Qwen2.5-32B) model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
 |                | LiveBench                     |       18.57 |                   19.76  |       12.64 |          15.20 |        34.30 |         28.78 |            28.33 |
 |                | **Average Score**              | **43.29**   | **33.76**               | **35.42**   | **33.97**      | **54.24**    | **47.53**     | **46.25**        |
+Project page: [Project page URL]
 Example usage:
 ```python
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
+Please refer to the [paper](https://arxiv.org/abs/2506.14965) for more details.