Add project page URL
#1
by
nielsr
HF Staff
- opened
README.md
CHANGED
|
@@ -1,7 +1,7 @@
|
|
| 1 |
---
|
| 2 |
library_name: transformers
|
| 3 |
-
pipeline_tag: text-generation
|
| 4 |
license: cc-by-nc-4.0
|
|
|
|
| 5 |
---
|
| 6 |
|
| 7 |
This repository contains the Guru-32B (base Qwen2.5-32B) model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
|
|
@@ -29,7 +29,7 @@ The leaderboard is evaluated with our evaluation [code](https://github.com/LLM36
|
|
| 29 |
| | LiveBench | 18.57 | 19.76 | 12.64 | 15.20 | 34.30 | 28.78 | 28.33 |
|
| 30 |
| | **Average Score** | **43.29** | **33.76** | **35.42** | **33.97** | **54.24** | **47.53** | **46.25** |
|
| 31 |
|
| 32 |
-
|
| 33 |
|
| 34 |
Example usage:
|
| 35 |
```python
|
|
@@ -45,4 +45,4 @@ outputs = model.generate(prompt, max_new_tokens=256, temperature=1.0, top_p=0.7)
|
|
| 45 |
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
| 46 |
```
|
| 47 |
|
| 48 |
-
Please refer to the [paper](https://arxiv.org/abs/2506.14965) for more details.
|
|
|
|
| 1 |
---
|
| 2 |
library_name: transformers
|
|
|
|
| 3 |
license: cc-by-nc-4.0
|
| 4 |
+
pipeline_tag: text-generation
|
| 5 |
---
|
| 6 |
|
| 7 |
This repository contains the Guru-32B (base Qwen2.5-32B) model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
|
|
|
|
| 29 |
| | LiveBench | 18.57 | 19.76 | 12.64 | 15.20 | 34.30 | 28.78 | 28.33 |
|
| 30 |
| | **Average Score** | **43.29** | **33.76** | **35.42** | **33.97** | **54.24** | **47.53** | **46.25** |
|
| 31 |
|
| 32 |
+
Project page: [Project page URL]
|
| 33 |
|
| 34 |
Example usage:
|
| 35 |
```python
|
|
|
|
| 45 |
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
|
| 46 |
```
|
| 47 |
|
| 48 |
+
Please refer to the [paper](https://arxiv.org/abs/2506.14965) for more details.
|