guru-7B / README.md
nielsr's picture
nielsr HF Staff
Add model card, link to paper and code
b384baf verified
|
raw
history blame
366 Bytes
---
library_name: transformers
pipeline_tag: text-generation
license: cc-by-nc-4.0
---
This repository contains the Guru model presented in [Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective](https://huggingface.co/papers/2506.14965).
Project page: https://yanqval.github.io/PAE/
Code: https://github.com/Reasoning360/Reasoning360