guru-7B / README.md
nielsr's picture
nielsr HF Staff
Add model card, link to paper and code
b384baf verified
|
raw
history blame
366 Bytes
metadata
library_name: transformers
pipeline_tag: text-generation
license: cc-by-nc-4.0

This repository contains the Guru model presented in Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective.

Project page: https://yanqval.github.io/PAE/ Code: https://github.com/Reasoning360/Reasoning360