metadata
library_name: transformers
pipeline_tag: text-generation
license: cc-by-nc-4.0
This repository contains the Guru model presented in Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective.
Project page: https://yanqval.github.io/PAE/ Code: https://github.com/Reasoning360/Reasoning360