---
license: cc-by-nc-4.0
library_name: transformers
pipeline_tag: text-generation
---
This repository contains the model introduced in the paper "OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling".
Project page: https://huggingface.co/OctoThinker
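
Since the metadata lists `transformers` as the library and `text-generation` as the pipeline tag, loading the model might look roughly like the sketch below. It is a minimal illustration, not an official usage recipe: the helper names `generation_settings` and `generate`, the default of 256 new tokens, and greedy decoding are all assumptions, and `model_id` must be replaced with the actual repository ID of this model.

```python
from typing import Any, Dict


def generation_settings(max_new_tokens: int = 256) -> Dict[str, Any]:
    """Build keyword arguments for a text-generation call.

    The defaults here (256 new tokens, greedy decoding) are illustrative
    assumptions, not values recommended by the model authors.
    """
    if max_new_tokens <= 0:
        raise ValueError("max_new_tokens must be positive")
    return {"max_new_tokens": max_new_tokens, "do_sample": False}


def generate(model_id: str, prompt: str, max_new_tokens: int = 256) -> str:
    """Run one prompt through a transformers text-generation pipeline.

    `model_id` is a placeholder argument: pass the Hub ID of this repository.
    """
    # Imported lazily so the helpers above can be used without transformers installed.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_id)
    outputs = generator(prompt, **generation_settings(max_new_tokens))
    # The pipeline returns a list of dicts, one per generated sequence.
    return outputs[0]["generated_text"]
```

Calling `generate(model_id, "Question: What is 12 * 7?\nAnswer:")` with the real repository ID downloads the weights on first use and returns the prompt followed by the model's continuation.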