OctoThinker
/

OctoThinker-3B-Short-Zero

Text Generation

Model card Files Files and versions

OctoThinker-3B-Short-Zero / README.md

nielsr's picture

nielsr HF Staff

Add model card

7f86f06 verified 8 months ago

|

308 Bytes

license: apache-2.0
library_name: transformers
pipeline_tag: text-generation

This repository contains the model described in the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.

Project page: https://huggingface.co/OctoThinker