Spaces:
Running
Running
| title: README | |
| emoji: π | |
| colorFrom: gray | |
| colorTo: indigo | |
| sdk: static | |
| pinned: false | |
| π OctoThinker is led by [GAIR](https://huggingface.co/GAIR) | |
| π― Our Goal: To reshape the pre-training trajectory so models scale better under RL. | |
| Check our [*technical report*](https://arxiv.org/abs/2506.20512v1) for more details! | |
|  | |