Text Generation
Safetensors
English
llama

Add model card

#1
by nielsr HF Staff - opened

This PR adds a model card by linking it to the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.

It also adds a project page link.

Please review and merge this PR if everything looks good.

Cannot merge
This branch has merge conflicts in the following files:
  • README.md

Sign up or log in to comment