Add model card
#1
by nielsr HF Staff - opened
This PR adds a model card for the OctoThinker model, linking it to the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.
This PR adds a model card for the OctoThinker model, linking it to the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.