Text Generation
Safetensors
English
llama
nielsr HF Staff committed on
Commit b72eafc · verified · 1 Parent(s): 8576df1

Add model card


This PR adds a model card for the OctoThinker model, linking it to the paper [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512).

Files changed (1)
  1. README.md +11 -0
README.md ADDED
@@ -0,0 +1,11 @@
+ ---
+ license: cc-by-nc-4.0
+ library_name: transformers
+ pipeline_tag: text-generation
+ ---
+
+ This repository contains the OctoThinker model from the paper [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512).
+
+ Code: https://github.com/AI4Math/OctoThinker
+
+ Project Page: https://huggingface.co/OctoThinker
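
The front matter added above is what tells the Hub which library and pipeline task the model belongs to. As a rough illustration of how those three keys might be consumed, here is a minimal Python sketch. It parses only flat `key: value` pairs with the standard library (a real YAML parser would be the safer choice for richer cards), and the helper names `parse_front_matter` and `load_generator` are hypothetical. The diff does not name the model's repository ID, so `repo_id` is left as a caller-supplied parameter rather than guessed.

```python
# Minimal sketch: read the flat YAML front matter of the model card added in this PR.
# Assumption: the metadata block contains only simple `key: value` lines.

CARD = """---
license: cc-by-nc-4.0
library_name: transformers
pipeline_tag: text-generation
---

This repository contains the OctoThinker model.
"""


def parse_front_matter(card_text: str) -> dict[str, str]:
    """Return the `key: value` pairs found between the leading '---' delimiters."""
    lines = card_text.strip().splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # the card has no front matter block
    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return metadata  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    return {}  # the block was never closed, so treat it as invalid


def load_generator(repo_id: str, metadata: dict[str, str]):
    """Hypothetical helper: build a transformers pipeline for the declared task.

    `repo_id` must be supplied by the caller; it is not stated in the diff.
    Calling this requires the `transformers` package and downloads the weights.
    """
    from transformers import pipeline  # imported lazily; only needed when called

    return pipeline(task=metadata["pipeline_tag"], model=repo_id)


metadata = parse_front_matter(CARD)
print(metadata)
```

The parser runs on its own without any third-party packages. `load_generator` is only defined here, not called, since it needs network access and a concrete repository ID.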