Text Generation
Safetensors
English
llama
nielsr HF Staff commited on
Commit
300c35b
·
verified ·
1 Parent(s): 142f155

Add model card

Browse files

This PR adds a model card for the paper [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512).
It sets the pipeline_tag, library_name and license.

Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ ---
6
+
7
+ This repository contains the model introduced in [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512).
8
+
9
+ Project page: https://huggingface.co/OctoThinker