Text Generation
Safetensors
English
llama
nielsr HF Staff commited on
Commit
e833e09
·
verified ·
1 Parent(s): a49c6b1

Add model card

Browse files

This PR adds a model card for the model presented in [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512) paper.
The PR also adds the appropriate tags for license, library, and pipeline.

Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ ---
6
+
7
+ This repository contains the model presented in [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512).
8
+
9
+ Project page: https://huggingface.co/OctoThinker