Text Generation
Safetensors
English
llama
nielsr HF Staff commited on
Commit
7f86f06
·
verified ·
1 Parent(s): 2e52c7f

Add model card

Browse files

This PR adds a model card for the model presented in [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512). It includes relevant metadata (library name, pipeline tag, license) and links to the paper and project page.

Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ ---
6
+
7
+ This repository contains the model described in the paper [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512).
8
+
9
+ Project page: https://huggingface.co/OctoThinker