Text Generation
Safetensors
English
llama
nielsr HF Staff commited on
Commit
1c376a5
·
verified ·
1 Parent(s): 42936df

Add model card

Browse files

This PR adds a model card, linking the model to the paper [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512) as well as adding the relevant metadata (license, pipeline tag, library name) for more discoverability.
Project page: https://huggingface.co/OctoThinker.

Files changed (1) hide show
  1. README.md +7 -0
README.md ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ library_name: transformers
4
+ pipeline_tag: text-generation
5
+ ---
6
+
7
+ This repository contains the model described in the paper [OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling](https://huggingface.co/papers/2506.20512).