Text Generation
Safetensors
English
llama
---
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
---

This repository contains the model described in the paper OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.

Project page: https://huggingface.co/OctoThinker
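
The metadata lists `transformers` as the library and `text-generation` as the pipeline tag, so loading the checkpoint might look like the minimal sketch below. The repository ID `OctoThinker/<model-name>` is a hypothetical placeholder (this card does not state it), and the helper names `generation_settings` and `generate_text`, the greedy-decoding settings, and the example prompt are illustrative assumptions rather than settings recommended by the authors.

```python
# Hypothetical placeholder: replace with the actual repository ID of this checkpoint.
MODEL_ID = "OctoThinker/<model-name>"


def generation_settings(max_new_tokens: int = 128) -> dict:
    """Return illustrative greedy-decoding settings for the pipeline call."""
    if max_new_tokens <= 0:
        raise ValueError("max_new_tokens must be positive")
    return {"max_new_tokens": max_new_tokens, "do_sample": False}


def generate_text(prompt: str, model_id: str = MODEL_ID, max_new_tokens: int = 128) -> str:
    """Generate a continuation of `prompt` with the given checkpoint."""
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    generator = pipeline("text-generation", model=model_id)
    outputs = generator(prompt, **generation_settings(max_new_tokens))
    # The pipeline returns a list of dicts; the text includes the prompt by default.
    return outputs[0]["generated_text"]


if __name__ == "__main__":
    # Running this downloads the weights, so MODEL_ID must be set to a real repository first.
    print(generate_text("Question: What is 12 * 7?\nAnswer:"))
```

Sampling is disabled here only to keep the output reproducible; other decoding settings may suit the model better.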