Llama3.2-3B-Zero / README.md
nielsr's picture
nielsr HF Staff
Add model card
44456d0 verified
|
raw
history blame
279 Bytes
metadata
library_name: transformers
pipeline_tag: text-generation

This repository contains the models described in OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling.

Project page: https://huggingface.co/OctoThinker