OctoThinker/Llama3.2-3B-Zero


---
library_name: transformers
pipeline_tag: text-generation
---

This repository contains the models described in the paper "OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling".

Project page: https://huggingface.co/OctoThinker
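
The metadata declares `transformers` as the library and `text-generation` as the pipeline, so loading the checkpoint could look roughly like the sketch below. It is an illustration under assumptions, not documented usage: the repository id `OctoThinker/Llama3.2-3B-Zero` is taken from this page's title, `generation_kwargs` and `generate` are hypothetical helper names, and the greedy decoding settings and plain-text prompt are placeholders, since the card does not specify a prompt format or recommended sampling parameters.

```python
# Repository id assumed from the page title; adjust if the checkpoint lives elsewhere.
MODEL_ID = "OctoThinker/Llama3.2-3B-Zero"


def generation_kwargs(max_new_tokens: int = 256) -> dict:
    """Hypothetical helper: greedy decoding settings chosen for illustration only."""
    return {"max_new_tokens": max_new_tokens, "do_sample": False}


def generate(prompt: str, max_new_tokens: int = 256) -> str:
    """Load the checkpoint and return the model's continuation of `prompt`."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype="auto")

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, **generation_kwargs(max_new_tokens))

    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("Question: What is 12 * 7?\nAnswer:")` would download the weights on first use and return the generated continuation as a string. The model is reloaded on every call here for brevity; a longer-lived script would load it once and reuse it.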