license: apache-2.0
datasets:
  - PrimeIntellect/Intellect-2-RL-Dataset

INTELLECT-2

INTELLECT-2 is a 32-billion-parameter language model trained through a reinforcement learning run on globally distributed, permissionless GPU resources contributed by the community.

The model was trained with [prime-rl], a framework for distributed asynchronous RL, using GRPO over verifiable rewards with modifications for improved training stability. For details on our infrastructure and training recipe, see our technical report.
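
To give a rough sense of what "GRPO over verifiable rewards" means, the sketch below scores a group of sampled completions for one prompt with a binary checker and normalizes each reward against the group's mean and standard deviation. It illustrates the general GRPO idea only: `verifiable_reward`, `group_relative_advantages`, the exact-match check, the `eps` term, and the use of the population standard deviation are all assumptions made for illustration. It is not prime-rl code and does not include the stability modifications used for INTELLECT-2.

```python
from statistics import mean, pstdev


def verifiable_reward(completion: str, expected: str) -> float:
    """Hypothetical binary reward: 1.0 if the answer matches exactly, else 0.0."""
    return 1.0 if completion.strip() == expected.strip() else 0.0


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize each reward against the other samples drawn for the same prompt.

    Assumption: population standard deviation plus a small eps to avoid
    division by zero when every sample in the group gets the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# One prompt, four sampled completions, known correct answer "42".
completions = ["42", "41", " 42 ", "7"]
rewards = [verifiable_reward(c, "42") for c in completions]
advantages = group_relative_advantages(rewards)

for completion, reward, advantage in zip(completions, rewards, advantages):
    print(f"{completion!r}: reward={reward}, advantage={advantage:+.3f}")
```

Correct completions end up with positive advantages and incorrect ones with negative advantages, so the policy update favors samples that beat their own group. When every sample in a group receives the same reward, all advantages are zero and that prompt contributes no learning signal.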