---
license: apache-2.0
datasets:
- PrimeIntellect/Intellect-2-RL-Dataset
---
# INTELLECT-2
INTELLECT-2 is a 32-billion-parameter language model trained through a reinforcement learning run on globally distributed, permissionless GPU resources contributed by the community.
The model was trained with [prime-rl], a framework for distributed asynchronous RL. Training applied GRPO over verifiable rewards, with modifications for improved training stability. For details on our infrastructure and training recipe, see our [technical report](link).
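
To make "GRPO over verifiable rewards" concrete, here is a minimal sketch of the group-relative advantage computation in the generic GRPO formulation. It is not the prime-rl implementation and does not include the stability modifications mentioned above. The function names, the exact-match reward, and the use of the population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def verifiable_reward(completion: str, reference: str) -> float:
    """Hypothetical binary reward: 1.0 if the answer matches the reference exactly."""
    return 1.0 if completion.strip() == reference.strip() else 0.0


def group_relative_advantages(rewards: list[float], eps: float = 1e-6) -> list[float]:
    """Normalize each reward against the other samples drawn for the same prompt.

    Generic GRPO-style advantage: (r_i - mean) / (std + eps).
    Using the population standard deviation is an assumption of this sketch.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# One prompt, a group of four sampled completions (illustrative data only).
completions = ["42", "41", " 42 ", "7"]
rewards = [verifiable_reward(c, "42") for c in completions]
advantages = group_relative_advantages(rewards)

for completion, reward, advantage in zip(completions, rewards, advantages):
    print(f"{completion!r:8} reward={reward:.1f} advantage={advantage:+.3f}")
```

Completions that beat the group average receive a positive advantage and the rest a negative one, so no separate value model is needed. When every completion in a group earns the same reward, all advantages are zero and that group contributes no learning signal.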
![image/png](https://cdn-uploads.huggingface.co/production/uploads/64a32edf17b9f57eaec2ea65/0NFEBL9eAObkU4IQ_hAo0.png)