|
|
--- |
|
|
library_name: transformers |
|
|
tags: |
|
|
- prime-rl |
|
|
- verifiers |
|
|
- prime-intellect |
|
|
- reinforcement-learning |
|
|
- reasoning |
|
|
- agentic |
|
|
- mixture-of-experts |
|
|
license: mit |
|
|
language: |
|
|
- en |
|
|
base_model: |
|
|
- zai-org/GLM-4.5-Air-Base |
|
|
pipeline_tag: text-generation |
|
|
--- |
|
|
|
|
|
# INTELLECT-3.1 |
|
|
|
|
|
<div align="center"> |
|
|
<img src="https://huggingface.co/PrimeIntellect/INTELLECT-3/resolve/main/banner.png" alt="Prime Intellect Logo" /> |
|
|
</div> |
|
|
|
|
|
<p align="center"> |
|
|
<strong>INTELLECT-3.1: A 100B+ MoE trained with large-scale RL</strong> |
|
|
<br><br> |
|
|
Trained with <a href="https://github.com/PrimeIntellect-ai/prime-rl">prime-rl</a> and <a href="https://github.com/PrimeIntellect-ai/verifiers">verifiers</a> |
|
|
<br> |
|
|
Environments released on <a href="https://app.primeintellect.ai/dashboard/environments">Environments Hub</a> |
|
|
<br> |
|
|
Read the <a href="https://primeintellect.ai/blog/intellect-3">Blog</a> & <a href="https://storage.googleapis.com/intellect-3-paper/INTELLECT_3_Technical_Report.pdf">Technical Report</a> |
|
|
<br> |
|
|
<a href="https://x.com/primeintellect">X</a> | <a href="https://discord.gg/RC5GvMbfDf">Discord</a> | <a href="https://app.primeintellect.ai/dashboard/create-cluster">Prime Intellect Platform</a> |
|
|
</p> |
|
|
|
|
|
## Introduction |
|
|
|
|
|
**INTELLECT-3.1** is a 106B (A12B) parameter Mixture-of-Experts reasoning model built as a continued training of [INTELLECT-3](https://huggingface.co/PrimeIntellect/INTELLECT-3) with additional reinforcement learning on math, coding, software engineering, and agentic tasks. |
|
|
|
|
|
Training was performed with [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) using environments built with the [verifiers](https://github.com/PrimeIntellect-ai/verifiers) library. |
|
|
All training and evaluation environments are available on the [Environments Hub](https://app.primeintellect.ai/dashboard/environments). |
|
|
|
|
|
The model, training frameworks, and environments are open-sourced under fully-permissive licenses (MIT and Apache 2.0). |
|
|
|
|
|
For more details, see the [technical report](https://storage.googleapis.com/intellect-3-paper/INTELLECT_3_Technical_Report.pdf). |
|
|
|
|
|
## Serving with vLLM |
|
|
|
|
|
The model can be served on 2x H200s: |
|
|
```bash |
|
|
vllm serve PrimeIntellect/INTELLECT-3.1 \ |
|
|
--tensor-parallel-size 2 \ |
|
|
--enable-auto-tool-choice \ |
|
|
--tool-call-parser qwen3_coder \ |
|
|
--reasoning-parser deepseek_r1 |
|
|
``` |
|
|
|
|
|
## Citation |
|
|
|
|
|
```bibtex |
|
|
@misc{intellect3.1, |
|
|
title={INTELLECT-3.1: Technical Report}, |
|
|
author={Prime Intellect Team}, |
|
|
year={2025}, |
|
|
url={https://huggingface.co/PrimeIntellect/INTELLECT-3.1} |
|
|
} |
|
|
``` |
|
|
|