README.md · PrimeIntellect/INTELLECT-3 at 4841b2ed22aaa4c3add1181cc6060458045a0ea7

INTELLECT-3 / README.md

willcb

Update README.md

4841b2e verified about 1 month ago

preview code

raw

history blame

2.08 kB

	---
	library_name: transformers
	tags:
	- prime-rl
	- verifiers
	- prime-intellect
	license: mit
	language:
	- en
	base_model:
	- zai-org/GLM-4.5-Air-Base
	pipeline_tag: text-generation
	---

	# INTELLECT-3

	INTELLECT-3 is a 106B (A12B) parameter Mixture-of-Experts reasoning model post-trained from [GLM-4.5-Air-Base](https://huggingface.co/zai-org/GLM-4.5-Air-Base) using supervised fine-tuning (SFT) followed by large-scale reinforcement learning (RL).

	Training was performed with [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) using environments built with the [verifiers](https://github.com/PrimeIntellect-ai/verifiers) library. All training and evaluation environments are available on the [Environments Hub](https://app.primeintellect.ai/dashboard/environments).

	The model, training frameworks, and environments are open-sourced under fully-permissive licenses (MIT and Apache 2.0).

	For more details, see the [technical report](PAPER_LINK_PLACEHOLDER).

	## Evaluation

	INTELLECT-3 achieves best-in-class performance on math, coding, and reasoning benchmarks:

	\| Benchmark \| Score \|
	\|-----------\|-------\|
	\| AIME 2025 \| 88.0 \|
	\| LiveCodeBench v6 \| 69.3 \|
	\| GPQA Diamond \| 74.4 \|
	\| HLE \| 14.6 \|

	## Model Variants

	\| Model \| HuggingFace \|
	\|-------\|-------------\|
	\| INTELLECT-3 \| [PrimeIntellect/INTELLECT-3](https://huggingface.co/PrimeIntellect/INTELLECT-3) \|
	\| INTELLECT-3-FP8 \| [PrimeIntellect/INTELLECT-3-FP8](https://huggingface.co/PrimeIntellect/INTELLECT-3-FP8) \|

	## Serving with vLLM

	The BF16 version can be served on 2x H200s:
	```bash
	vllm serve PrimeIntellect/INTELLECT-3 \
	--tensor-parallel-size 2 \
	--tool-call-parser qwen3_coder \
	--reasoning-parser deepseek_r1
	```

	The FP8 version can be served on a single H200:

	```bash
	vllm serve PrimeIntellect/INTELLECT-3-FP8 \
	--tool-call-parser qwen3_coder \
	--reasoning-parser deepseek_r1
	```

	## Citation

	```bibtex
	@misc{intellect3,
	title={INTELLECT-3: Technical Report},
	author={Prime Intellect Team},
	year={2025},
	url={https://huggingface.co/PrimeIntellect/INTELLECT-3}
	}
	```