---
library_name: transformers
license: apache-2.0
pipeline_tag: text-generation
---

# Qwen3-32B

## Model Overview

**Qwen3-32B** has the following features:

- **Type:** Causal language model
- **Training Stage:** Pretraining & Post-training
- **Number of Parameters:** 32.8B
- **Number of Parameters (Non-Embedding):** 31.2B
- **Number of Layers:** 64
- **Number of Attention Heads (GQA):** 64 for Q and 8 for KV
- **Context Length:** 32,768 tokens natively and 131,072 tokens with YaRN
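
The figures above imply a few derived quantities that are useful when sizing a deployment. The following is a minimal sketch of that arithmetic using only the numbers listed in this card. The function name `derived_figures` and the dictionary keys are hypothetical, chosen for illustration, and the YaRN scaling ratio is assumed here to be simply the extended context length divided by the native one.

```python
# Figures copied from the feature list in this card.
TOTAL_PARAMS_B = 32.8          # total parameters, in billions
NON_EMBEDDING_PARAMS_B = 31.2  # non-embedding parameters, in billions
NUM_Q_HEADS = 64               # query heads (GQA)
NUM_KV_HEADS = 8               # key/value heads (GQA)
NATIVE_CONTEXT = 32_768        # native context length, in tokens
YARN_CONTEXT = 131_072         # context length with YaRN, in tokens


def derived_figures() -> dict:
    """Return quantities implied by the card (hypothetical helper)."""
    return {
        # Parameters held in embeddings: total minus non-embedding.
        "embedding_params_b": round(TOTAL_PARAMS_B - NON_EMBEDDING_PARAMS_B, 1),
        # Under GQA, each KV head is shared by this many query heads.
        "q_heads_per_kv_head": NUM_Q_HEADS // NUM_KV_HEADS,
        # Assumed YaRN scaling ratio: extended context over native context.
        "yarn_scaling_ratio": YARN_CONTEXT / NATIVE_CONTEXT,
    }


if __name__ == "__main__":
    for name, value in derived_figures().items():
        print(f"{name}: {value}")
```

Sharing each KV head across several query heads shrinks the KV cache relative to full multi-head attention by the same ratio, which matters most at the longer YaRN context length.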