Zoed's picture
Upload folder using huggingface_hub
8deec80 verified
---
license: apache-2.0
base_model: Qwen/Qwen3-Coder-30B-A3B-Instruct
base_model_relation: quantized
language:
- en
tags:
- qwen3
- qwen3-coder
- code
- gguf
- quantized
- q4_k_m
pipeline_tag: text-generation
---
# Qwen3-Coder-30B-A3B-Instruct · Q4_K_M GGUF
This is a **Q4_K_M GGUF quantization** of [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct), produced from the f16 base.
| Property | Value |
|---|---|
| Base model | Qwen/Qwen3-Coder-30B-A3B-Instruct |
| Quantization | Q4_K_M |
| Format | GGUF |
| Parameters | 30B (MoE, ~3B active) |
## About the base model
Qwen3-Coder-30B-A3B-Instruct is a Mixture-of-Experts (MoE) code-focused instruction model developed by [Qwen Team, Alibaba Cloud](https://qwenlm.github.io/). It features 30B total parameters with ~3B active parameters per token.
For full details, see the [original model page](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct).
## Usage
### llama.cpp
```bash
llama-cli \
-m Qwen3-Coder-30B-A3B-Instruct-f16-Q4_K_M.gguf \
--chat-template qwen3 \
-p "Write a Python function that sorts a list of dictionaries by a given key." \
-n 512
```
### llama-server
```bash
llama-server \
-m Qwen3-Coder-30B-A3B-Instruct-f16-Q4_K_M.gguf \
--chat-template qwen3 \
--port 8080
```
### Ollama (via Modelfile)
```
FROM ./Qwen3-Coder-30B-A3B-Instruct-f16-Q4_K_M.gguf
PARAMETER num_ctx 32768
TEMPLATE "{{ ... }}" # use Qwen3 chat template
```
## Quantization details
| File | Quant | Size (approx.) |
|---|---|---|
| `Qwen3-Coder-30B-A3B-Instruct-f16-Q4_K_M.gguf` | Q4_K_M | ~17 GB |
**Q4_K_M** uses 4-bit quantization with K-quant method on most layers, providing a good balance between size and quality.
## License
This quantized model is derived from [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) and is released under the same [Apache 2.0 License](https://www.apache.org/licenses/LICENSE-2.0).
Per Qwen's terms, appropriate credit is given to the original authors:
> Qwen3-Coder-30B-A3B-Instruct is developed by Qwen Team, Alibaba Cloud.
> Original model: https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct
## Citation
```bibtex
@misc{qwen3coder,
title = {Qwen3-Coder},
author = {Qwen Team},
year = {2025},
organization = {Alibaba Cloud},
url = {https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct}
}
```