rahtml/Qwen3-Coder-30B-A3B-Instruct-NVFP4

8-bit precision

rahtml committed on Dec 9, 2025

Commit 898e2a0 · verified · 1 Parent(s): f926401

Update README.md

Files changed (1)

README.md +5 -1

@@ -2,4 +2,8 @@
 license: apache-2.0
 base_model:
 - Qwen/Qwen3-Coder-30B-A3B-Instruct
----
+---
+## Description
+NVFP4 Quantization of [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct) using [TensorRT-Model-Optimizer](https://github.com/NVIDIA/Model-Optimizer). KV Cache quantized to FP8 for compatibility with inference backends.
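
For readers unfamiliar with the format named in the description, the following is a minimal sketch of block-wise 4-bit (E2M1) fake quantization, the general idea behind NVFP4. It is not the code used to produce this checkpoint and does not call TensorRT-Model-Optimizer. The function names (`quantize_block`, `dequantize_block`, `fake_quantize`) are hypothetical, and the sketch simplifies the format: it assumes 16-element blocks and keeps each block scale as a plain Python float, whereas NVFP4 as generally described stores block scales in FP8 and adds a per-tensor scale.

```python
# Illustrative only: a simplified, pure-Python model of block-wise FP4 (E2M1)
# quantization. All names here are hypothetical, not a library API.

# Magnitudes representable by a 4-bit E2M1 float (sign handled separately).
E2M1_GRID = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)
E2M1_MAX = 6.0
BLOCK_SIZE = 16  # assumption: one scale per 16 consecutive values


def quantize_block(block):
    """Return (scale, codes) where each code is a signed E2M1 grid value."""
    amax = max(abs(x) for x in block)
    # Map the largest magnitude in the block onto the largest grid value.
    # Simplification: the scale stays a Python float instead of FP8.
    scale = amax / E2M1_MAX if amax > 0 else 1.0
    codes = []
    for x in block:
        # Round the scaled magnitude to the nearest representable value.
        mag = min(E2M1_GRID, key=lambda g: abs(abs(x) / scale - g))
        codes.append(-mag if x < 0 else mag)
    return scale, codes


def dequantize_block(scale, codes):
    """Reconstruct approximate values from a block scale and its codes."""
    return [scale * c for c in codes]


def fake_quantize(values):
    """Quantize then dequantize a flat list of floats, block by block."""
    out = []
    for i in range(0, len(values), BLOCK_SIZE):
        scale, codes = quantize_block(values[i:i + BLOCK_SIZE])
        out.extend(dequantize_block(scale, codes))
    return out
```

Calling `fake_quantize` on a list of weights returns the values an inference backend would effectively see after dequantization, which makes the rounding error of the 4-bit grid easy to inspect.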