rahtml committed on
Commit 898e2a0 · verified · 1 Parent(s): f926401

Update README.md

Files changed (1)
  1. README.md +5 -1
README.md CHANGED
@@ -2,4 +2,8 @@
  license: apache-2.0
  base_model:
  - Qwen/Qwen3-Coder-30B-A3B-Instruct
- ---
+ ---
+
+ ## Description
+
+ NVFP4 quantization of [Qwen/Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct), produced with [TensorRT-Model-Optimizer](https://github.com/NVIDIA/Model-Optimizer). The KV cache is quantized to FP8 for compatibility with inference backends.
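
For readers unfamiliar with the format named in the description, here is a minimal sketch of block-scaled 4-bit quantization in the spirit of NVFP4. It is an illustration only, not the code path used by TensorRT-Model-Optimizer. It assumes 4-bit E2M1 values with one shared scale per block of 16 elements, and it keeps that scale as an ordinary Python float rather than storing it in FP8. The names `E2M1_GRID`, `fake_quantize_block`, and `fake_quantize` are hypothetical.

```python
# Magnitudes representable by a 4-bit E2M1 float (the sign is a separate bit).
E2M1_GRID = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)
BLOCK_SIZE = 16  # assumption: one shared scale per 16 consecutive values


def fake_quantize_block(block):
    """Round one block to the E2M1 grid with a shared scale, then dequantize."""
    max_abs = max(abs(x) for x in block)
    if max_abs == 0.0:
        return [0.0] * len(block)
    # Map the largest magnitude in the block onto the largest grid value (6.0).
    # Simplification: the scale stays a full-precision float in this sketch.
    scale = max_abs / E2M1_GRID[-1]
    out = []
    for x in block:
        # Pick the grid magnitude closest to the scaled input.
        mag = min(E2M1_GRID, key=lambda g: abs(abs(x) / scale - g))
        out.append(mag * scale if x >= 0 else -mag * scale)
    return out


def fake_quantize(values, block_size=BLOCK_SIZE):
    """Apply block-wise fake quantization to a flat list of floats."""
    out = []
    for i in range(0, len(values), block_size):
        out.extend(fake_quantize_block(values[i:i + block_size]))
    return out
```

Calling `fake_quantize` on a list of weights returns a list of the same length in which every block holds at most 15 distinct values (seven nonzero magnitudes in either sign, plus zero), which is the precision loss a 4-bit format trades for a smaller footprint.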