Update README.md

898e2a0 verified 2 months ago

354 Bytes

license: apache-2.0
base_model:
  - Qwen/Qwen3-Coder-30B-A3B-Instruct

Description

NVFP4 Quantization of Qwen/Qwen3-Coder-30B-A3B-Instruct using TensorRT-Model-Optimizer. KV Cache quantized to FP8 for compatibility with inference backends.