# Model Quantization

The model was quantized from Qwen3-Coder-Next using [AMD-Quark](https://quark.docs.amd.com/latest/index.html). The weights and activations are quantized to MXFP4.

**Quantization scripts:**

Note that `qwen3_next` is not in the built-in model template list in Quark V0.11, so it has to be registered before quantization.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from datasets import load_dataset
from quark.torch import LLMTemplate, ModelQuantizer, export_safetensors
from quark.contrib.llm_eval import ppl_eval
```