Update README.md
README.md

````diff
@@ -34,7 +34,7 @@ Below is an example of how to quantize this model:
 
 ```bash
 cd Quark/examples/torch/language_modeling/llm_ptq/
-exclude_layers="lm_head *self_attn* *mlp.gate *eh_proj"
+exclude_layers="lm_head *self_attn* *mlp.gate *eh_proj *shared_head.head"
 python3 quantize_quark.py --model_dir $MODEL_DIR \
     --quant_scheme w_mxfp4_a_mxfp4 \
     --num_calib_data 32 \
````
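For context on what this change does: the `exclude_layers` patterns look like shell-style globs matched against module names, so adding `*shared_head.head` keeps any layer whose name ends in `shared_head.head` out of quantization. A minimal sketch of that matching semantics, assuming `fnmatch`-style globbing (Quark's actual matcher may differ) and hypothetical layer names chosen for illustration:

```python
from fnmatch import fnmatch

# Patterns from the updated exclude_layers string in the README.
exclude_patterns = [
    "lm_head", "*self_attn*", "*mlp.gate", "*eh_proj", "*shared_head.head",
]

def is_excluded(layer_name: str) -> bool:
    # A layer is left unquantized if any pattern matches its full name.
    return any(fnmatch(layer_name, p) for p in exclude_patterns)

# Hypothetical layer names, for illustration only.
print(is_excluded("lm_head"))                          # True
print(is_excluded("model.layers.0.self_attn.q_proj"))  # True
print(is_excluded("model.shared_head.head"))           # True (newly excluded)
print(is_excluded("model.layers.0.mlp.down_proj"))     # False (still quantized)
```

Without the new pattern, `model.shared_head.head` would match nothing in the list and would be quantized along with the rest of the model.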