amd
/

DeepSeek-R1-MXFP4

8-bit precision

Model card Files Files and versions

linzhao-amd commited on Aug 6, 2025

Commit

d39da30

·

verified ·

1 Parent(s): 21582b4

Update README.md

Files changed (1) hide show

README.md +1 -2

README.md CHANGED Viewed

@@ -37,9 +37,8 @@ python3 quantize_quark.py --model_dir $MODEL_DIR \
                           --quant_scheme w_mxfp4_a_mxfp4 \
                           --group_size 32 \
                           --num_calib_data 128 \
-                          --exclude_layers "*self_attn*" "*mlp.gate.*" "*lm_head" \
                           --multi_gpu \
-                          --quant_algo autosmoothquant \
                           --model_export hf_format \
                           --output_dir amd/DeepSeek-R1-MXFP4
 ```

                           --quant_scheme w_mxfp4_a_mxfp4 \
                           --group_size 32 \
                           --num_calib_data 128 \
+                          --exclude_layers "*lm_head" \
                           --multi_gpu \
                           --model_export hf_format \
                           --output_dir amd/DeepSeek-R1-MXFP4
 ```