linzhao-amd committed on
Commit 355dba7 · verified · 1 Parent(s): 345ec45

Update README.md

  1. README.md +3 -3
README.md CHANGED
@@ -13,9 +13,9 @@ base_model:
  - **Operating System(s):** Linux
  - **Inference Engine:** [vLLM](https://docs.vllm.ai/en/latest/)
  - **Model Optimizer:** [AMD-Quark](https://quark.docs.amd.com/latest/index.html) (V0.11.1)
- - **moe**
- - **Weight quantization:** MOE-only, OCP MXFP4, Static
- - **Activation quantization:** MOE-only, OCP MXFP4, Dynamic
+ - **Quantized layers:** Experts in MoE
+ - **Weight quantization:** MOE-only, OCP MXFP4, Static
+ - **Activation quantization:** MOE-only, OCP MXFP4, Dynamic
  - **Calibration Dataset:** [Pile](https://huggingface.co/datasets/mit-han-lab/pile-val-backup)
 
  This model was built with GLM-5 model by applying [AMD-Quark](https://quark.docs.amd.com/latest/index.html) for MXFP4 quantization.
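
Reviewer note: for readers unfamiliar with the "OCP MXFP4" format named in the changed lines, here is a minimal sketch of MXFP4 fake-quantization (quantize, then dequantize). It assumes the OCP Microscaling layout: blocks of 32 elements share one power-of-two scale, and each element is stored as FP4 E2M1. The function name and constants are illustrative and are not AMD-Quark's or vLLM's API, and tie-breaking is simplified to "nearest, lower value on ties". Under the usual reading of the README, "Static" would mean the weight scales are computed once offline, while "Dynamic" would mean the activation scales are recomputed at inference time; the arithmetic per block is the same in both cases.

```python
import math

# Magnitudes representable by FP4 E2M1 (the sign bit is handled separately).
FP4_E2M1_GRID = (0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0)
BLOCK_SIZE = 32  # elements sharing one scale in the MX formats
ELEM_EMAX = 2    # exponent of the largest E2M1 value (6 = 1.5 * 2**2)


def mxfp4_fake_quantize(values):
    """Round each block of 32 floats to MXFP4 and return the dequantized floats.

    Hypothetical helper for illustration only.
    """
    if len(values) % BLOCK_SIZE != 0:
        raise ValueError("length must be a multiple of 32")

    out = []
    for start in range(0, len(values), BLOCK_SIZE):
        block = values[start:start + BLOCK_SIZE]
        max_abs = max(abs(v) for v in block)
        if max_abs == 0.0:
            # Nothing to scale in an all-zero block.
            out.extend(0.0 for _ in block)
            continue

        # floor(log2(max_abs)) via frexp, which avoids log rounding issues.
        floor_log2 = math.frexp(max_abs)[1] - 1
        # Shared power-of-two scale, clamped to an 8-bit exponent range.
        shared_exp = max(-127, min(127, floor_log2 - ELEM_EMAX))
        scale = 2.0 ** shared_exp

        for v in block:
            # Saturate at the largest FP4 magnitude, then snap to the grid.
            mag = min(abs(v) / scale, FP4_E2M1_GRID[-1])
            nearest = min(FP4_E2M1_GRID, key=lambda g: abs(g - mag))
            out.append(math.copysign(nearest * scale, v))
    return out
```

Values that already sit on the scaled FP4 grid pass through unchanged; anything else is rounded to the nearest grid point for its block, and values above 6 times the block scale saturate.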