Update README.md
README.md
CHANGED
@@ -40,7 +40,7 @@ python3 internal_scripts/quantize_quark.py \
 --attention_dtype fp8 \
 --exclude_layers $exclude_layers \
 --num_calib_data 512 \
---output_dir amd/gpt-oss120b-
+--output_dir amd/gpt-oss120b-moe_w-mxfp4-a-fp8-attn_ptpc-kv-soft_fp8 \
 --model_export hf_format \
 --multi_gpu
 ```