amd
/

DeepSeek-R1-MXFP4

8-bit precision

Model card Files Files and versions

bowenbaoamd commited on Aug 5, 2025

Commit

21582b4

·

verified ·

1 Parent(s): cd5cf2d

Update README.md

Files changed (1) hide show

README.md +0 -100

README.md CHANGED Viewed

@@ -49,105 +49,5 @@ python3 quantize_quark.py --model_dir $MODEL_DIR \
 This model can be deployed efficiently using the [SGLang](https://docs.sglang.ai/) backend.
-## Evaluation
-The model was evaluated on AIME2024, GPQA Diamond, and GSM8K.
-Evaluation was conducted using the framework [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) and the SGLang engine.
-### Accuracy
-<table>
-  <tr>
-   <td><strong>Benchmark</strong>
-   </td>
-   <td><strong>DeepSeek-R1 </strong>
-   </td>
-   <td><strong>DeepSeek-R1-MXFP4(this model)</strong>
-   </td>
-   <td><strong>Recovery</strong>
-   </td>
-  </tr>
-  <tr>
-   <td>AIME2024
-   </td>
-   <td>78.00
-   </td>
-   <td>76.00
-   </td>
-   <td>97.44%
-   </td>
-  </tr>
-  <tr>
-   <td>GPQA Diamond
-   </td>
-   <td>68.89
-   </td>
-   <td>68.18
-   </td>
-   <td>98.97%
-   </td>
-  </tr>  <tr>
-   <td>GSM8K
-   </td>
-   <td>95.81
-   </td>
-   <td>95.42
-   </td>
-   <td>99.59%
-   </td>
-  </tr>
-</table>
-### Reproduction
-The results were obtained using the following commands.
-```
-# starting server
-python3 -m sglang.launch_server \
-    --model amd/DeepSeek-R1-MXFP4 \
-    --tp 8  \
-    --trust-remote-code  \
-    --n-share-experts-fusion 8 \
-    --disable-radix-cache
-```
-#### AIME2024
-```
-# evaluating
-lm_eval --model local-completions \
-    --model_args model=amd/DeepSeek-R1-MXFP4,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
-    --tasks aime24 \
-    --num_fewshot 0 \
-    --gen_kwargs "do_sample=True,temperature=0.6,top_p=0.95,max_tokens=32000" \
-    --batch_size auto \
-    --log_samples \
-    --output_path output_data/DeepSeek-R1-MXFP4
-```
-#### GPQA Diamond
-```
-lm_eval --model local-completions \
-    --model_args model=amd/DeepSeek-R1-MXFP4,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
-    --tasks gpqa_diamond_cot_zeroshot \
-    --num_fewshot 0 \
-    --gen_kwargs "do_sample=True,temperature=0.6,top_p=0.95,max_tokens=32000,max_gen_toks=32000" \
-    --batch_size auto \
-    --log_samples \
-    --output_path output_data/DeepSeek-R1-MXFP4
-```
-#### GSM8K
-```
-lm_eval --model local-completions \
-    --model_args model=amd/DeepSeek-R1-MXFP4,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=8096 \
-    --tasks gsm8k \
-    --num_fewshot 5 \
-    --batch_size auto \
-    --log_samples \
-    --output_path output_data/DeepSeek-R1-MXFP4
-```
 # License
 Modifications Copyright(c) 2025 Advanced Micro Devices, Inc. All rights reserved.

 This model can be deployed efficiently using the [SGLang](https://docs.sglang.ai/) backend.
 # License
 Modifications Copyright(c) 2025 Advanced Micro Devices, Inc. All rights reserved.