bowenbaoamd commited on
Commit
21582b4
·
verified ·
1 Parent(s): cd5cf2d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -100
README.md CHANGED
@@ -49,105 +49,5 @@ python3 quantize_quark.py --model_dir $MODEL_DIR \
49
 
50
  This model can be deployed efficiently using the [SGLang](https://docs.sglang.ai/) backend.
51
 
52
- ## Evaluation
53
-
54
- The model was evaluated on AIME2024, GPQA Diamond, and GSM8K.
55
- Evaluation was conducted using the framework [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) and the SGLang engine.
56
-
57
- ### Accuracy
58
-
59
- <table>
60
- <tr>
61
- <td><strong>Benchmark</strong>
62
- </td>
63
- <td><strong>DeepSeek-R1 </strong>
64
- </td>
65
- <td><strong>DeepSeek-R1-MXFP4(this model)</strong>
66
- </td>
67
- <td><strong>Recovery</strong>
68
- </td>
69
- </tr>
70
- <tr>
71
- <td>AIME2024
72
- </td>
73
- <td>78.00
74
- </td>
75
- <td>76.00
76
- </td>
77
- <td>97.44%
78
- </td>
79
- </tr>
80
- <tr>
81
- <td>GPQA Diamond
82
- </td>
83
- <td>68.89
84
- </td>
85
- <td>68.18
86
- </td>
87
- <td>98.97%
88
- </td>
89
- </tr> <tr>
90
- <td>GSM8K
91
- </td>
92
- <td>95.81
93
- </td>
94
- <td>95.42
95
- </td>
96
- <td>99.59%
97
- </td>
98
- </tr>
99
- </table>
100
-
101
-
102
- ### Reproduction
103
-
104
- The results were obtained using the following commands.
105
- ```
106
- # starting server
107
- python3 -m sglang.launch_server \
108
- --model amd/DeepSeek-R1-MXFP4 \
109
- --tp 8 \
110
- --trust-remote-code \
111
- --n-share-experts-fusion 8 \
112
- --disable-radix-cache
113
- ```
114
-
115
- #### AIME2024
116
- ```
117
- # evaluating
118
- lm_eval --model local-completions \
119
- --model_args model=amd/DeepSeek-R1-MXFP4,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
120
- --tasks aime24 \
121
- --num_fewshot 0 \
122
- --gen_kwargs "do_sample=True,temperature=0.6,top_p=0.95,max_tokens=32000" \
123
- --batch_size auto \
124
- --log_samples \
125
- --output_path output_data/DeepSeek-R1-MXFP4
126
- ```
127
-
128
- #### GPQA Diamond
129
- ```
130
- lm_eval --model local-completions \
131
- --model_args model=amd/DeepSeek-R1-MXFP4,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
132
- --tasks gpqa_diamond_cot_zeroshot \
133
- --num_fewshot 0 \
134
- --gen_kwargs "do_sample=True,temperature=0.6,top_p=0.95,max_tokens=32000,max_gen_toks=32000" \
135
- --batch_size auto \
136
- --log_samples \
137
- --output_path output_data/DeepSeek-R1-MXFP4
138
- ```
139
-
140
- #### GSM8K
141
- ```
142
- lm_eval --model local-completions \
143
- --model_args model=amd/DeepSeek-R1-MXFP4,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=8096 \
144
- --tasks gsm8k \
145
- --num_fewshot 5 \
146
- --batch_size auto \
147
- --log_samples \
148
- --output_path output_data/DeepSeek-R1-MXFP4
149
- ```
150
-
151
-
152
  # License
153
  Modifications Copyright(c) 2025 Advanced Micro Devices, Inc. All rights reserved.
 
49
 
50
  This model can be deployed efficiently using the [SGLang](https://docs.sglang.ai/) backend.
51
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
52
  # License
53
  Modifications Copyright(c) 2025 Advanced Micro Devices, Inc. All rights reserved.