linzhao-amd committed on
Commit d809e62 · verified · 1 Parent(s): 23d2de6

Update README.md

Files changed (1)
  1. README.md +19 -1
README.md CHANGED
@@ -102,10 +102,19 @@ The model was evaluated on reasoning tasks including AIME24, MMLU_COT, and GSM8K
 
 ### Reproduction
 
-The results of AIME24 and MMLU_COT were obtained using [SGLang](https://docs.sglang.ai/) via [forked lm-evaluation-harness](https://github.com/BowenBao/lm-evaluation-harness/tree/cot)
+The results of AIME24 and MMLU_COT were obtained using [SGLang](https://docs.sglang.ai/) via forked [lm-evaluation-harness](https://github.com/BowenBao/lm-evaluation-harness/tree/cot)
 
 ### AIME24
 ```
+# Launching server
+python3 -m sglang.launch_server \
+--model /data/DeepSeek-R1-WMXFP4-AMXFP4-Scale-UINT8-Attn-MoE-Quant/ \
+--tp 8 \
+--trust-remote-code \
+--n-share-experts-fusion 8 \
+--disable-radix-cache
+
+# Evaluating
 lm_eval --model local-completions \
 --model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
 --tasks aime24 \
@@ -118,6 +127,15 @@ lm_eval --model local-completions \
 
 ### MMLU_COT
 ```
+# Launching server
+python3 -m sglang.launch_server \
+--model amd/DeepSeek-R1-MXFP4-ASQ \
+--tp 8 \
+--trust-remote-code \
+--chunked-prefill-size 32768 \
+--mem-fraction-static 0.83
+
+# Evaluating
 lm_eval --model local-completions \
 --model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
 --tasks mmlu_cot \
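
Both evaluation commands in this change pass the same long `--model_args` string. As a minimal sketch, that string could be assembled once and reused. The helper name `build_model_args` and its two positional parameters are hypothetical; they are not part of the README or of lm-evaluation-harness. The sketch only prints the command, and it stops at `--tasks` because the rest of the `lm_eval` invocation is cut off in the diff.

```shell
# Hypothetical helper (not part of the README): builds the comma-separated
# value passed to --model_args in both lm_eval commands.
#   $1: model name as sent to the server
#   $2: URL of the OpenAI-compatible completions endpoint
# All other values are copied unchanged from the README.
build_model_args() {
  printf 'model=%s,base_url=%s,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95\n' "$1" "$2"
}

MODEL_ARGS="$(build_model_args amd/DeepSeek-R1-MXFP4-ASQ http://localhost:30000/v1/completions)"

# Print the visible part of the command instead of running it, so the
# sketch works without lm_eval installed or a server listening.
echo "lm_eval --model local-completions --model_args ${MODEL_ARGS} --tasks aime24"
```

Swapping `aime24` for `mmlu_cot` gives the visible part of the second command; the server launch flags differ between the two tasks and are left as they appear in the diff.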