Update README.md
Browse files
README.md
CHANGED
|
@@ -102,10 +102,19 @@ The model was evaluated on reasoning tasks including AIME24, MMLU_COT, and GSM8K
|
|
| 102 |
|
| 103 |
### Reproduction
|
| 104 |
|
| 105 |
-
The results of AIME24 and MMLU_COT were obtained using [SGLang](https://docs.sglang.ai/) via
|
| 106 |
|
| 107 |
### AIME24
|
| 108 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 109 |
lm_eval --model local-completions \
|
| 110 |
--model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
|
| 111 |
--tasks aime24 \
|
|
@@ -118,6 +127,15 @@ lm_eval --model local-completions \
|
|
| 118 |
|
| 119 |
### MMLU_COT
|
| 120 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 121 |
lm_eval --model local-completions \
|
| 122 |
--model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
|
| 123 |
--tasks mmlu_cot \
|
|
|
|
| 102 |
|
| 103 |
### Reproduction
|
| 104 |
|
| 105 |
+
The results of AIME24 and MMLU_COT were obtained using [SGLang](https://docs.sglang.ai/) via forked [lm-evaluation-harness](https://github.com/BowenBao/lm-evaluation-harness/tree/cot)
|
| 106 |
|
| 107 |
### AIME24
|
| 108 |
```
|
| 109 |
+
# Launching server
|
| 110 |
+
python3 -m sglang.launch_server \
|
| 111 |
+
--model /data/DeepSeek-R1-WMXFP4-AMXFP4-Scale-UINT8-Attn-MoE-Quant/ \
|
| 112 |
+
--tp 8 \
|
| 113 |
+
--trust-remote-code \
|
| 114 |
+
--n-share-experts-fusion 8 \
|
| 115 |
+
--disable-radix-cache
|
| 116 |
+
|
| 117 |
+
# Evaluating
|
| 118 |
lm_eval --model local-completions \
|
| 119 |
--model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
|
| 120 |
--tasks aime24 \
|
|
|
|
| 127 |
|
| 128 |
### MMLU_COT
|
| 129 |
```
|
| 130 |
+
# Launching server
|
| 131 |
+
python3 -m sglang.launch_server \
|
| 132 |
+
--model amd/DeepSeek-R1-MXFP4-ASQ \
|
| 133 |
+
--tp 8 \
|
| 134 |
+
--trust-remote-code \
|
| 135 |
+
--chunked-prefill-size 32768 \
|
| 136 |
+
--mem-fraction-static 0.83
|
| 137 |
+
|
| 138 |
+
# Evaluating
|
| 139 |
lm_eval --model local-completions \
|
| 140 |
--model_args model=amd/DeepSeek-R1-MXFP4-ASQ,base_url=http://localhost:30000/v1/completions,num_concurrent=999999,timeout=999999,tokenized_requests=False,max_length=32000,temperature=0.6,top_p=0.95 \
|
| 141 |
--tasks mmlu_cot \
|