Update README.md
README.md
CHANGED
@@ -118,10 +118,14 @@ MODEL_ARGS="model_name=amd/DeepSeek-R1-0528-MXFP4-ASQ,dtype=bfloat16,tensor_para
 OUTPUT_DIR="results/DeepSeek-R1-0528-MXFP4-ASQ-Seed"
 LOG="logs/deepseek_0528_maxfp4.log"
 
-
-
-
-
+for i in $(seq 1 10); do
+# seed in [0, 2**30 - 1]
+SEED=$(shuf -i 0-1073741823 -n 1)
+
+lighteval vllm $MODEL_ARGS "custom|aime24_single|0|0,custom|math_500_single|0|0,custom|gpqa:diamond_single|0|0" \
+--use-chat-template \
+--output-dir "$OUTPUT_DIR/seed_$SEED" \
+2>&1 | tee -a "$LOG"
 ```
 
 The result of GSM8K was obtained using [lm-eval-harness](https://github.com/EleutherAI/lm-evaluation-harness) and the following commands.
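
Review note: the loop added in this hunk draws a random seed per run and writes each run to its own `seed_$SEED` directory, but the hunk as shown reaches the closing fence without a `done`. Below is a minimal sketch of the same pattern with the loop closed, in case it helps when checking the README. `draw_seed` and `run_eval` are hypothetical helpers introduced only for illustration (they are not in the README), `run_eval` merely echoes instead of calling `lighteval`, the defaults for `OUTPUT_DIR` and `LOG` are placeholders, and the sketch assumes GNU coreutils `shuf` is available.

```shell
# Sketch only: draw_seed and run_eval are hypothetical helpers, not part of the README.

# Upper bound of the seed range used in the diff: 2**30 - 1 = 1073741823.
MAX_SEED=$(( (1 << 30) - 1 ))

# Draw one integer uniformly from [0, MAX_SEED] (assumes GNU coreutils shuf).
draw_seed() {
    shuf -i "0-${MAX_SEED}" -n 1
}

# Placeholder for the real evaluation command; it only echoes its argument.
run_eval() {
    echo "would evaluate with output dir: $1"
}

# Placeholder defaults so the sketch runs on its own.
OUTPUT_DIR="${OUTPUT_DIR:-results/example}"
LOG="${LOG:-/dev/null}"

# Three iterations instead of ten, to keep the demonstration short.
for i in $(seq 1 3); do
    SEED=$(draw_seed)
    # One output directory per seed; stdout and stderr are appended to the log.
    run_eval "$OUTPUT_DIR/seed_$SEED" 2>&1 | tee -a "$LOG"
done
```

Two things may be worth checking against the full README. In the hunk as shown, `SEED` appears only in the output path. Also, because the command is piped into `tee`, the pipeline's exit status is that of `tee` unless bash's `set -o pipefail` is enabled, so a failed run would not stop the loop by itself.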