Xiao-AMD commited on
Commit
f8a517c
·
verified ·
1 Parent(s): 7a9dbbf

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -24
README.md CHANGED
@@ -77,35 +77,26 @@ The model was evaluated on gsm8k benchmarks using the [vllm](https://github.com/
77
 
78
  ### Reproduction
79
 
80
- The GSM8K results were obtained using the vLLM framework, based on the Docker image `rocm/vllm:rocm7.0.0_vllm_0.11.2_20251210`, and vLLM is installed inside the container with fixes applied for model support.
81
 
82
- #### Preparation in container
83
- ```
84
- # Reinstall vLLM
85
- pip uninstall vllm -y
86
- git clone https://github.com/vllm-project/vllm.git
87
- cd vllm
88
- git checkout v0.13.0
89
- pip install -r requirements/rocm.txt
90
- python setup.py develop
91
- cd ..
92
  ```
 
93
 
94
- #### Launching server
95
- ```
96
- VLLM_ROCM_USE_AITER=1 \
97
- VLLM_DISABLE_COMPILE_CACHE=1 \
98
- vllm serve "$MODEL" \
99
- --tensor-parallel-size 4 \
100
- --trust-remote-code \
101
- --max-model-len 32768 \
102
- --port 8899
103
- ```
104
 
 
 
 
105
 
106
- #### Evaluating model in a new terminal
107
- ```
108
- python vllm/tests/evals/gsm8k/gsm8k_eval.py --host http://127.0.0.1 --port 8899 --num-questions 1000 --save-results logs
 
 
 
 
109
  ```
110
 
111
 
 
77
 
78
  ### Reproduction
79
 
80
+ The GSM8K results were obtained using the lm-eval framework, based on the Docker image `rocm/vllm-dev:nightly`.
81
 
82
+ #### Evaluating model in a new terminal
 
 
 
 
 
 
 
 
 
83
  ```
84
+ export VLLM_ROCM_USE_AITER=1
85
 
86
+ export model_dir_fp4=MiniMaxAI/MiniMax-M2.5-mxfp4/
87
+ log_file=minimax25-lm_eval_gsm8k_mxfp4.txt
 
 
 
 
 
 
 
 
88
 
89
+ lm_eval --model vllm --model_args pretrained=$model_dir_fp4,enforce_eager=True,trust_remote_code=True,max_model_len=16384 \
90
+ --gen_kwargs temperature=1.0,top_p=0.95,top_k=40 \
91
+ --tasks gsm8k --num_fewshot 8 2>&1 | tee $log_file
92
 
93
+
94
+ export model_dir=MiniMaxAI/MiniMax-M2.5/
95
+ log_file=minimax25-lm_eval_gsm8k.txt
96
+
97
+ lm_eval --model vllm --model_args pretrained=$model_dir,enforce_eager=True,tensor_parallel_size=8,trust_remote_code=True,max_model_len=16384 \
98
+ --gen_kwargs temperature=0.8,top_p=0.95,top_k=40 \
99
+ --tasks gsm8k --num_fewshot 8 2>&1 | tee $log_file
100
  ```
101
 
102