jiaxwang committed on
Commit
60ea990
·
verified ·
1 Parent(s): 406c7b4

Update README.md

Files changed (1)
  1. README.md +1 -21
README.md CHANGED
@@ -61,27 +61,7 @@ The model was evaluated on GSM8K benchmarks.
61
 
62
  ### Reproduction
63
 
64
- The GSM8K results were obtained using the `lm-evaluation-harness` framework, based on the Docker image `rocm/vllm-dev:base`, with vLLM and lm-eval compiled and installed from source inside the container.
65
-
66
- #### Preparation in container
67
- ```
68
- # install vLLM
69
- git clone https://github.com/vllm-project/vllm.git
70
- cd vllm
71
- git checkout 74c583bc508c2dafb9e95bab3b635884e4a021f3
72
- pip install -r requirements/rocm.txt
73
- python setup.py develop
74
- cd ..
75
-
76
- # install lm-eval
77
- git clone --recursive https://github.com/EleutherAI/lm-evaluation-harness.git
78
- cd lm-evaluation-harness
79
- git checkout 3ac28fc5782dc1b3cc62ae4337b1aaf8d0065bef
80
- pip install -e .
81
- cd ..
82
-
83
- pip install lm-eval[api]
84
- ```
85
 
86
  #### Launching server
87
  ```
 
61
 
62
  ### Reproduction
63
 
64
+ The GSM8K results were obtained using the lm-evaluation-harness framework, based on the Docker image `rocm/vllm-private:vllm_dev_base_mxfp4_20260122`, with vLLM, lm-eval and amd-quark compiled and installed from source inside the image.
65
 
66
  #### Launching server
67
  ```