Update README.md
README.md
CHANGED
|
@@ -61,27 +61,7 @@ The model was evaluated on GSM8K benchmarks.
|
|
| 61 |
|
| 62 |
### Reproduction
|
| 63 |
|
| 64 |
-
The GSM8K results were obtained using the
|
| 65 |
-
|
| 66 |
-
#### Preparation in container
|
| 67 |
-
```
|
| 68 |
-
# installl vLLM
|
| 69 |
-
git clone https://github.com/vllm-project/vllm.git
|
| 70 |
-
cd vllm
|
| 71 |
-
git checkout 74c583bc508c2dafb9e95bab3b635884e4a021f3
|
| 72 |
-
pip install -r requirements/rocm.txt
|
| 73 |
-
python setup.py develop
|
| 74 |
-
cd ..
|
| 75 |
-
|
| 76 |
-
# install lm-eval
|
| 77 |
-
git clone --recursive https://github.com/EleutherAI/lm-evaluation-harness.git
|
| 78 |
-
cd lm-evaluation-harness
|
| 79 |
-
git checkout 3ac28fc5782dc1b3cc62ae4337b1aaf8d0065bef
|
| 80 |
-
pip install -e .
|
| 81 |
-
cd ..
|
| 82 |
-
|
| 83 |
-
pip install lm-eval[api]
|
| 84 |
-
```
|
| 85 |
|
| 86 |
#### Launching server
|
| 87 |
```
|
|
|
|
| 61 |
|
| 62 |
### Reproduction
|
| 63 |
|
| 64 |
+
The GSM8K results were obtained using the lm-evaluation-harness framework, based on the Docker image `rocm/vllm-private:vllm_dev_base_mxfp4_20260122`, with vLLM, lm-eval and amd-quark compiled and installed from source inside the image.
|
| 65 |
|
| 66 |
#### Launching server
|
| 67 |
```
|