Update README.md (#2)
Browse files- Update README.md (4306f755bc1fac2f3952f0224d5772e09993c6d0)
Co-authored-by: Rodri Mora <bullerwins@users.noreply.huggingface.co>
README.md
CHANGED
|
@@ -26,7 +26,7 @@ otherwise the expert tensors couldn’t be evenly sharded across GPU devices.</i
|
|
| 26 |
```
|
| 27 |
CONTEXT_LENGTH=32768
|
| 28 |
vllm serve \
|
| 29 |
-
|
| 30 |
--served-model-name My_Model \
|
| 31 |
--enable-auto-tool-choice \
|
| 32 |
--tool-call-parser glm45 \
|
|
|
|
| 26 |
```
|
| 27 |
CONTEXT_LENGTH=32768
|
| 28 |
vllm serve \
|
| 29 |
+
QuantTrio/GLM-4.6-AWQ \
|
| 30 |
--served-model-name My_Model \
|
| 31 |
--enable-auto-tool-choice \
|
| 32 |
--tool-call-parser glm45 \
|