Update docs/deploy_guidance.md
Browse files- docs/deploy_guidance.md +1 -1
docs/deploy_guidance.md
CHANGED
|
@@ -15,7 +15,7 @@ docker pull jdopensource/joyai-llm-vllm:v0.15.1-joyai_llm_flash
|
|
| 15 |
```
|
| 16 |
2. launch JoyAI-LLM Flash model with dense MTP.
|
| 17 |
```bash
|
| 18 |
-
vllm serve jdopensource/JoyAI-LLM-Flash-INT4
|
| 19 |
--tool-call-parser qwen3_coder --enable-auto-tool-choice \
|
| 20 |
--speculative-config $'{"method": "mtp", "num_speculative_tokens": 3}'
|
| 21 |
```
|
|
|
|
| 15 |
```
|
| 16 |
2. launch JoyAI-LLM Flash model with dense MTP.
|
| 17 |
```bash
|
| 18 |
+
vllm serve jdopensource/JoyAI-LLM-Flash-INT4 -tp 1 --trust-remote-code \
|
| 19 |
--tool-call-parser qwen3_coder --enable-auto-tool-choice \
|
| 20 |
--speculative-config $'{"method": "mtp", "num_speculative_tokens": 3}'
|
| 21 |
```
|