Update README.md
README.md
@@ -134,7 +134,7 @@ See [its documentation](https://docs.vllm.ai/en/stable/getting_started/installat
 
 The following command creates an API endpoint at `http://localhost:8000/v1` with a maximum context length of 256K tokens, using tensor parallelism across 2 GPUs.
 ```shell
-vllm serve Qwen/Qwen3-Coder-Next
+vllm serve Qwen/Qwen3-Coder-Next --port 8000 --tensor-parallel-size 2 --enable-auto-tool-choice --tool-call-parser qwen3_coder
 ```
 
 > [!Note]
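Once the server started by the `vllm serve` command above is running, the endpoint speaks the OpenAI-compatible chat completions API. A minimal request might look like the following sketch; the prompt and `max_tokens` value are illustrative, not part of the original change:

```shell
# Query the OpenAI-compatible endpoint exposed by `vllm serve` above.
# Assumes the server is already up on localhost:8000.
curl -s http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "Qwen/Qwen3-Coder-Next",
    "messages": [
      {"role": "user", "content": "Write a Python function that reverses a string."}
    ],
    "max_tokens": 256
  }'
```

Because `--enable-auto-tool-choice` and `--tool-call-parser qwen3_coder` are passed at startup, requests that include a `tools` array in the same payload can also trigger parsed tool calls in the response.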