docs: update README.md
#1, opened by quocbao747
The command below should run Qwen/Qwen3-Coder-Next-FP8 on 2 GPUs instead of 4:
vllm serve Qwen/Qwen3-Coder-Next-FP8 --port 8000 --tensor-parallel-size 2 --enable-auto-tool-choice --tool-call-parser qwen3_coder
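For anyone who wants to switch GPU counts without editing the README command by hand, here is a minimal sketch that rebuilds the same command from two arguments. The function name `build_serve_cmd` and its positional arguments are hypothetical and not part of the README; only the `vllm serve` flags are taken from the command above. It prints the command rather than launching the server.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper (not part of the README): prints the serve command
# for a given tensor-parallel size and port.
#   $1 = number of GPUs to shard the model across (default: 2)
#   $2 = port to listen on (default: 8000)
build_serve_cmd() {
  local tp_size="${1:-2}"
  local port="${2:-8000}"
  echo vllm serve Qwen/Qwen3-Coder-Next-FP8 \
    --port "$port" \
    --tensor-parallel-size "$tp_size" \
    --enable-auto-tool-choice \
    --tool-call-parser qwen3_coder
}

# Dry run: print the 2-GPU command so it can be reviewed before running it.
build_serve_cmd 2 8000
```

Calling `build_serve_cmd 4 8000` prints the same command with `--tensor-parallel-size 4`, which makes the difference between the two setups easy to compare.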