docs: update README.md

#1
by quocbao747 - opened

The below command should run Qwen/Qwen3-Coder-Next-FP8 on 2 GPUs instead of 4

vllm serve Qwen/Qwen3-Coder-Next-FP8 --port 8000 --tensor-parallel-size 2 --enable-auto-tool-choice --tool-call-parser qwen3_coder
Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment