vllm v0.10.2 error
#2
by traphix - opened
I am running vLLM v0.10.2 on 2 x A100 with the following command:
vllm serve --served-model-name qwen3-next-80b-a3b-instruct \
--model /data/model-cache/Qwen3-Next-80B-A3B-Instruct-FP8-Dynamic/ \
--tensor-parallel-size 2
It fails with this error:
RuntimeError: size_n = 6176 is not divisible by tile_n_size = 64
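For anyone trying to reproduce this, here is a small sketch of the arithmetic behind the message. The two numbers are copied from the error; everything else is an assumption on my part. In particular, I am guessing that 6176 is a per-GPU shard produced by --tensor-parallel-size 2, so the unsharded dimension would be 6176 * 2. The helper name is_tile_aligned is hypothetical and is not a vLLM function. This only checks divisibility; it does not confirm where in vLLM the check is raised.

```python
# Values copied verbatim from the RuntimeError above.
size_n = 6176
tile_n_size = 64


def is_tile_aligned(n: int, tile: int) -> bool:
    """Return True when n is a whole multiple of tile (hypothetical helper)."""
    return n % tile == 0


# The condition the error message reports as failing.
remainder = size_n % tile_n_size
print(f"{size_n} % {tile_n_size} = {remainder}")

# ASSUMPTION: size_n is a per-rank shard created by --tensor-parallel-size 2,
# so the unsharded dimension would be twice as large.
assumed_tp_size = 2
assumed_full_n = size_n * assumed_tp_size

# See which tensor-parallel sizes would give tile-aligned shards
# under that assumption.
for tp in (1, 2, 4):
    if assumed_full_n % tp != 0:
        continue  # the dimension cannot be split evenly at this TP size
    shard = assumed_full_n // tp
    print(f"tp={tp}: shard={shard}, aligned={is_tile_aligned(shard, tile_n_size)}")
```

If the assumption holds, the unsharded size is a multiple of 64 but the shard at tensor parallel size 2 is not, which would match the error seen here.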