vllm v0.10.2 error

#2
by traphix - opened

vllm v0.10.2 on 2 x A100

vllm serve --served-model-name qwen3-next-80b-a3b-instruct \
    --model /data/model-cache/Qwen3-Next-80B-A3B-Instruct-FP8-Dynamic/ \
    --tensor-parallel-size 2

error

RuntimeError: size_n = 6176 is not divisible by tile_n_size = 64

Sign up or log in to comment