vllm says it's not AWQ

#2 by jcowles

It says it's compressed-tensors

compressed-tensors is the file format; AWQ is the quantization algorithm used to produce the weights. It's AWQ.

Specifically, --quantization awq complains that it's not an AWQ-quantized model.

It does load and run without the flag, though.

Yeah, --quantization compressed-tensors would work too. Passing no flag at all lets vLLM auto-detect the file format, so the --quantization flag is misleading as to what it actually does: it selects the checkpoint format, not the quantization algorithm.
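Concretely, the three invocations look like this (the model name is a placeholder, and this assumes a recent vLLM with the vllm serve entry point):

```shell
# Fails: the checkpoint's quantization_config says compressed-tensors, not awq
vllm serve some-org/some-awq-model --quantization awq

# Works: the flag matches the on-disk file format
vllm serve some-org/some-awq-model --quantization compressed-tensors

# Also works: with no flag, vLLM auto-detects the format from the config
vllm serve some-org/some-awq-model
```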
