Only producing garbage in H200, cu130 with CUDA 13.0

#1
by Dsturb - opened

Hi,

I can not get any reasonable output from your quntisation. The Qwen 3 2507 is running well in NVFP4 even there is no native support.

@Dsturb try this ykarout/Qwen3.5-9b-nvfp4

Sign up or log in to comment