Only producing garbage in H200, cu130 with CUDA 13.0

#1
by Dsturb - opened

Hi,

I can not get any reasonable output from your quntisation. The Qwen 3 2507 is running well in NVFP4 even there is no native support.

In a similar position, just get gibberish output from this quant

Sign up or log in to comment