ternary-quant-demo / ternary_quant
893 kB
AsadIsmail's picture
Update Qwen2-VL: now text+vision backbone quantized (fixed qkv + NaN bug)
0b1047e verified