how tu quant to fp8?

#8
by yebo8964 - opened

hello~ May I ask how you quantized GLM-5 from FP16 to FP8?

Sign up or log in to comment