Upload transformer-only FP8 quantized runtime (scaled fp8) a8eeee1 verified chaechae7 committed 20 days ago