GLM-4.7-Flash-CoreAI / gpu-pipelined
32.1 GB
mlboydaisuke
remove GatherMM int8hu - superseded by gather_qmm sym8 (2.6x, same quality)
e3abac7 verified