GLM2NSA / topk_sparse_attention.py

Commit History

Changed to autotune triton for 48G GPU deployment
4ee9d9e

Maxtimer97 commited on