GLM2NSA / compressed_attention.py

Commit History

Added num stages tuning
42f4907

Maxtimer97 commited on

Changed to autotune triton for 48G GPU deployment
4ee9d9e

Maxtimer97 commited on

Removed assertion
22ba83b

Maxtimer97 commited on

Removed wrong edge case assertion
bb26ab9

Maxtimer97 commited on