BLaST: High Performance Inference and Pretraining using BLock Sparse Transformers Paper • 2507.03117 • Published Jul 3