UniPrefill: Universal Long-Context Prefill Acceleration via Block-wise Dynamic Sparsification • Paper • 2605.06221 • Published May 7 • 22
Backward-Compatible Aligned Representations via an Orthogonal Transformation Layer • Paper • 2408.08793 • Published Aug 16, 2024 • 7