Fundamental Reasoning Paradigms Induce Out-of-Domain Generalization in Language Models Paper • 2602.08658 • Published Feb 9 • 13
SSA: Sparse Sparse Attention by Aligning Full and Sparse Attention Outputs in Feature Space Paper • 2511.20102 • Published Nov 25, 2025 • 28