Why Muon Outperforms Adam: A Curvature Perspective Paper • 2606.04662 • Published 24 days ago • 10 • 4
Neural Networks Provably Learn Spectral Representations for Group Composition Paper • 2606.02993 • Published 25 days ago • 6