Why Muon Outperforms Adam: A Curvature Perspective Paper • 2606.04662 • Published 27 days ago
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published Sep 30, 2025
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Paper • 2505.15141 • Published May 21, 2025