Unraveling the Enigma of Double Descent: An In-depth Analysis through the Lens of Learned Feature Space Paper • 2310.13572 • Published Oct 20, 2023
Mano: Restriking Manifold Optimization for LLM Training Paper • 2601.23000 • Published 6 days ago • 2