Lost in Backpropagation: The LM Head is a Gradient Bottleneck Paper • 2603.10145 • Published Mar 10 • 13
AI Paper of the Day Collection A collection of papers that I think are interesting, one added each day • 639 items • Updated 6 days ago • 97
Article 🪆 Introduction to Matryoshka Embedding Models +1 tomaarsen, Xenova, osanseviero • Feb 23, 2024 • 207