The Pitfalls of Memorization: When Memorization Hurts Generalization • Paper • 2412.07684 • Published Dec 10, 2024
Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation • Paper • 2507.10524 • Published Jul 14, 2025