Skip a Layer or Loop It? Learning Program-of-Layers in LLMs Paper • 2606.06574 • Published 22 days ago • 24
Learning from Language Feedback via Variational Policy Distillation Paper • 2605.15113 • Published May 18 • 11