Self-Enhanced Reasoning Training: Activating Latent Reasoning in Small Models for Enhanced Reasoning Distillation Paper • 2502.12744 • Published Feb 18, 2025 • 3
MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent Paper • 2507.02259 • Published Jul 3, 2025 • 6
Self-Taught Self-Correction for Small Language Models Paper • 2503.08681 • Published Mar 11, 2025 • 16
Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards Paper • 2605.14539 • Published May 14 • 7
SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution Paper • 2605.18401 • Published May 18 • 130