Missing Old Logits in Asynchronous Agentic RL: Semantic Mismatch and Repair Methods for Off-Policy Correction Paper • 2605.12070 • Published 2 days ago • 14
Recall-Extend Dynamics: Enhancing Small Language Models through Controlled Exploration and Refined Offline Integration Paper • 2508.16677 • Published Aug 21, 2025 • 2
Attention Mechanisms Perspective: Exploring LLM Processing of Graph-Structured Data Paper • 2505.02130 • Published May 4, 2025 • 3