SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents Paper • 2601.16746 • Published 3 days ago • 59
The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models Paper • 2601.15165 • Published 5 days ago • 63
Spurious Rewards Paradox: Mechanistically Understanding How RLVR Activates Memorization Shortcuts in LLMs Paper • 2601.11061 • Published 10 days ago • 7
Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment Paper • 2601.14249 • Published 6 days ago • 8
A BERTology View of LLM Orchestrations: Token- and Layer-Selective Probes for Efficient Single-Pass Classification Paper • 2601.13288 • Published 7 days ago • 12
Lost in the Prompt Order: Revealing the Limitations of Causal Attention in Language Models Paper • 2601.14152 • Published 6 days ago • 4
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences Paper • 2601.06789 • Published 15 days ago • 77
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 14 days ago • 112
X-Coder: Advancing Competitive Programming with Fully Synthetic Tasks, Solutions, and Tests Paper • 2601.06953 • Published 15 days ago • 43
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published 14 days ago • 32
PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning Paper • 2601.05593 • Published 17 days ago • 79
One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling Paper • 2601.03111 • Published 20 days ago • 9
Few Tokens Matter: Entropy Guided Attacks on Vision-Language Models Paper • 2512.21815 • Published Dec 26, 2025 • 22
DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation Paper • 2601.04823 • Published 18 days ago • 6