Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling Paper • 2604.27039 • Published Apr 29 • 25
Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space Paper • 2512.12623 • Published Dec 14, 2025 • 4
Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey Paper • 2602.06052 • Published Jan 14 • 6
Proactive Agent Research Environment: Simulating Active Users to Evaluate Proactive Assistants Paper • 2604.00842 • Published Apr 1 • 15
WildSci: Advancing Scientific Reasoning from In-the-Wild Literature Paper • 2601.05567 • Published Jan 9
Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing Paper • 2602.04837 • Published Feb 4 • 9
Procedural Generation of Algorithm Discovery Tasks in Machine Learning Paper • 2603.17863 • Published Mar 18 • 5
CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use Paper • 2602.12268 • Published Feb 12