Learning to Predict Future-Aligned Research Proposals with Language Models Paper • 2603.27146 • Published Apr 6 • 5
Retrieval is Cheap, Show Me the Code: Executable Multi-Hop Reasoning for Retrieval-Augmented Generation Paper • 2605.12975 • Published 1 day ago • 7
Useful Memories Become Faulty When Continuously Updated by LLMs Paper • 2605.12978 • Published 1 day ago • 14
Steer2Adapt: Dynamically Composing Steering Vectors Elicits Efficient Adaptation of LLMs Paper • 2602.07276 • Published Feb 7 • 11
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger Paper • 2602.08222 • Published Feb 9 • 290
ReasonFlux-PRM: Trajectory-Aware PRMs for Long Chain-of-Thought Reasoning in LLMs Paper • 2506.18896 • Published Jun 23, 2025 • 29
s3: You Don't Need That Much Data to Train a Search Agent via RL Paper • 2505.14146 • Published May 20, 2025 • 20