EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 17 days ago • 142
Diffusion Language Models are Super Data Learners Paper • 2511.03276 • Published Nov 5, 2025 • 132
From Harm to Help: Turning Reasoning In-Context Demos into Assets for Reasoning LMs Paper • 2509.23196 • Published Sep 27, 2025 • 9
Language Models Can Learn from Verbal Feedback Without Scalar Rewards Paper • 2509.22638 • Published Sep 26, 2025 • 70
Fostering Video Reasoning via Next-Event Prediction Paper • 2505.22457 • Published May 28, 2025 • 29
Reinforcing General Reasoning without Verifiers Paper • 2505.21493 • Published May 27, 2025 • 27
Optimizing Anytime Reasoning via Budget Relative Policy Optimization Paper • 2505.13438 • Published May 19, 2025 • 36
What's "up" with vision-language models? Investigating their struggle with spatial reasoning Paper • 2310.19785 • Published Oct 30, 2023 • 1
Understanding R1-Zero-Like Training: A Critical Perspective Paper • 2503.20783 • Published Mar 26, 2025 • 60
SkyLadder: Better and Faster Pretraining via Context Window Scheduling Paper • 2503.15450 • Published Mar 19, 2025 • 12
Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs Paper • 2502.12982 • Published Feb 18, 2025 • 19
Can Knowledge Editing Really Correct Hallucinations? Paper • 2410.16251 • Published Oct 21, 2024 • 55
From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning Paper • 2304.07995 • Published Apr 17, 2023 • 3
In-context Autoencoder for Context Compression in a Large Language Model Paper • 2307.06945 • Published Jul 13, 2023 • 29