EvoArena: Tracking Memory Evolution for Robust LLM Agents in Dynamic Environments Paper • 2606.13681 • Published 14 days ago • 140
SENSE: Satellite-based ENergy Synthesis for Sustainable Environment Paper • 2605.18101 • Published May 18 • 13
PEEK: Context Map as an Orientation Cache for Long-Context LLM Agents Paper • 2605.19932 • Published May 19 • 7
EquiformerV3: Scaling Efficient, Expressive, and General SE(3)-Equivariant Graph Attention Transformers Paper • 2604.09130 • Published Apr 10 • 4
Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models Paper • 2603.24844 • Published Mar 25 • 10
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights Paper • 2603.12228 • Published Mar 12 • 12
AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games Paper • 2602.17594 • Published Feb 19 • 9
How Much Reasoning Do Retrieval-Augmented Models Add beyond LLMs? A Benchmarking Framework for Multi-Hop Inference over Hybrid Knowledge Paper • 2602.10210 • Published Feb 10 • 1
Stemphonic: All-at-once Flexible Multi-stem Music Generation Paper • 2602.09891 • Published Feb 10 • 2
Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning Paper • 2601.09667 • Published Jan 14 • 92
Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs Paper • 2601.08763 • Published Jan 13 • 150
FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos Paper • 2512.10927 • Published Dec 11, 2025 • 6
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature Paper • 2406.07835 • Published Jun 10, 2024 • 2
SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models Paper • 2510.09541 • Published Oct 10, 2025 • 18
DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research Paper • 2511.19399 • Published Nov 24, 2025 • 63
Learning to Interpret Weight Differences in Language Models Paper • 2510.05092 • Published Oct 6, 2025 • 1