Hallucinations Undermine Trust; Metacognition is a Way Forward Paper • 2605.01428 • Published 7 days ago • 18
The Last Human-Written Paper: Agent-Native Research Artifacts Paper • 2604.24658 • Published 10 days ago • 19
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 9 days ago • 56
Synthetic Computers at Scale for Long-Horizon Productivity Simulation Paper • 2604.28181 • Published 9 days ago • 18
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents Paper • 2604.26752 • Published 10 days ago • 100
Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets Paper • 2604.22294 • Published 15 days ago • 17
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters Paper • 2408.03314 • Published Aug 6, 2024 • 67
LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics Paper • 2604.17295 • Published 20 days ago • 84
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 17 days ago • 239
TEMPO: Scaling Test-time Training for Large Reasoning Models Paper • 2604.19295 • Published 18 days ago • 34
GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification Paper • 2604.14258 • Published 24 days ago • 23
QuantCode-Bench: A Benchmark for Evaluating the Ability of Large Language Models to Generate Executable Algorithmic Trading Strategies Paper • 2604.15151 • Published 23 days ago • 15
MultiWorld: Scalable Multi-Agent Multi-View Video World Models Paper • 2604.18564 • Published 19 days ago • 45