End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer Paper • 2605.00503 • Published 6 days ago • 8
From Context to Skills: Can Language Models Learn from Context Skillfully? Paper • 2604.27660 • Published 4 days ago • 136
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 13 days ago • 224
A Systematic Study of Cross-Modal Typographic Attacks on Audio-Visual Reasoning Paper • 2604.03995 • Published Apr 5 • 4
MARS: Enabling Autoregressive Models Multi-Token Generation Paper • 2604.07023 • Published 29 days ago • 38
HY-Embodied-0.5: Embodied Foundation Models for Real-World Agents Paper • 2604.07430 • Published 29 days ago • 187
CUE-R: Beyond the Final Answer in Retrieval-Augmented Generation Paper • 2604.05467 • Published about 1 month ago • 7
VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training Paper • 2602.10693 • Published Feb 11 • 220
The Trinity of Consistency as a Defining Principle for General World Models Paper • 2602.23152 • Published Feb 26 • 201