SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer Paper • 2605.15178 • Published 7 days ago • 80
Echo-Forcing: A Scene Memory Framework for Interactive Long Video Generation Paper • 2605.16003 • Published 6 days ago • 3
TrackCraft3R: Repurposing Video Diffusion Transformers for Dense 3D Tracking Paper • 2605.12587 • Published 9 days ago • 37
KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration Paper • 2605.14278 • Published 7 days ago • 37
Stop When Reasoning Converges: Semantic-Preserving Early Exit for Reasoning Models Paper • 2605.17672 • Published 4 days ago • 20
SkillsVote: Lifecycle Governance of Agent Skills from Collection, Recommendation to Evolution Paper • 2605.18401 • Published 3 days ago • 119
Quantitative Video World Model Evaluation for Geometric-Consistency Paper • 2605.15185 • Published 7 days ago • 3
Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 22 days ago • 40
PhyCo: Learning Controllable Physical Priors for Generative Motion Paper • 2604.28169 • Published 21 days ago • 13
Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling Paper • 2604.27039 • Published 22 days ago • 24
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows Paper • 2604.28139 • Published 21 days ago • 42
FAMA: Failure-Aware Meta-Agentic Framework for Open-Source LLMs in Interactive Tool Use Environments Paper • 2604.25135 • Published 23 days ago • 12
RADIO-ViPE: Online Tightly Coupled Multi-Modal Fusion for Open-Vocabulary Semantic SLAM in Dynamic Environments Paper • 2604.26067 • Published 23 days ago • 73