Speed by Simplicity: A Single-Stream Architecture for Fast Audio-Video Generative Foundation Model Paper • 2603.21986 • Published 2 days ago • 99
Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models Paper • 2603.22212 • Published 2 days ago • 114
PEARL: Personalized Streaming Video Understanding Model Paper • 2603.20422 • Published 5 days ago • 35
SNAP: Speaker Nulling for Artifact Projection in Speech Deepfake Detection Paper • 2603.20686 • Published 4 days ago • 2
3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model Paper • 2603.18524 • Published 6 days ago • 54
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 9 days ago • 145
Learning from Synthetic Data Improves Multi-hop Reasoning Paper • 2603.02091 • Published 23 days ago • 1
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 29 days ago • 101
Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text Paper • 2601.22975 • Published Jan 30 • 110
DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation Paper • 2601.09688 • Published Jan 14 • 127