WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation Paper • 2603.16871 • Published 2 days ago • 51
Mobile-GS: Real-time Gaussian Splatting for Mobile Devices Paper • 2603.11531 • Published 8 days ago • 9
MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents Paper • 2603.09827 • Published 9 days ago • 28
HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising Paper • 2603.08703 • Published 10 days ago • 31
Mode Seeking meets Mean Seeking for Fast Long Video Generation Paper • 2602.24289 • Published 20 days ago • 41
MolHIT: Advancing Molecular-Graph Generation with Hierarchical Discrete Diffusion Models Paper • 2602.17602 • Published 28 days ago • 56
RISE: Self-Improving Robot Policy with Compositional World Model Paper • 2602.11075 • Published Feb 11 • 29
Towards Bridging the Gap between Large-Scale Pretraining and Efficient Finetuning for Humanoid Control Paper • 2601.21363 • Published Jan 29 • 4
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis Paper • 2602.03139 • Published Feb 3 • 45
THINKSAFE: Self-Generated Safety Alignment for Reasoning Models Paper • 2601.23143 • Published Jan 30 • 39
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders Paper • 2601.16208 • Published Jan 22 • 55
Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance Paper • 2601.14171 • Published Jan 20 • 53