Mobile-GS: Real-time Gaussian Splatting for Mobile Devices Paper • 2603.11531 • Published 1 day ago • 6
Automatic Generation of High-Performance RL Environments Paper • 2603.12145 • Published about 23 hours ago • 3
DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning Paper • 2603.12257 • Published about 22 hours ago • 22
Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning Paper • 2603.11653 • Published 1 day ago • 1
XSkill: Continual Learning from Experience and Skills in Multimodal Agents Paper • 2603.12056 • Published 1 day ago • 7
ShotVerse: Advancing Cinematic Camera Control for Text-Driven Multi-Shot Video Creation Paper • 2603.11421 • Published 1 day ago • 14
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation Paper • 2603.12247 • Published about 22 hours ago • 20
One Model, Many Budgets: Elastic Latent Interfaces for Diffusion Transformers Paper • 2603.12245 • Published about 22 hours ago • 9
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published about 23 hours ago • 34
EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation Paper • 2603.12267 • Published about 22 hours ago • 9
Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge Paper • 2603.11665 • Published 1 day ago • 2
Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training Paper • 2603.12246 • Published about 22 hours ago • 3
EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models Paper • 2603.12252 • Published about 22 hours ago • 8
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use Paper • 2603.11076 • Published 3 days ago • 4
OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams Paper • 2603.12265 • Published about 22 hours ago • 8
WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing Paper • 2603.11593 • Published 1 day ago • 15