Fantastic Reasoning Behaviors and Where to Find Them: Unsupervised Discovery of the Reasoning Process Paper • 2512.23988 • Published Dec 30, 2025 • 16
SpaceTimePilot: Generative Rendering of Dynamic Scenes Across Space and Time Paper • 2512.25075 • Published Dec 31, 2025 • 15
Guiding a Diffusion Transformer with the Internal Dynamics of Itself Paper • 2512.24176 • Published Dec 30, 2025 • 8
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published Dec 30, 2025 • 51
AdaGaR: Adaptive Gabor Representation for Dynamic Scene Reconstruction Paper • 2601.00796 • Published Jan 2, 2026 • 31
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning Paper • 2512.24146 • Published Dec 30, 2025 • 14
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 29 days ago • 146
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Paper • 2601.03044 • Published 29 days ago • 28
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 27 days ago • 218
The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models Paper • 2601.03425 • Published 29 days ago • 16
RelayLLM: Efficient Reasoning via Collaborative Decoding Paper • 2601.05167 • Published 27 days ago • 29
AgentOCR: Reimagining Agent History via Optical Self-Compression Paper • 2601.04786 • Published 28 days ago • 29
Over-Searching in Search-Augmented Large Language Models Paper • 2601.05503 • Published 27 days ago • 6
Lost in the Noise: How Reasoning Models Fail with Contextual Distractors Paper • 2601.07226 • Published 24 days ago • 32
Beyond Hard Masks: Progressive Token Evolution for Diffusion Language Models Paper • 2601.07351 • Published 24 days ago • 26
Dr. Zero: Self-Evolving Search Agents without Training Data Paper • 2601.07055 • Published 24 days ago • 20