Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 6 days ago • 136
Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory Paper • 2601.16296 • Published 6 days ago • 24
Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents Paper • 2601.18217 • Published 2 days ago • 8
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper • 2601.14253 • Published 8 days ago • 9
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published 13 days ago • 18
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge Paper • 2601.08808 • Published 15 days ago • 38
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 14 days ago • 32
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding Paper • 2601.09575 • Published 14 days ago • 25
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 16 days ago • 112
VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory Paper • 2601.08665 • Published 15 days ago • 8
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published 20 days ago • 163
What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models Paper • 2601.06165 • Published 22 days ago • 16
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 20 days ago • 210
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 24 days ago • 43
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 22 days ago • 99
Nested Learning: The Illusion of Deep Learning Architectures Paper • 2512.24695 • Published 28 days ago • 41