Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs Paper • 2601.17058 • Published 8 days ago • 151
Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory Paper • 2601.16296 • Published 7 days ago • 26
Paying Less Generalization Tax: A Cross-Domain Generalization Study of RL Training for LLM Agents Paper • 2601.18217 • Published 4 days ago • 8
Motion 3-to-4: 3D Motion Reconstruction for 4D Synthesis Paper • 2601.14253 • Published 9 days ago • 9
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published 14 days ago • 18
Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge Paper • 2601.08808 • Published 16 days ago • 38
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 15 days ago • 32
OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding Paper • 2601.09575 • Published 15 days ago • 25
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 18 days ago • 113
VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory Paper • 2601.08665 • Published 16 days ago • 8
Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization Paper • 2601.05432 • Published 21 days ago • 164
What Users Leave Unsaid: Under-Specified Queries Limit Vision-Language Models Paper • 2601.06165 • Published 23 days ago • 16
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 21 days ago • 211
NitroGen: An Open Foundation Model for Generalist Gaming Agents Paper • 2601.02427 • Published 25 days ago • 44
InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields Paper • 2601.03252 • Published 23 days ago • 100