APEX: Large-scale Multi-task Aesthetic-Informed Popularity Prediction for AI-Generated Music Paper • 2605.03395 • Published 4 days ago • 3
Lightning Unified Video Editing via In-Context Sparse Attention Paper • 2605.04569 • Published 3 days ago • 12
Efficient Training on Multiple Consumer GPUs with RoundPipe Paper • 2604.27085 • Published 10 days ago • 40
UniVidX: A Unified Multimodal Framework for Versatile Video Generation via Diffusion Priors Paper • 2605.00658 • Published 8 days ago • 80
ObjectClear: Complete Object Removal via Object-Effect Attention Paper • 2505.22636 • Published May 28, 2025 • 5
Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation Paper • 2604.25819 • Published 11 days ago • 17
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper • 2604.19254 • Published 18 days ago • 29
UltraShape 1.0: High-Fidelity 3D Shape Generation via Scalable Geometric Refinement Paper • 2512.21185 • Published Dec 24, 2025 • 32
StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition Paper • 2604.21689 • Published 16 days ago • 25
Seeing Fast and Slow: Learning the Flow of Time in Videos Paper • 2604.21931 • Published 16 days ago • 19
ReImagine: Rethinking Controllable High-Quality Human Video Generation via Image-First Synthesis Paper • 2604.19720 • Published 18 days ago • 3
Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation Paper • 2604.18168 • Published 19 days ago • 97
UniMesh: Unifying 3D Mesh Understanding and Generation Paper • 2604.17472 • Published 20 days ago • 11
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 18 days ago • 249
SmartPhotoCrafter: Unified Reasoning, Generation and Optimization for Automatic Photographic Image Editing Paper • 2604.19587 • Published 18 days ago • 46
CoInteract: Physically-Consistent Human-Object Interaction Video Synthesis via Spatially-Structured Co-Generation Paper • 2604.19636 • Published 18 days ago • 87
Speculative Decoding for Autoregressive Video Generation Paper • 2604.17397 • Published 20 days ago • 11
AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model Paper • 2604.19747 • Published 18 days ago • 39
Less Gaussians, Texture More: 4K Feed-Forward Textured Splatting Paper • 2603.25745 • Published Mar 26 • 16