MaineCoon: Pursuing A Real-Time Audio-Visual Social World Model Paper • 2606.17800 • Published 9 days ago • 13
iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance Paper • 2605.21431 • Published May 20 • 2
FashionChameleon: Towards Real-Time and Interactive Human-Garment Video Customization Paper • 2605.15824 • Published May 15 • 67
What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion Paper • 2605.07915 • Published May 8 • 8
What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion Paper • 2605.07915 • Published May 8 • 8
4DThinker: Thinking with 4D Imagery for Dynamic Spatial Understanding Paper • 2605.05997 • Published May 7 • 18
Continuous-Time Distribution Matching for Few-Step Diffusion Distillation Paper • 2605.06376 • Published May 7 • 27
Continuous-Time Distribution Matching for Few-Step Diffusion Distillation Paper • 2605.06376 • Published May 7 • 27
Lightning Unified Video Editing via In-Context Sparse Attention Paper • 2605.04569 • Published May 6 • 18
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published Apr 21 • 252
One-step Latent-free Image Generation with Pixel Mean Flows Paper • 2601.22158 • Published Jan 29 • 18
OpenSubject: Leveraging Video-Derived Identity and Diversity Priors for Subject-driven Image Generation and Manipulation Paper • 2512.08294 • Published Dec 9, 2025 • 18
OneThinker: All-in-one Reasoning Model for Image and Video Paper • 2512.03043 • Published Dec 2, 2025 • 35
SCA: Improve Semantic Consistent in Unrestricted Adversarial Attacks via DDPM Inversion Paper • 2410.02240 • Published Oct 3, 2024 • 1
SCA: Improve Semantic Consistent in Unrestricted Adversarial Attacks via DDPM Inversion Paper • 2410.02240 • Published Oct 3, 2024 • 1
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views Paper • 2510.18632 • Published Oct 21, 2025 • 23