V-JEPA 2.1: Unlocking Dense Features in Video Self-Supervised Learning Paper • 2603.14482 • Published 5 days ago • 10
EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing Paper • 2603.19224 • Published about 24 hours ago • 15
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models Paper • 2603.18002 • Published 2 days ago • 3
Grounding World Simulation Models in a Real-World Metropolis Paper • 2603.15583 • Published 4 days ago • 136
3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model Paper • 2603.18524 • Published 1 day ago • 38
Memory-V2V: Augmenting Video-to-Video Diffusion Models with Memory Paper • 2601.16296 • Published Jan 22 • 28
Multi-view Pyramid Transformer: Look Coarser to See Broader Paper • 2512.07806 • Published Dec 8, 2025 • 21
Latent Diffusion Model without Variational Autoencoder Paper • 2510.15301 • Published Oct 17, 2025 • 50
Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene Representation Paper • 2505.13215 • Published May 19, 2025 • 29
Sequence Matters: Harnessing Video Models in 3D Super-Resolution Paper • 2412.11525 • Published Dec 16, 2024 • 11
CodecNeRF: Toward Fast Encoding and Decoding, Compact, and High-quality Novel-view Synthesis Paper • 2404.04913 • Published Apr 7, 2024 • 3
DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance Paper • 2406.18459 • Published Jun 26, 2024 • 2
SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting Paper • 2411.17190 • Published Nov 26, 2024 • 15
MetaFormer: High-fidelity Metalens Imaging via Aberration Correcting Transformers Paper • 2412.04591 • Published Dec 5, 2024 • 9
Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction Paper • 2412.06234 • Published Dec 9, 2024 • 19
PIG: Physics-Informed Gaussians as Adaptive Parametric Mesh Representations Paper • 2412.05994 • Published Dec 8, 2024 • 19