-
Neural-Driven Image Editing
Paper • 2507.05397 • Published • 27 -
π^3: Scalable Permutation-Equivariant Visual Geometry Learning
Paper • 2507.13347 • Published • 67 -
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
Paper • 2507.10065 • Published • 25 -
From One to More: Contextual Part Latents for 3D Generation
Paper • 2507.08772 • Published • 26
Xander R
xrong
·
AI & ML interests
None yet
Organizations
None yet
Foundation
-
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper • 2409.18124 • Published • 33 -
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Paper • 2409.18125 • Published • 34 -
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Paper • 2410.11795 • Published • 18 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90
inbox
-
Neural-Driven Image Editing
Paper • 2507.05397 • Published • 27 -
π^3: Scalable Permutation-Equivariant Visual Geometry Learning
Paper • 2507.13347 • Published • 67 -
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
Paper • 2507.10065 • Published • 25 -
From One to More: Contextual Part Latents for 3D Generation
Paper • 2507.08772 • Published • 26
Foundation
-
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper • 2409.18124 • Published • 33 -
LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness
Paper • 2409.18125 • Published • 34 -
Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices
Paper • 2410.11795 • Published • 18 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90
models 0
None public yet
datasets 0
None public yet