Image-to-Image
MVGenMaster / README.md
nielsr's picture
nielsr HF Staff
Add model card for GaMO
2759acc verified
|
raw
history blame
1.38 kB
metadata
license: apache-2.0
pipeline_tag: image-to-image

GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction

Project Page | ArXiv | GitHub

GaMO (Geometry-aware Multi-view Outpainter) is a framework that reformulates sparse-view reconstruction through multi-view outpainting. Instead of generating new viewpoints, GaMO expands the field of view from existing camera poses, which inherently preserves geometric consistency while providing broader scene coverage.

Our approach employs multi-view conditioning and geometry-aware denoising strategies in a zero-shot manner without training. Extensive experiments on Replica and ScanNet++ demonstrate state-of-the-art reconstruction quality across 3, 6, and 9 input views, outperforming prior methods in PSNR and LPIPS, while achieving a 25× speedup over SOTA diffusion-based methods.

Citation

If you find this work useful, please consider citing:

@article{huang2025gamo,
  title={GaMO: Geometry-aware Multi-view Diffusion Outpainting for Sparse-View 3D Reconstruction},
  author={Huang, Yi-Chuan and Chien, Hao-Jen and Lin, Chin-Yang and Chen, Ying-Huan and Liu, Yu-Lun},
  journal={arXiv preprint arXiv:2512.25073},
  year={2025}
}