metadata
license: other
license_name: snap-non-commercial-license
license_link: LICENSE
datasets:
  - allenai/objaverse
language:
  - en
pipeline_tag: image-to-3d

Model Details

GTR is a large 3D reconstruction model that takes multi-view images as input and generates high-quality meshes with faithful textures within seconds.

Model Description

Model Sources

How to Get Started with the Model

Installation

We recommend using Python>=3.10, PyTorch==2.7.0, and CUDA>=12.4.

conda create --name gtr python=3.10
conda activate gtr
pip install -U pip

pip install torch==2.7.0 torchvision==0.22.0 torchmetrics==1.2.1 --index-url https://download.pytorch.org/whl/cu124
pip install -U xformers --index-url https://download.pytorch.org/whl/cu124

pip install -r requirements.txt
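
After installation, it can help to confirm that pip actually resolved the pinned versions. The following is a minimal sketch using only the Python standard library. The helper names (`parse_version`, `installed_version`, `check_environment`) are illustrative assumptions and not part of GTR; the pins are copied from the commands above.

```python
import re
from importlib import metadata

# Pins copied from the install commands above.
EXPECTED = {"torch": "2.7.0", "torchvision": "0.22.0", "torchmetrics": "1.2.1"}


def parse_version(text):
    """Turn '2.7.0+cu124' into (2, 7, 0), ignoring any local '+...' suffix."""
    core = text.split("+", 1)[0]
    numbers = []
    for piece in core.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        numbers.append(int(match.group()))
    return tuple(numbers)


def installed_version(package):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def check_environment(expected=EXPECTED, lookup=installed_version):
    """Return a list of problems; an empty list means every pin matches."""
    problems = []
    for package, wanted in expected.items():
        found = lookup(package)
        if found is None:
            problems.append(f"{package} is not installed (expected {wanted})")
        elif parse_version(found) != parse_version(wanted):
            problems.append(f"{package} {found} found, expected {wanted}")
    return problems


if __name__ == "__main__":
    for line in check_environment() or ["all pinned packages match"]:
        print(line)
```

Run it inside the activated `gtr` environment. Any line it prints other than the success message names a package that is missing or differs from the pinned version.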

How to Use

Please follow the instructions here.

Demo

Demo Visuals

Citation

BibTeX:

@article{zhuang2024gtr,
  title={{GTR}: Improving Large {3D} Reconstruction Models through Geometry and Texture Refinement},
  author={Zhuang, Peiye and Han, Songfang and Wang, Chaoyang and Siarohin, Aliaksandr and Zou, Jiaxu and Vasilkovsky, Michael and Shakhrai, Vladislav and Korolev, Sergey and Tulyakov, Sergey and Lee, Hsin-Ying},
  journal={arXiv preprint arXiv:2406.05649},
  year={2024}
}