---
license: other
license_name: nvidia-oneway-noncommercial
license_link: https://github.com/nv-dvl/vgg-ttt/blob/main/LICENSE.txt
library_name: vggttt
pipeline_tag: image-to-3d
tags:
- vggt
- 3d-reconstruction
- pointmap
- depth-estimation
- camera-pose
base_model: facebook/VGGT-1B
---
# Model Overview
### Description
VGG-T³ reconstructs 3D geometry and camera parameters from image collections and videos. The model scales linearly with the number of input images, which significantly accelerates reconstruction when dealing with a large number of images or long videos.
This model is for research and development only.
### License/Terms of Use
This model is released under the [NVIDIA OneWay Noncommercial License](https://github.com/nv-dvl/vgg-ttt/blob/main/LICENSE.txt). It may only be used for non-commercial research or educational purposes.
### Deployment Geography
Global
### Use Case
Primary users:
- Computer Vision Researchers: For benchmarking 3D reconstruction and developing neural rendering pipelines.
- Augmented Reality/Virtual Reality & Robotics Engineers: For implementing real-time simultaneous localization and mapping (SLAM), scene understanding, and navigation systems.
- 3D Content Creators: For rapid conversion of video/images into 3D assets.
Primary Use Cases
- 3D Reconstruction: Fast, feed-forward estimation of sparse 3D geometry and camera poses from unposed images or video without iterative optimization.
- Structure-from-motion (SfM) Replacement: Accelerating initialization for 3D Gaussian Splatting and Neural Radiance Field (NeRF) training by replacing slow SfM (e.g., COLMAP) steps.
- Robotic Perception: Instant inference of "pointmaps" (pixel-to-3D coordinates) for manipulation and spatial awareness.
### Release Date
GitHub 05/25/2026 via https://github.com/nv-dvl/vgg-ttt