Add model card for NAS3R
#1
by nielsr HF Staff - opened
README.md
ADDED
|
@@ -0,0 +1,60 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
pipeline_tag: image-to-3d
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+
# NAS3R: From None to All: Self-Supervised 3D Reconstruction via Novel View Synthesis
|
| 6 |
+
|
| 7 |
+
**NAS3R** is a self-supervised feed-forward framework that jointly learns explicit 3D geometry and camera parameters with no ground-truth annotations and no pretrained priors.
|
| 8 |
+
|
| 9 |
+
- **Paper:** [From None to All: Self-Supervised 3D Reconstruction via Novel View Synthesis](https://huggingface.co/papers/2603.27455)
|
| 10 |
+
- **Project Page:** [https://ranrhuang.github.io/nas3r/](https://ranrhuang.github.io/nas3r/)
|
| 11 |
+
- **GitHub:** [https://github.com/ranrhuang/NAS3R](https://github.com/ranrhuang/NAS3R)
|
| 12 |
+
|
| 13 |
+

|
| 14 |
+
|
| 15 |
+
## Installation
|
| 16 |
+
|
| 17 |
+
1. Clone NAS3R:
|
| 18 |
+
```bash
|
| 19 |
+
git clone --recurse-submodules https://github.com/ranrhuang/NAS3R.git
|
| 20 |
+
cd NAS3R
|
| 21 |
+
```
|
| 22 |
+
|
| 23 |
+
2. Create the environment (example using conda):
|
| 24 |
+
```bash
|
| 25 |
+
conda create -n nas3r python=3.11 -y
|
| 26 |
+
conda activate nas3r
|
| 27 |
+
pip install torch==2.5.1 torchvision==0.20.1 --index-url https://download.pytorch.org/whl/cu121
|
| 28 |
+
pip install -r requirements.txt
|
| 29 |
+
pip install -e submodules/diff-gaussian-rasterization
|
| 30 |
+
```
|
| 31 |
+
|
| 32 |
+
## Pre-trained Checkpoints
|
| 33 |
+
|
| 34 |
+
| Model name | Training resolutions | Training data | Training settings |
|
| 35 |
+
|:---:|:---:|:---:|:---:|
|
| 36 |
+
| [re10k_nas3r.ckpt](https://huggingface.co/RanranHuang/NAS3R/resolve/main/re10k_nas3r.ckpt) | 256x256 | re10k | RE10K, 2 views |
|
| 37 |
+
|
| 38 |
+
## Usage: Evaluation
|
| 39 |
+
|
| 40 |
+
To perform Novel View Synthesis and Pose Estimation on NAS3R (VGGT-based architecture) using the RealEstate10K dataset:
|
| 41 |
+
|
| 42 |
+
```bash
|
| 43 |
+
# Assuming the weight is downloaded to ./checkpoints/re10k_nas3r.ckpt
|
| 44 |
+
python -m src.main +experiment=nas3r/random/re10k mode=test wandb.name=re10k \
|
| 45 |
+
dataset/view_sampler@dataset.re10k.view_sampler=evaluation \
|
| 46 |
+
dataset.re10k.view_sampler.index_path=assets/evaluation_index_re10k.json \
|
| 47 |
+
checkpointing.load=./checkpoints/re10k_nas3r.ckpt \
|
| 48 |
+
test.save_image=false
|
| 49 |
+
```
|
| 50 |
+
|
| 51 |
+
## Citation
|
| 52 |
+
|
| 53 |
+
```bibtex
|
| 54 |
+
@article{huang2026nas3r,
|
| 55 |
+
title={From None to All: Self-Supervised 3D Reconstruction via Novel View Synthesis},
|
| 56 |
+
author={Ranran Huang and Weixun Luo and Ye Mao and Krystian Mikolajczyk},
|
| 57 |
+
journal={arXiv preprint arXiv: 2603.27455},
|
| 58 |
+
year={2026}
|
| 59 |
+
}
|
| 60 |
+
```
|