R3 / README.md
KevinXu02's picture
Update README.md
c1f2aec verified
---
license: cc-by-nc-4.0
base_model: Depth-Anything/Depth-Anything-3
tags:
- 3d-reconstruction
- visual-slam
- pose-estimation
- depth-estimation
- computer-vision
---
# R³: 3D Reconstruction via Relative Regression
## Model Description
R³ is a feed-forward model for long-video 3D reconstruction via confidence-weighted relative pose regression.
Built on top of the Depth Anything 3 (DA3) backbone, R³ predicts pairwise relative camera poses and dense geometry from arbitrarily long video streams. Instead of directly regressing all cameras in a global coordinate frame, R³ estimates confidence-weighted relative transformations between image pairs and assembles a globally consistent trajectory downstream.
With approximately 372M parameters, R³ achieves competitive or state-of-the-art performance on camera pose estimation and dense reconstruction benchmarks while remaining significantly smaller than recent billion-parameter alternatives.
## License
This model is a derivative work based on the Depth Anything 3 (DA3) model.
The original DA3 model is released under the CC BY-NC 4.0 license. Accordingly, this model is also distributed under the CC BY-NC 4.0 license and may only be used in accordance with the terms of that license.
Commercial use is not permitted without obtaining appropriate permissions from the original rights holders.
## Model Details
- **Model Type:** 3D Reconstruction / Visual Localization
- **Backbone:** Depth Anything 3 (DA3)
- **Parameters:** ~372M
- **Task:** Relative Pose Regression and Dense 3D Reconstruction
## Citation
If you use this model in your research, please cite the R³ paper.
```bibtex
@misc{xu2026r3,
title={$R^3$: 3D Reconstruction via Relative Regression},
author={Congrong Xu and Huachen Gao and Xingyu Chen and Yuliang Xiu and Jun Gao and Anpei Chen},
year={2026},
}
```