File size: 3,667 Bytes
185cd74 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 |
---
pipeline_tag: depth-estimation
license: apache-2.0
---
<h1 align="center">
Depth Any Panoramas:<br>
A Foundation Model for Panoramic Depth Estimation
</h1>
<p align="center">
<a href="https://linxin0.github.io"><b>Xin Lin</b></a> 路
<a href="#"><b>Meixi Song</b></a> 路
<a href="#"><b>Dizhe Zhang</b></a> 路
<a href="#"><b>Wenxuan Lu</b></a> 路
<a href="https://haodong2000.github.io"><b>Haodong Li</b></a>
<br>
<a href="#"><b>Bo Du</b></a> 路
<a href="#"><b>Ming-Hsuan Yang</b></a> 路
<a href="#"><b>Truong Nguyen</b></a> 路
<a href="http://luqi.info"><b>Lu Qi</b></a>
</p>
<p align="center">
<a href='https://huggingface.co/papers/2512.16913'><img src='https://img.shields.io/badge/Hugging%20Face-Paper-red?logo=huggingface&logoColor=white' alt='Hugging Face Paper'></a>
<a href='https://insta360-research-team.github.io/DAP_website/'><img src='https://img.shields.io/badge/Project_Page-Website-green?logo=insta360&logoColor=white' alt='Project Page'></a>
<a href='https://github.com/Insta360-Research-Team/DAP'><img src='https://img.shields.io/badge/GitHub-Code-blue?logo=github&logoColor=white' alt='GitHub'></a>
<a href='https://huggingface.co/spaces/Insta360-Research/DAP'><img src='https://img.shields.io/badge/馃殌%20Hugging%20Face-Demo-orange'></a>
</p>

This repository presents **Depth Any Panoramas (DAP)**, a panoramic metric depth foundation model that generalizes across diverse scene distances. It explores a data-in-the-loop paradigm for both data construction and framework design, combining public datasets, high-quality synthetic data, and real panoramic images. The model adopts DINOv3-Large as its backbone and introduces innovations such as a plug-and-play range mask head, sharpness-centric optimization, and geometry-centric optimization to enhance robustness and ensure geometric consistency across views. Experiments demonstrate strong performance and zero-shot generalization, providing robust and stable metric predictions in diverse real-world scenes.
More details can be found in the [paper](https://huggingface.co/papers/2512.16913) and on the [project page](https://insta360-research-team.github.io/DAP_website/).
## 馃敤 Installation
Clone the repo first:
```Bash
git clone https://github.com/Insta360-Research-Team/DAP
cd DAP
```
(Optional) Create a fresh conda env:
```Bash
conda create -n dap python=3.12
conda activate dap
```
Install necessary packages (torch > 2):
```Bash
# pytorch (select correct CUDA version, we test our code on torch==2.7.1 and torchvision==0.22.1)
pip install torch==2.7.1 torchvision==0.22.1
# other dependencies
pip install -r requirements.txt
```
## 馃 Pre-trained model
Please download the pretrained model from this Hugging Face repository: [Insta360-Research/DAP-weights](https://huggingface.co/Insta360-Research/DAP-weights).
## 馃搾 Inference
```Bash
python test/infer.py
```
## 馃殌 Evaluation
```Bash
python test/eval.py
```
## 馃 Acknowledgement
We appreciate the open source of the following projects:
* [PanDA](https://caozidong.github.io/PanDA_Depth/)
* [Depth-Anything-V2](https://github.com/DepthAnything/Depth-Anything-V2)
## Citation
If you find our work useful, please cite our paper:
```bibtex
@article{lin2025dap,
title={Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation},
author={Lin, Xin and Song, Meixi and Zhang, Dizhe and Lu, Wenxuan and Li, Haodong and Du, Bo and Yang, Ming-Hsuan and Nguyen, Truong and Qi, Lu},
journal={arXiv},
year={2025}
}
``` |