Add model card for Depth Any Panoramas
#1 opened by nielsr (HF Staff)
README.md
ADDED
@@ -0,0 +1,98 @@
---
pipeline_tag: depth-estimation
license: apache-2.0
---

<h1 align="center">
Depth Any Panoramas:<br>
A Foundation Model for Panoramic Depth Estimation
</h1>

<p align="center">
<a href="https://linxin0.github.io"><b>Xin Lin</b></a> ·
<a href="#"><b>Meixi Song</b></a> ·
<a href="#"><b>Dizhe Zhang</b></a> ·
<a href="#"><b>Wenxuan Lu</b></a> ·
<a href="https://haodong2000.github.io"><b>Haodong Li</b></a>
<br>
<a href="#"><b>Bo Du</b></a> ·
<a href="#"><b>Ming-Hsuan Yang</b></a> ·
<a href="#"><b>Truong Nguyen</b></a> ·
<a href="http://luqi.info"><b>Lu Qi</b></a>
</p>

<p align="center">
<a href='https://huggingface.co/papers/2512.16913'><img src='https://img.shields.io/badge/Hugging%20Face-Paper-red?logo=huggingface&logoColor=white' alt='Hugging Face Paper'></a>
<a href='https://insta360-research-team.github.io/DAP_website/'><img src='https://img.shields.io/badge/Project_Page-Website-green?logo=insta360&logoColor=white' alt='Project Page'></a>
<a href='https://github.com/Insta360-Research-Team/DAP'><img src='https://img.shields.io/badge/GitHub-Code-blue?logo=github&logoColor=white' alt='GitHub'></a>
<a href='https://huggingface.co/spaces/Insta360-Research/DAP'><img src='https://img.shields.io/badge/🚀%20Hugging%20Face-Demo-orange'></a>
</p>

This repository presents **Depth Any Panoramas (DAP)**, a panoramic metric depth foundation model that generalizes across diverse scene distances. It explores a data-in-the-loop paradigm for both data construction and framework design, combining public datasets, high-quality synthetic data, and real panoramic images. The model adopts DINOv3-Large as its backbone and introduces a plug-and-play range mask head, sharpness-centric optimization, and geometry-centric optimization to improve robustness and keep geometry consistent across views. In experiments, DAP shows strong performance and zero-shot generalization, with robust and stable metric predictions in diverse real-world scenes.

More details can be found in the [paper](https://huggingface.co/papers/2512.16913) and on the [project page](https://insta360-research-team.github.io/DAP_website/).

## 🔨 Installation

Clone the repo first:

```Bash
git clone https://github.com/Insta360-Research-Team/DAP
cd DAP
```

(Optional) Create a fresh conda env:

```Bash
conda create -n dap python=3.12
conda activate dap
```

Install the required packages (torch > 2):

```Bash
# PyTorch (pick the build that matches your CUDA version; the code was tested with torch==2.7.1 and torchvision==0.22.1)
pip install torch==2.7.1 torchvision==0.22.1

# other dependencies
pip install -r requirements.txt
```

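After installation, a quick check can confirm that the environment matches the tested versions. The sketch below is not part of the DAP repository: it only reads installed package metadata through the Python standard library, and the helper name `check_versions` is made up for illustration.

```python
from importlib import metadata

# Versions this README reports as tested.
TESTED = {"torch": "2.7.1", "torchvision": "0.22.1"}


def check_versions(expected=TESTED):
    """Compare installed package versions against the tested ones.

    Returns {package: (installed_version_or_None, matches_expected)}.
    """
    report = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        # A local build tag such as "2.7.1+cu126" still counts as a match.
        matches = installed is not None and installed.split("+")[0] == wanted
        report[name] = (installed, matches)
    return report


if __name__ == "__main__":
    for name, (installed, ok) in check_versions().items():
        status = "ok" if ok else "differs from tested version"
        print(f"{name}: {installed or 'not installed'} ({status})")
```

A mismatch is not necessarily a problem, since the only stated requirement is torch > 2; it just means the setup differs from the tested one.
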
## 🤝 Pre-trained model

Download the pre-trained model from this Hugging Face repository: [Insta360-Research/DAP-weights](https://huggingface.co/Insta360-Research/DAP-weights).

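The weights can also be fetched from a script. This is a minimal sketch using `snapshot_download` from the `huggingface_hub` package (install it with `pip install huggingface_hub` if needed). The target directory `weights` and the helper name `download_dap_weights` are assumptions for illustration; where the DAP scripts expect the checkpoint is not stated here, so check the repository before relying on this path.

```python
REPO_ID = "Insta360-Research/DAP-weights"  # repository named above


def download_dap_weights(local_dir="weights"):
    """Download every file in the weights repo and return the local path.

    `local_dir` is an assumed location, not one documented by DAP.
    """
    # Imported lazily so this module loads even without huggingface_hub.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
```

Calling `download_dap_weights()` from the repository root downloads the files into `./weights` and returns that path.
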
## 📒 Inference

```Bash
python test/infer.py
```

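Because the model predicts metric depth for panoramas, a predicted depth value can be lifted to a 3D point once the pixel's viewing ray is known. The sketch below shows the standard equirectangular mapping for a single pixel. It rests on assumptions that this README does not confirm: that the depth map is stored in equirectangular layout, that each value is the distance in meters along the viewing ray, and that the axes are x right, y up, z forward. The function names are illustrative and do not come from the DAP code.

```python
import math


def pixel_to_ray(u, v, width, height):
    """Unit ray direction for pixel (u, v) of an equirectangular image.

    Assumed convention: x right, y up, z forward; the image centre
    looks along +z. Pixel centres sit at half-integer coordinates.
    """
    lon = ((u + 0.5) / width - 0.5) * 2.0 * math.pi  # longitude in (-pi, pi)
    lat = (0.5 - (v + 0.5) / height) * math.pi       # latitude in (-pi/2, pi/2)
    x = math.cos(lat) * math.sin(lon)
    y = math.sin(lat)
    z = math.cos(lat) * math.cos(lon)
    return (x, y, z)


def pixel_to_point(u, v, depth_m, width, height):
    """Scale the unit ray by a metric depth (distance along the ray)."""
    x, y, z = pixel_to_ray(u, v, width, height)
    return (x * depth_m, y * depth_m, z * depth_m)
```

Applying `pixel_to_point` to every pixel of a depth map yields a point cloud in the camera frame. If the depth were instead measured along a fixed axis, the scaling step would need to change.
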
## 🚀 Evaluation

```Bash
python test/eval.py
```

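The metrics reported by `test/eval.py` are not listed in this README. As a point of reference, the sketch below computes three metrics commonly used for depth estimation (absolute relative error, RMSE, and the δ₁ accuracy) over flat lists of values. Whether the script reports exactly these, and how it masks invalid pixels, is an assumption; the function name `depth_metrics` is illustrative.

```python
import math


def depth_metrics(pred, gt):
    """AbsRel, RMSE and delta_1 over pixels where both values are positive."""
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0 and p > 0]
    if not pairs:
        raise ValueError("no valid pixels to evaluate")
    n = len(pairs)
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)
    # delta_1: share of pixels with max(pred/gt, gt/pred) below 1.25.
    delta1 = sum(max(p / g, g / p) < 1.25 for p, g in pairs) / n
    return {"abs_rel": abs_rel, "rmse": rmse, "delta1": delta1}
```

Lower is better for `abs_rel` and `rmse`; higher is better for `delta1`.
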
## 🤝 Acknowledgement

We thank the authors of the following open-source projects:

* [PanDA](https://caozidong.github.io/PanDA_Depth/)
* [Depth-Anything-V2](https://github.com/DepthAnything/Depth-Anything-V2)

## Citation

If you find our work useful, please cite our paper:

```bibtex
@article{lin2025dap,
  title={Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation},
  author={Lin, Xin and Song, Meixi and Zhang, Dizhe and Lu, Wenxuan and Li, Haodong and Du, Bo and Yang, Ming-Hsuan and Nguyen, Truong and Qi, Lu},
  journal={arXiv},
  year={2025}
}
```