nielsr HF Staff committed on
Commit
185cd74
verified
1 Parent(s): bc92bec

Add model card for Depth Any Panoramas

This PR adds a comprehensive model card for the "Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation" model.

It includes:
- Relevant metadata: `pipeline_tag: depth-estimation` and `license: apache-2.0`.
- A concise model description based on the paper abstract.
- Links and badges to the Hugging Face paper page ([2512.16913](https://huggingface.co/papers/2512.16913)), the project page (`https://insta360-research-team.github.io/DAP_website/`), the GitHub repository (`https://github.com/Insta360-Research-Team/DAP`), and the Hugging Face demo (`https://huggingface.co/spaces/Insta360-Research/DAP`).
- Detailed installation instructions.
- Information on downloading pre-trained models.
- Sample usage for inference and evaluation.
- Acknowledgements and citation information.

Please review and merge this PR to enhance the discoverability and usability of this model.

Files changed (1)
  1. README.md +98 -0
README.md ADDED
@@ -0,0 +1,98 @@
1
+ ---
2
+ pipeline_tag: depth-estimation
3
+ license: apache-2.0
4
+ ---
5
+
6
+ <h1 align="center">
7
+ Depth Any Panoramas:<br>
8
+ A Foundation Model for Panoramic Depth Estimation
9
+ </h1>
10
+
11
+ <p align="center">
12
+ <a href="https://linxin0.github.io"><b>Xin Lin</b></a> ·
13
+ <a href="#"><b>Meixi Song</b></a> ·
14
+ <a href="#"><b>Dizhe Zhang</b></a> ·
15
+ <a href="#"><b>Wenxuan Lu</b></a> ·
16
+ <a href="https://haodong2000.github.io"><b>Haodong Li</b></a>
17
+ <br>
18
+ <a href="#"><b>Bo Du</b></a> ·
19
+ <a href="#"><b>Ming-Hsuan Yang</b></a> ·
20
+ <a href="#"><b>Truong Nguyen</b></a> ·
21
+ <a href="http://luqi.info"><b>Lu Qi</b></a>
22
+ </p>
23
+
24
+
25
+ <p align="center">
26
+ <a href='https://huggingface.co/papers/2512.16913'><img src='https://img.shields.io/badge/Hugging%20Face-Paper-red?logo=huggingface&logoColor=white' alt='Hugging Face Paper'></a>
27
+ <a href='https://insta360-research-team.github.io/DAP_website/'><img src='https://img.shields.io/badge/Project_Page-Website-green?logo=insta360&logoColor=white' alt='Project Page'></a>
28
+ <a href='https://github.com/Insta360-Research-Team/DAP'><img src='https://img.shields.io/badge/GitHub-Code-blue?logo=github&logoColor=white' alt='GitHub'></a>
29
+ <a href='https://huggingface.co/spaces/Insta360-Research/DAP'><img src='https://img.shields.io/badge/🚀%20Hugging%20Face-Demo-orange'></a>
30
+ </p>
31
+
32
+ ![teaser](https://huggingface.co/Insta360-Research/DAP/resolve/main/assets/depth_teaser2_00.png)
33
+
34
+ This repository presents **Depth Any Panoramas (DAP)**, a panoramic metric depth foundation model that generalizes across diverse scene distances. It explores a data-in-the-loop paradigm for both data construction and framework design, combining public datasets, high-quality synthetic data, and real panoramic images. The model adopts DINOv3-Large as its backbone and introduces innovations such as a plug-and-play range mask head, sharpness-centric optimization, and geometry-centric optimization to enhance robustness and ensure geometric consistency across views. Experiments demonstrate strong performance and zero-shot generalization, providing robust and stable metric predictions in diverse real-world scenes.
35
+
36
+ More details can be found in the [paper](https://huggingface.co/papers/2512.16913) and on the [project page](https://insta360-research-team.github.io/DAP_website/).
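
To make "panoramic metric depth" concrete, here is a minimal sketch of how one pixel of an equirectangular depth map could be lifted to a 3D point. The function name, the axis orientation, and the reading of depth as radial distance in metres are illustrative assumptions, not taken from the DAP code; check the repository for the conventions it actually uses.

```python
import math


def equirect_pixel_to_point(u, v, depth, width, height):
    """Map pixel (u, v) of an equirectangular depth map to a 3D point.

    Assumed conventions (hypothetical, not taken from the DAP code):
    - depth is the radial distance from the camera centre, in metres;
    - longitude runs from -pi (left edge) to +pi (right edge);
    - latitude runs from +pi/2 (top edge) to -pi/2 (bottom edge);
    - x points right, y points up, z points forward.
    """
    # Sample at the pixel centre, hence the +0.5 offsets.
    lon = ((u + 0.5) / width - 0.5) * 2.0 * math.pi
    lat = (0.5 - (v + 0.5) / height) * math.pi

    # Spherical-to-Cartesian conversion scaled by the metric depth.
    x = depth * math.cos(lat) * math.sin(lon)
    y = depth * math.sin(lat)
    z = depth * math.cos(lat) * math.cos(lon)
    return (x, y, z)
```

Under these assumptions every returned point lies exactly `depth` away from the origin, which is a quick sanity check when building a point cloud from a predicted depth map.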
37
+
38
+ ## 🔨 Installation
39
+
40
+ Clone the repo first:
41
+
42
+ ```Bash
43
+ git clone https://github.com/Insta360-Research-Team/DAP
44
+ cd DAP
45
+ ```
46
+
47
+ (Optional) Create a fresh conda env:
48
+
49
+ ```Bash
50
+ conda create -n dap python=3.12
51
+ conda activate dap
52
+ ```
53
+
54
+ Install necessary packages (torch > 2):
55
+
56
+ ```Bash
57
+ # PyTorch: pick the build that matches your CUDA version (tested with torch==2.7.1 and torchvision==0.22.1)
58
+ pip install torch==2.7.1 torchvision==0.22.1
59
+
60
+ # other dependencies
61
+ pip install -r requirements.txt
62
+ ```
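
After installing, it can help to confirm that the environment matches the tested versions. The following is a small sketch using only the Python standard library; the helper name `check_versions` is hypothetical, and the version numbers are the ones quoted in the install step above.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions the README reports testing against.
TESTED_VERSIONS = {"torch": "2.7.1", "torchvision": "0.22.1"}


def check_versions(expected):
    """Return {package: (installed_version_or_None, matches_expected)}."""
    report = {}
    for name, wanted in expected.items():
        try:
            installed = version(name)
        except PackageNotFoundError:
            installed = None
        # Local build tags such as "+cu126" are ignored in the comparison.
        matches = installed is not None and installed.split("+")[0] == wanted
        report[name] = (installed, matches)
    return report


for name, (installed, ok) in check_versions(TESTED_VERSIONS).items():
    print(f"{name}: installed={installed} tested={TESTED_VERSIONS[name]} match={ok}")
```

A mismatch is not necessarily an error, since the README only requires torch > 2; the check simply flags where the environment differs from the tested setup.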
63
+
64
+ ## Pre-trained model
65
+
66
+ Please download the pretrained model from this Hugging Face repository: [Insta360-Research/DAP-weights](https://huggingface.co/Insta360-Research/DAP-weights).
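
One possible way to fetch the weights programmatically is `snapshot_download` from the `huggingface_hub` package. This is a sketch, not the authors' documented procedure: the helper name `download_weights` and the `weights` target directory are hypothetical, and the checkpoint location the DAP scripts expect is not stated here.

```python
WEIGHTS_REPO = "Insta360-Research/DAP-weights"  # repository named in the README


def download_weights(local_dir="weights"):
    """Download every file in the weights repository and return the local path.

    Assumption: `local_dir` is a hypothetical location; check the DAP
    scripts for the checkpoint path they actually expect.
    """
    # Imported lazily so this module loads even without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=WEIGHTS_REPO, local_dir=local_dir)
```

Calling `download_weights()` requires `pip install huggingface_hub` and network access; it returns the directory containing the downloaded files.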
67
+
68
+ ## 📒 Inference
69
+
70
+ ```Bash
71
+ python test/infer.py
72
+ ```
73
+
74
+ ## 🚀 Evaluation
75
+
76
+ ```Bash
77
+ python test/eval.py
78
+ ```
79
+
80
+ ## Acknowledgement
81
+
82
+ We thank the authors of the following open-source projects:
83
+
84
+ * [PanDA](https://caozidong.github.io/PanDA_Depth/)
85
+ * [Depth-Anything-V2](https://github.com/DepthAnything/Depth-Anything-V2)
86
+
87
+ ## Citation
88
+
89
+ If you find our work useful, please cite our paper:
90
+
91
+ ```bibtex
92
+ @article{lin2025dap,
93
+ title={Depth Any Panoramas: A Foundation Model for Panoramic Depth Estimation},
94
+ author={Lin, Xin and Song, Meixi and Zhang, Dizhe and Lu, Wenxuan and Li, Haodong and Du, Bo and Yang, Ming-Hsuan and Nguyen, Truong and Qi, Lu},
95
+ journal={arXiv preprint arXiv:2512.16913},
96
+ year={2025}
97
+ }
98
+ ```