vulus98 commited on
Commit
16e2669
·
verified ·
1 Parent(s): 968832f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +58 -3
README.md CHANGED
@@ -1,3 +1,58 @@
1
- ---
2
- license: creativeml-openrail-m
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: creativeml-openrail-m
3
+ ---
4
+ ---
5
+ language:
6
+ - en
7
+ license: apache-2.0
8
+ tags:
9
+ - depth estimation
10
+ - panoramic images
11
+ - high resolution
12
+ - image analysis
13
+ - computer vision
14
+ - in-the-wild
15
+ - zero-shot
16
+ ---
17
+
18
+ <h1 align="center">PaGeR(Panoramic Geometry Estimation) depth estimation Model Card</h1>
19
+
20
+ <p align="center">
21
+ <a title="Github" href="https://github.com/prs-eth/marigold" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
22
+ <img src="https://img.shields.io/github/stars/prs-eth/marigold?label=GitHub%20%E2%98%85&logo=github&color=C8C" alt="Github">
23
+ </a>
24
+ <a title="Website" href="https://marigoldcomputervision.github.io/" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
25
+ <img src="https://img.shields.io/badge/%E2%99%A5%20Project%20-Website-blue" alt="Website">
26
+ </a>
27
+ <a title="arXiv" href="https://arxiv.org/abs/2505.09358" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
28
+ <img src="https://img.shields.io/badge/%F0%9F%93%84%20Read%20-Paper-AF3436" alt="arXiv">
29
+ </a>
30
+ <a title="Social" href="https://twitter.com/antonobukhov1" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
31
+ <img src="https://img.shields.io/twitter/follow/:?label=Subscribe%20for%20updates!" alt="Social">
32
+ </a>
33
+ <a title="License" href="https://www.apache.org/licenses/LICENSE-2.0" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
34
+ <img src="https://img.shields.io/badge/License-Apache--2.0-929292" alt="License">
35
+ </a>
36
+ </p>
37
+
38
+ This is a model card for the `PaGeR-depth` model for monocular depth estimation from a single panoramic [ERP](https://en.wikipedia.org/wiki/Equirectangular_projection) image.
39
+ The model is fine-tuned from the `marigold-e2e-ft` [model](https://huggingface.co/GonzaloMG/marigold-e2e-ft-depth) as
40
+ described in our paper:
41
+ - [CVPR'2026 paper(hopefully)](paper link) titled "Panorama Geometry Estimation using Single-Step Diffusion Models"
42
+
43
+ ## Model Details
44
+ - **Developed by:** [Vukasin Bozic](http://www.kebingxin.com/), [Kevin Qu](https://ch.linkedin.com/in/kevin-qu-b3417621b), [Tianfu Wang](https://tianfwang.github.io/), [Nando Metzger](https://nandometzger.github.io/), [Shengyu Huang](https://shengyuh.github.io/), [Bo Li](https://www.linkedin.com/in/bobboli0202), [Anton Obukhov](https://www.obukhov.ai/), [Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ).
45
+ - **Model type:** Generative latent diffusion-based one-step scale-invariant and metric monocular panoramic depth estimation from a single ERP image.
46
+ - **Language:** English.
47
+ - **License:** [Apache License License Version 2.0](https://www.apache.org/licenses/LICENSE-2.0).
48
+ - **Model Description:** This model can be used to generate an estimated depth map of a panoramic input image.
49
+ - **Resolution**: The model is designed to support large resolutions up to 3K.
50
+ - **Steps and scheduler**: This model works in a swift, one-step regime.
51
+ - **Outputs**:
52
+ - **Scale-invariant depth map**: The predicted values represent a depth map, interpolating between the near and far planes of the model's choice.
53
+ - **Resources for more information:** [Project Website](insert link), [Paper](insert link), [Code](https://github.com/prs-eth/PaGeR).
54
+ - **Cite as:**
55
+
56
+ ```bibtex
57
+ Add citation
58
+ ```