vulus98 commited on
Commit
57779cb
·
verified ·
1 Parent(s): f19dd6e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -3
README.md CHANGED
@@ -1,3 +1,55 @@
1
- ---
2
- license: creativeml-openrail-m
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: creativeml-openrail-m
3
+ tags:
4
+ - surface normals estimation
5
+ - panoramic images
6
+ - high resolution
7
+ - image analysis
8
+ - computer vision
9
+ - in-the-wild
10
+ - zero-shot
11
+ ---
12
+
13
+ <h1 align="center"> 📟 PaGeR(Panoramic Geometry Estimation) Surface Normals Estimation Model Card</h1>
14
+
15
+ <p align="center">
16
+ <a title="Github" href="https://github.com/prs-eth/PaGeR" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
17
+ <img src="https://img.shields.io/github/stars/prs-eth/PaGeR?label=GitHub%20%E2%98%85&logo=github&color=C8C" alt="Github">
18
+ </a>
19
+ <a title="Website" href="https://marigoldcomputervision.github.io/" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
20
+ <img src="https://img.shields.io/badge/%E2%99%A5%20Project%20-Website-blue" alt="Website">
21
+ </a>
22
+ <a title="arXiv" href="https://arxiv.org/abs/2505.09358" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
23
+ <img src="https://img.shields.io/badge/%F0%9F%93%84%20Read%20-Paper-AF3436" alt="arXiv">
24
+ </a>
25
+ <a title="Hugging Face" href="https://huggingface.co/spaces/prs-eth/PaGeR" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
26
+ <img src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-FFD21E" alt="Hugging Face Spaces">
27
+ </a>
28
+ <a title="License" href="LICENSE"
29
+ target="_blank" rel="noopener noreferrer" style="display: inline-block;">
30
+ <img src="https://img.shields.io/badge/License-CreativeML%20OpenRAIL-7C3AED" alt="License">
31
+ </a>
32
+ </p>
33
+
34
+ This is a model card for the `PaGeR-normals-Structured3D` model for monocular normal estimation from a single panoramic [ERP](https://en.wikipedia.org/wiki/Equirectangular_projection) image.
35
+ The model is fine-tuned from our [original normals checkpoint](https://huggingface.co/prs-eth/PaGeR-normals) on Structured3D dataset,
36
+ as described in our paper:
37
+ - [Paper](paper link) titled "Panorama Geometry Estimation using Single-Step Diffusion Models"
38
+
39
+ You can also check out other depth and normals models in our [collection](https://huggingface.co/collections/prs-eth/pager), or test models in our [demo](https://huggingface.co/spaces/prs-eth/PaGeR).
40
+ ## Model Details
41
+ - **Developed by:** [Vukasin Bozic](https://vulus98.github.io/), [Isidora Slavkovic](https://linkedin.com/in/isidora-slavkovic), [Dominik Narnhofer](https://scholar.google.com/citations?user=tFx8AhkAAAAJ&hl=en), [Nando Metzger](https://nandometzger.github.io/), [Denis Rozumny](https://rozumden.github.io/), [Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ), [Nikolai Kalischek](https://scholar.google.com/citations?user=XwzlnZoAAAAJ&hl=de).
42
+ - **Model type:** Generative latent diffusion-based one-step monocular panoramic surface normal estimation from a single ERP image.
43
+ - **License:** [CreativeML OpenRAIL License](LICENSE).
44
+ - **Model Description:** This model can be used to generate an estimated normals map of a panoramic input image.
45
+ - **Resolution**: The model is designed to support large resolutions up to 3K.
46
+ - **Dataset**: [Structured3D](https://structured3d-dataset.org/)
47
+ - **Steps and scheduler**: This model works in a swift, one-step regime.
48
+ - **Outputs**:
49
+ - **Surface Normals map**: The predicted values represent a 3D normals map in range 0-1, representing the 3D surface normal vector at each pixel.
50
+ - **Resources for more information:** [Project Website](insert link), [Paper](insert link), [Code](https://github.com/prs-eth/PaGeR).
51
+ - **Cite as:**
52
+
53
+ ```bibtex
54
+ Add citation
55
+ ```