Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,55 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: creativeml-openrail-m
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: creativeml-openrail-m
|
| 3 |
+
tags:
|
| 4 |
+
- surface normals estimation
|
| 5 |
+
- panoramic images
|
| 6 |
+
- high resolution
|
| 7 |
+
- image analysis
|
| 8 |
+
- computer vision
|
| 9 |
+
- in-the-wild
|
| 10 |
+
- zero-shot
|
| 11 |
+
---
|
| 12 |
+
|
| 13 |
+
<h1 align="center"> 📟 PaGeR(Panoramic Geometry Estimation) Surface Normals Estimation Model Card</h1>
|
| 14 |
+
|
| 15 |
+
<p align="center">
|
| 16 |
+
<a title="Github" href="https://github.com/prs-eth/PaGeR" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
|
| 17 |
+
<img src="https://img.shields.io/github/stars/prs-eth/PaGeR?label=GitHub%20%E2%98%85&logo=github&color=C8C" alt="Github">
|
| 18 |
+
</a>
|
| 19 |
+
<a title="Website" href="https://marigoldcomputervision.github.io/" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
|
| 20 |
+
<img src="https://img.shields.io/badge/%E2%99%A5%20Project%20-Website-blue" alt="Website">
|
| 21 |
+
</a>
|
| 22 |
+
<a title="arXiv" href="https://arxiv.org/abs/2505.09358" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
|
| 23 |
+
<img src="https://img.shields.io/badge/%F0%9F%93%84%20Read%20-Paper-AF3436" alt="arXiv">
|
| 24 |
+
</a>
|
| 25 |
+
<a title="Hugging Face" href="https://huggingface.co/spaces/prs-eth/PaGeR" target="_blank" rel="noopener noreferrer" style="display: inline-block;">
|
| 26 |
+
<img src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Spaces-FFD21E" alt="Hugging Face Spaces">
|
| 27 |
+
</a>
|
| 28 |
+
<a title="License" href="LICENSE"
|
| 29 |
+
target="_blank" rel="noopener noreferrer" style="display: inline-block;">
|
| 30 |
+
<img src="https://img.shields.io/badge/License-CreativeML%20OpenRAIL-7C3AED" alt="License">
|
| 31 |
+
</a>
|
| 32 |
+
</p>
|
| 33 |
+
|
| 34 |
+
This is a model card for the `PaGeR-normals-Structured3D` model for monocular normal estimation from a single panoramic [ERP](https://en.wikipedia.org/wiki/Equirectangular_projection) image.
|
| 35 |
+
The model is fine-tuned from our [original normals checkpoint](https://huggingface.co/prs-eth/PaGeR-normals) on Structured3D dataset,
|
| 36 |
+
as described in our paper:
|
| 37 |
+
- [Paper](paper link) titled "Panorama Geometry Estimation using Single-Step Diffusion Models"
|
| 38 |
+
|
| 39 |
+
You can also check out other depth and normals models in our [collection](https://huggingface.co/collections/prs-eth/pager), or test models in our [demo](https://huggingface.co/spaces/prs-eth/PaGeR).
|
| 40 |
+
## Model Details
|
| 41 |
+
- **Developed by:** [Vukasin Bozic](https://vulus98.github.io/), [Isidora Slavkovic](https://linkedin.com/in/isidora-slavkovic), [Dominik Narnhofer](https://scholar.google.com/citations?user=tFx8AhkAAAAJ&hl=en), [Nando Metzger](https://nandometzger.github.io/), [Denis Rozumny](https://rozumden.github.io/), [Konrad Schindler](https://scholar.google.com/citations?user=FZuNgqIAAAAJ), [Nikolai Kalischek](https://scholar.google.com/citations?user=XwzlnZoAAAAJ&hl=de).
|
| 42 |
+
- **Model type:** Generative latent diffusion-based one-step monocular panoramic surface normal estimation from a single ERP image.
|
| 43 |
+
- **License:** [CreativeML OpenRAIL License](LICENSE).
|
| 44 |
+
- **Model Description:** This model can be used to generate an estimated normals map of a panoramic input image.
|
| 45 |
+
- **Resolution**: The model is designed to support large resolutions up to 3K.
|
| 46 |
+
- **Dataset**: [Structured3D](https://structured3d-dataset.org/)
|
| 47 |
+
- **Steps and scheduler**: This model works in a swift, one-step regime.
|
| 48 |
+
- **Outputs**:
|
| 49 |
+
- **Surface Normals map**: The predicted values represent a 3D normals map in range 0-1, representing the 3D surface normal vector at each pixel.
|
| 50 |
+
- **Resources for more information:** [Project Website](insert link), [Paper](insert link), [Code](https://github.com/prs-eth/PaGeR).
|
| 51 |
+
- **Cite as:**
|
| 52 |
+
|
| 53 |
+
```bibtex
|
| 54 |
+
Add citation
|
| 55 |
+
```
|