File size: 1,515 Bytes
dbabbd2
 
d266241
 
 
dbabbd2
 
e590826
dbabbd2
e590826
 
 
 
 
 
 
 
dbabbd2
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d266241
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
---
license: apache-2.0
pipeline_tag: image-to-3d
tags:
- art
---

![PPD sample output](https://raw.githubusercontent.com/Apache0ne/ComfyUI-pixel-perfect-depth/main/examples/ImageToStl.com_PPD_00020_.ply.gif)

- Model Downloads for ComfyUI custom node [Here](https://github.com/Apache0ne/ComfyUI-pixel-perfect-depth)
  Node usage on github
- Orgin models can be found linked [here](https://github.com/gangweix/pixel-perfect-depth) at orgin github in google drives or some at these Huggingfaces listed below

- https://huggingface.co/gangweix/Pixel-Perfect-Depth
- https://huggingface.co/depth-anything/Depth-Anything-V2-Large
- https://huggingface.co/Ruicheng/moge-2-vitl-normal
- https://huggingface.co/yyfz233/Pi3

## Acknowledgement

We are grateful to the [Depth Anything V2](https://github.com/DepthAnything/Depth-Anything-V2), [MoGe](https://github.com/microsoft/MoGe) and [DiT](https://github.com/facebookresearch/DiT) teams for their code and model release. We would also like to sincerely thank the NeurIPS reviewers for their appreciation of this work (ratings: 5, 5, 5, 5).

## Citation

If you find this project useful, please consider citing:

```bibtex
@article{xu2025pixel,
  title={Pixel-perfect depth with semantics-prompted diffusion transformers},
  author={Xu, Gangwei and Lin, Haotong and Luo, Hongcheng and Wang, Xianqi and Yao, Jingfeng and Zhu, Lianghui and Pu, Yuechuan and Chi, Cheng and Sun, Haiyang and Wang, Bing and others},
  journal={arXiv preprint arXiv:2510.07316},
  year={2025}
}