| --- |
| license: apache-2.0 |
| tags: |
| - depth-estimation |
| - monocular-depth |
| - diffusion |
| pipeline_tag: depth-estimation |
| --- |
| |
| # Pixel-Perfect-Depth — MoGe2 checkpoint (mirror) |
|
|
| This is an **unmodified mirror** of the `ppd_moge.pth` checkpoint from |
| [Pixel-Perfect Depth](https://github.com/gangweix/pixel-perfect-depth) (NeurIPS 2025), |
| the PPD variant that uses **MoGe2** semantics and delivers a ~20–30% improvement on |
| zero-shot benchmarks over the DA2 variant. |
|
|
| It is rehosted here **only** because the original file is distributed via Google Drive, |
| which is unreliable for automated downloads in the |
| [ComfyUI-PixelPerfectDepth](https://github.com/PozzettiAndrea/ComfyUI-PixelPerfectDepth) |
| integration. All credit belongs to the original authors. |
|
|
| ## Source & attribution |
|
|
| - **Original repo:** https://github.com/gangweix/pixel-perfect-depth (Apache-2.0) |
| - **Original weights:** Google Drive — file id `1tabmcsbRVDKDfmO4KU1vOjurzN-wp0HV` |
| (linked from the upstream README, "PPD / MoGe2" row) |
| - **Paper:** https://arxiv.org/abs/2510.07316 |
|
|
| This mirror is unmodified and redistributed under the upstream **Apache-2.0** license |
| (see `LICENSE`). No endorsement by the original authors is implied. |
|
|
| ## Usage |
|
|
| This checkpoint requires the MoGe2 encoder weights |
| ([`moge2.pt`](https://huggingface.co/Ruicheng/moge-2-vitl-normal)) at load time, as in |
| the upstream `run.py --semantics_model MoGe2`. |
|
|
| ```bibtex |
| @article{xu2025pixel, |
| title={Pixel-perfect depth with semantics-prompted diffusion transformers}, |
| author={Xu, Gangwei and Lin, Haotong and Luo, Hongcheng and others}, |
| journal={arXiv preprint arXiv:2510.07316}, |
| year={2025} |
| } |
| ``` |
|
|