	---
	license: apache-2.0
	tags:
	- depth-estimation
	- monocular-depth
	- diffusion
	pipeline_tag: depth-estimation
	---

	# Pixel-Perfect-Depth — MoGe2 checkpoint (mirror)

	This is an unmodified mirror of the `ppd_moge.pth` checkpoint from
	[Pixel-Perfect Depth](https://github.com/gangweix/pixel-perfect-depth) (NeurIPS 2025).
	It is the PPD variant that uses MoGe2 as its semantics model, which the upstream
	project reports as delivering a ~20–30% improvement on zero-shot benchmarks over
	the DA2 variant.

	It is rehosted here only because the original file is distributed via Google Drive,
	which is unreliable for automated downloads in the
	[ComfyUI-PixelPerfectDepth](https://github.com/PozzettiAndrea/ComfyUI-PixelPerfectDepth)
	integration. All credit belongs to the original authors.

	## Source & attribution

	- Original repo: https://github.com/gangweix/pixel-perfect-depth (Apache-2.0)
	- Original weights: Google Drive, file id `1tabmcsbRVDKDfmO4KU1vOjurzN-wp0HV`
	  (linked from the upstream README, "PPD / MoGe2" row)
	- Paper: https://arxiv.org/abs/2510.07316

	This mirror is unmodified and redistributed under the upstream Apache-2.0 license
	(see `LICENSE`). No endorsement by the original authors is implied.
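
If you also have a copy of the upstream file from Google Drive, one way to check that the two files are byte-identical is to compare SHA-256 digests. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of_file` and `is_identical_copy` are hypothetical and are not part of the upstream repo or of this mirror.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so a multi-gigabyte checkpoint is never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def is_identical_copy(mirror_path: Path, upstream_path: Path) -> bool:
    """True when both files have the same SHA-256 digest."""
    return sha256_of_file(mirror_path) == sha256_of_file(upstream_path)
```

Both arguments are local paths that you supply, for example the file downloaded from this repository and the file downloaded from the upstream Google Drive link.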

	## Usage

	This checkpoint requires the MoGe2 encoder weights
	([`moge2.pt`](https://huggingface.co/Ruicheng/moge-2-vitl-normal)) at load time, as in
	the upstream `run.py --semantics_model MoGe2`.
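
For automated setups, the checkpoint can be fetched with `huggingface_hub`. The sketch below makes two assumptions that this card does not confirm: that the repository id is `apozz/Pixel-Perfect-Depth-MoGe2`, and that the file is stored under its upstream name `ppd_moge.pth`. The `fetch_ppd_checkpoint` helper and the `checkpoints` directory are hypothetical names chosen for illustration; check the upstream repo for the path that `run.py` actually expects.

```python
from pathlib import Path

# Assumptions (not confirmed by this card): the repository id below and the
# stored filename, which is taken to match the upstream checkpoint name.
PPD_REPO_ID = "apozz/Pixel-Perfect-Depth-MoGe2"
PPD_FILENAME = "ppd_moge.pth"


def fetch_ppd_checkpoint(local_dir: str = "checkpoints") -> Path:
    """Download the mirrored checkpoint and return its local path."""
    # Imported lazily so this module can be loaded without the dependency.
    from huggingface_hub import hf_hub_download

    path = hf_hub_download(
        repo_id=PPD_REPO_ID,
        filename=PPD_FILENAME,
        local_dir=local_dir,  # hypothetical target directory
    )
    return Path(path)


# Example call (requires network access and `pip install huggingface_hub`):
# checkpoint_path = fetch_ppd_checkpoint()
```

The MoGe2 encoder weights linked above are not fetched by this sketch and need to be obtained separately.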

	## Citation

	```bibtex
	@article{xu2025pixel,
	title={Pixel-perfect depth with semantics-prompted diffusion transformers},
	author={Xu, Gangwei and Lin, Haotong and Luo, Hongcheng and others},
	journal={arXiv preprint arXiv:2510.07316},
	year={2025}
	}
	```