File size: 1,445 Bytes
66d1a49 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 |
---
license: mit
pipeline_tag: other
---
# PatchAlign3D: Local Feature Alignment for Dense 3D Shape Understanding
PatchAlign3D is an encoder-only 3D model that produces language-aligned patch-level features directly from point clouds. It enables zero-shot 3D part segmentation with fast single-pass inference without requiring test-time multi-view rendering.
- **Paper:** [PatchAlign3D: Local Feature Alignment for Dense 3D Shape understanding](https://huggingface.co/papers/2601.02457)
- **Project Page:** [https://souhail-hadgi.github.io/patchalign3dsite](https://souhail-hadgi.github.io/patchalign3dsite)
- **Repository:** [https://github.com/souhail-hadgi/PatchAlign3D](https://github.com/souhail-hadgi/PatchAlign3D)
## Sample Usage
You can run inference on a single shape and save per-point predictions using the following command from the official repository:
```bash
python patchalign3d/inference/infer.py \
--ckpt /path/to/stage2_last.pt \
--input /path/to/shape.npz \
--labels "seat,back,leg,arm"
```
## Citation
```bibtex
@misc{hadgi2026patchalign3dlocalfeaturealignment,
title={PatchAlign3D: Local Feature Alignment for Dense 3D Shape understanding},
author={Souhail Hadgi and Bingchen Gong and Ramana Sundararaman and Emery Pierson and Lei Li and Peter Wonka and Maks Ovsjanikov},
year={2026},
eprint={2601.02457},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2601.02457},
}
``` |