HIPIE / README.md
KonstantinosKK's picture
Update README.md
2bde18e verified
---
license: mit
---
# HIPIE: Hierarchical Open-vocabulary Universal Image Segmentation
PyTorch implementation of HIPIE from ["Hierarchical Open-vocabulary Universal Image Segmentation"](https://arxiv.org/abs/2307.00764) (Wang et al., NeurIPS 2023).
## Pretrained Weights
We provide ViT-H and ResNet-50 weights for hierarchical and part-aware image segmentation across multiple datasets:
| Format | Filename | Description |
|--------|----------|-------------|
| ViT-H (O365, COCO, RefCOCO, PACO) | `vit_h_cloud.pth` | Pretrained with O365,COCO,RefCOCO,PACO |
| ViT-H (COCO, RefCOCO, Pascal-Parts) | `vit_h_cloud_parts.pth` | Finetuned on COCO,RefCOCO,Pascal-Parts |
| ResNet-50 (Pascal-Parts) | `r50_parts.pth` | Pretrained with O365,COCO,RefCOCO,Pascal Panoptic Parts |
## Usage
For demo notebooks, model configs, and inference scripts, see the [GitHub repository](https://github.com/berkeley-hipie/HIPIE).
## Citation
```
@inproceedings{wang2023hierarchical,
title={Hierarchical Open-vocabulary Universal Image Segmentation},
author={Wang, Xudong and Li, Shufan and Kallidromitis, Konstantinos and Kato, Yusuke and Kozuka, Kazuki and Darrell, Trevor},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
year={2023}
}
```