|
|
--- |
|
|
license: mit |
|
|
pipeline_tag: image-segmentation |
|
|
tags: |
|
|
- remote-sensing |
|
|
- earth-observation |
|
|
- open-vocabulary |
|
|
- clip |
|
|
- sam3 |
|
|
- semantic-segmentation |
|
|
library_name: transformers |
|
|
--- |
|
|
|
|
|
# SegEarth-OV: Unified Open-Vocabulary Segmentation for Remote Sensing |
|
|
|
|
|
Unified repo for **SegEarth OV, OV-2, OV-3** β training-free open-vocabulary semantic segmentation. Each variant lives in a **self-contained subfolder** with its own `config.json`, pipeline, and weights. |
|
|
|
|
|
## Structure |
|
|
|
|
|
``` |
|
|
SegEarth-OV/ |
|
|
βββ OV/ # CLIP (OpenAI ViT-B/16) + SimFeatUp |
|
|
β βββ config.json |
|
|
β βββ pipeline.py |
|
|
β βββ upsamplers.py |
|
|
β βββ prompts/ |
|
|
β βββ configs/cls_*.txt |
|
|
β βββ weights/featup/ # SimFeatUp checkpoints |
|
|
β βββ weights/backbone/ # OpenAI CLIP (clip-vit-base-patch16) |
|
|
βββ OV-2/ # AlignEarth (SAR) + SimFeatUp |
|
|
β βββ config.json |
|
|
β βββ pipeline.py |
|
|
β βββ weights/featup/ |
|
|
β βββ weights/backbone/ # AlignEarth-SAR-ViT-B-16 |
|
|
βββ OV-3/ # SAM3 (no featup) |
|
|
β βββ config.json |
|
|
β βββ pipeline.py |
|
|
β βββ configs/ |
|
|
β βββ weights/backbone/ # facebook/sam3 (sam3.pt) |
|
|
βββ model_config.json |
|
|
``` |
|
|
|
|
|
## Self-Contained (No Download) |
|
|
|
|
|
All checkpoints are **included** in this repo. No additional download required. |
|
|
|
|
|
| Variant | Backbone | Location | |
|
|
|---------|----------|----------| |
|
|
| OV | OpenAI CLIP ViT-B/16 | `OV/weights/backbone/clip-vit-base-patch16/` | |
|
|
| OV-2 | AlignEarth-SAR-ViT-B-16 | `OV-2/weights/backbone/AlignEarth-SAR-ViT-B-16/` | |
|
|
| OV-3 | SAM3 | `OV-3/weights/backbone/sam3/sam3.pt` | |
|
|
| OV, OV-2 | SimFeatUp (jbu_one, etc.) | `OV/weights/featup/`, `OV-2/weights/featup/` | |
|
|
|
|
|
## Usage |
|
|
|
|
|
**From subfolder (self-contained):** |
|
|
|
|
|
```python |
|
|
# OV-2 with AlignEarth (SAR) |
|
|
from pipeline import SegEarthPipeline |
|
|
pipe = SegEarthPipeline() # loads OV-2/config.json |
|
|
seg = pipe(image) |
|
|
|
|
|
# Or: cd OV-2 && python -c "from pipeline import load; pipe = load()" |
|
|
``` |
|
|
|
|
|
**From repo root:** |
|
|
|
|
|
```python |
|
|
from pipeline import SegEarthPipeline |
|
|
|
|
|
pipe = SegEarthPipeline(variant="OV-2") |
|
|
pipe = SegEarthPipeline(variant="OV") |
|
|
pipe = SegEarthPipeline(variant="OV-3") # requires sam3 package |
|
|
``` |
|
|
|
|
|
**Custom class config:** |
|
|
|
|
|
```python |
|
|
pipe = SegEarthPipeline(variant="OV-2", class_names_path="OV-2/configs/cls_yeseg_sar.txt") |
|
|
``` |
|
|
|
|
|
## Variants |
|
|
|
|
|
| Subfolder | Backbone | Model ID | FeatUp | |
|
|
|-----------|----------|----------|--------| |
|
|
| OV | CLIP | openai/clip-vit-base-patch16 | jbu_one | |
|
|
| OV-2 | AlignEarth | BiliSakura/AlignEarth-SAR-ViT-B-16 | jbu_one | |
|
|
| OV-3 | SAM3 | facebook/sam3 | None | |
|
|
|
|
|
## Citation |
|
|
|
|
|
```bibtex |
|
|
@InProceedings{Li_2025_CVPR, |
|
|
author = {Li, Kaiyu and Liu, Ruixun and Cao, Xiangyong and Bai, Xueru and Zhou, Feng and Meng, Deyu and Wang, Zhi}, |
|
|
title = {SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images}, |
|
|
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)}, |
|
|
year = {2025}, |
|
|
pages = {10545--10556} |
|
|
} |
|
|
|
|
|
@article{li2025segearthov2, |
|
|
title = {Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images}, |
|
|
author = {Li, Kaiyu and Cao, Xiangyong and Liu, Ruixun and Wang, Shihong and Jiang, Zixuan and Wang, Zhi and Meng, Deyu}, |
|
|
journal = {arXiv preprint arXiv:2508.18067}, |
|
|
year = {2025} |
|
|
} |
|
|
|
|
|
@article{li2025segearthov3, |
|
|
title = {SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images}, |
|
|
author = {Li, Kaiyu and Zhang, Shengqi and Deng, Yupeng and Wang, Zhi and Meng, Deyu and Cao, Xiangyong}, |
|
|
journal = {arXiv preprint arXiv:2512.08730}, |
|
|
year = {2025} |
|
|
} |
|
|
``` |
|
|
|