SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images
Paper: arXiv:2512.08730
Unified repo for SegEarth-OV, OV-2, and OV-3: training-free open-vocabulary semantic segmentation. Each variant lives in a self-contained subfolder with its own config.json, pipeline, and weights.
SegEarth-OV/
├── OV/                      # CLIP (OpenAI ViT-B/16) + SimFeatUp
│   ├── config.json
│   ├── pipeline.py
│   ├── upsamplers.py
│   ├── prompts/
│   ├── configs/cls_*.txt
│   ├── weights/featup/      # SimFeatUp checkpoints
│   └── weights/backbone/    # OpenAI CLIP (clip-vit-base-patch16)
├── OV-2/                    # AlignEarth (SAR) + SimFeatUp
│   ├── config.json
│   ├── pipeline.py
│   ├── weights/featup/
│   └── weights/backbone/    # AlignEarth-SAR-ViT-B-16
├── OV-3/                    # SAM3 (no featup)
│   ├── config.json
│   ├── pipeline.py
│   ├── configs/
│   └── weights/backbone/    # facebook/sam3 (sam3.pt)
└── model_config.json
All checkpoints are included in this repo. No additional download required.
| Variant | Backbone / upsampler | Location |
|---|---|---|
| OV | OpenAI CLIP ViT-B/16 | OV/weights/backbone/clip-vit-base-patch16/ |
| OV-2 | AlignEarth-SAR-ViT-B-16 | OV-2/weights/backbone/AlignEarth-SAR-ViT-B-16/ |
| OV-3 | SAM3 | OV-3/weights/backbone/sam3/sam3.pt |
| OV, OV-2 | SimFeatUp (jbu_one, etc.) | OV/weights/featup/, OV-2/weights/featup/ |
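Because the weights ship with the repo, one quick sanity check before loading a pipeline is to confirm that the locations listed in the table exist in the local checkout. The following is a minimal sketch under that assumption; `EXPECTED_WEIGHTS` and `missing_weights` are hypothetical names used for illustration and are not part of the repo.

```python
from pathlib import Path

# Paths copied from the checkpoint table; this dict and the helper are illustrative only.
EXPECTED_WEIGHTS = {
    "OV": ["OV/weights/backbone/clip-vit-base-patch16", "OV/weights/featup"],
    "OV-2": ["OV-2/weights/backbone/AlignEarth-SAR-ViT-B-16", "OV-2/weights/featup"],
    "OV-3": ["OV-3/weights/backbone/sam3/sam3.pt"],
}


def missing_weights(repo_root, variant):
    """Return the expected weight paths for `variant` that are absent under `repo_root`."""
    root = Path(repo_root)
    return [rel for rel in EXPECTED_WEIGHTS[variant] if not (root / rel).exists()]


if __name__ == "__main__":
    # Run from the repo root; an empty list means every listed path was found.
    for name in EXPECTED_WEIGHTS:
        print(name, "missing:", missing_weights(".", name))
```

An empty list for a variant suggests the checkout is complete for it; a non-empty list usually points to files that were not fully fetched during cloning.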
From a subfolder (self-contained), for example inside OV-2/:
# OV-2 with AlignEarth (SAR)
from pipeline import SegEarthPipeline
pipe = SegEarthPipeline() # loads OV-2/config.json
seg = pipe(image)
# Or: cd OV-2 && python -c "from pipeline import load; pipe = load()"
From repo root:
from pipeline import SegEarthPipeline
pipe = SegEarthPipeline(variant="OV-2")
pipe = SegEarthPipeline(variant="OV")
pipe = SegEarthPipeline(variant="OV-3") # requires sam3 package
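Since only the OV-3 variant needs the `sam3` package, a caller may want to check for it before constructing that pipeline. This is a minimal sketch, assuming the dependency is importable under the module name `sam3` (inferred from the comment above, not verified here); `module_available` is a hypothetical helper, and falling back to `"OV"` is only one possible choice.

```python
import importlib.util


def module_available(name):
    """Return True if a top-level module called `name` can be located without importing it."""
    return importlib.util.find_spec(name) is not None


# Assumption: the SAM 3 dependency is importable as `sam3`.
variant = "OV-3" if module_available("sam3") else "OV"
print("selected variant:", variant)
```

The resulting `variant` string could then be passed to `SegEarthPipeline(variant=variant)` as in the examples above.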
Custom class config:
pipe = SegEarthPipeline(variant="OV-2", class_names_path="OV-2/configs/cls_yeseg_sar.txt")
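To inspect a class file before passing it as `class_names_path`, a small reader can help. The sketch below assumes a format this README does not confirm: one class per line, with optional comma-separated synonyms for the same class. `read_class_names` is a hypothetical helper, not a function from the repo, and the demo file contents are made up.

```python
import tempfile
from pathlib import Path


def read_class_names(path):
    """Parse a class file into a list of synonym lists, one entry per non-empty line.

    Assumed format (not confirmed by the README): one class per line,
    with optional comma-separated synonyms, e.g. "building,house".
    """
    classes = []
    for line in Path(path).read_text(encoding="utf-8").splitlines():
        names = [part.strip() for part in line.split(",") if part.strip()]
        if names:
            classes.append(names)
    return classes


if __name__ == "__main__":
    # Write a throwaway demo file; the class names here are illustrative only.
    with tempfile.TemporaryDirectory() as tmp:
        demo = Path(tmp) / "cls_demo.txt"
        demo.write_text("background\nbuilding,house\nwater\n", encoding="utf-8")
        for index, names in enumerate(read_class_names(demo)):
            print(index, names)
```

Under this assumed format, the line index would correspond to the class index in the predicted segmentation map, which is worth verifying against the pipeline's actual output.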
| Subfolder | Backbone | Model ID | FeatUp |
|---|---|---|---|
| OV | CLIP | openai/clip-vit-base-patch16 | jbu_one |
| OV-2 | AlignEarth | BiliSakura/AlignEarth-SAR-ViT-B-16 | jbu_one |
| OV-3 | SAM3 | facebook/sam3 | None |
@InProceedings{Li_2025_CVPR,
author = {Li, Kaiyu and Liu, Ruixun and Cao, Xiangyong and Bai, Xueru and Zhou, Feng and Meng, Deyu and Wang, Zhi},
title = {SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2025},
pages = {10545--10556}
}
@article{li2025segearthov2,
title = {Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images},
author = {Li, Kaiyu and Cao, Xiangyong and Liu, Ruixun and Wang, Shihong and Jiang, Zixuan and Wang, Zhi and Meng, Deyu},
journal = {arXiv preprint arXiv:2508.18067},
year = {2025}
}
@article{li2025segearthov3,
title = {SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images},
author = {Li, Kaiyu and Zhang, Shengqi and Deng, Yupeng and Wang, Zhi and Meng, Deyu and Cao, Xiangyong},
journal = {arXiv preprint arXiv:2512.08730},
year = {2025}
}