SegEarth-OV: Unified Open-Vocabulary Segmentation for Remote Sensing

Unified repo for SegEarth OV, OV-2, OV-3 β€” training-free open-vocabulary semantic segmentation. Each variant lives in a self-contained subfolder with its own config.json, pipeline, and weights.

Structure

SegEarth-OV/
β”œβ”€β”€ OV/           # CLIP (OpenAI ViT-B/16) + SimFeatUp
β”‚   β”œβ”€β”€ config.json
β”‚   β”œβ”€β”€ pipeline.py
β”‚   β”œβ”€β”€ upsamplers.py
β”‚   β”œβ”€β”€ prompts/
β”‚   β”œβ”€β”€ configs/cls_*.txt
β”‚   β”œβ”€β”€ weights/featup/      # SimFeatUp checkpoints
β”‚   └── weights/backbone/    # OpenAI CLIP (clip-vit-base-patch16)
β”œβ”€β”€ OV-2/         # AlignEarth (SAR) + SimFeatUp
β”‚   β”œβ”€β”€ config.json
β”‚   β”œβ”€β”€ pipeline.py
β”‚   β”œβ”€β”€ weights/featup/
β”‚   └── weights/backbone/    # AlignEarth-SAR-ViT-B-16
β”œβ”€β”€ OV-3/         # SAM3 (no featup)
β”‚   β”œβ”€β”€ config.json
β”‚   β”œβ”€β”€ pipeline.py
β”‚   β”œβ”€β”€ configs/
β”‚   └── weights/backbone/   # facebook/sam3 (sam3.pt)
└── model_config.json

Self-Contained (No Download)

All checkpoints are included in this repo. No additional download required.

Variant Backbone Location
OV OpenAI CLIP ViT-B/16 OV/weights/backbone/clip-vit-base-patch16/
OV-2 AlignEarth-SAR-ViT-B-16 OV-2/weights/backbone/AlignEarth-SAR-ViT-B-16/
OV-3 SAM3 OV-3/weights/backbone/sam3/sam3.pt
OV, OV-2 SimFeatUp (jbu_one, etc.) OV/weights/featup/, OV-2/weights/featup/

Usage

From subfolder (self-contained):

# OV-2 with AlignEarth (SAR)
from pipeline import SegEarthPipeline
pipe = SegEarthPipeline()  # loads OV-2/config.json
seg = pipe(image)

# Or: cd OV-2 && python -c "from pipeline import load; pipe = load()"

From repo root:

from pipeline import SegEarthPipeline

pipe = SegEarthPipeline(variant="OV-2")
pipe = SegEarthPipeline(variant="OV")
pipe = SegEarthPipeline(variant="OV-3")  # requires sam3 package

Custom class config:

pipe = SegEarthPipeline(variant="OV-2", class_names_path="OV-2/configs/cls_yeseg_sar.txt")

Variants

Subfolder Backbone Model ID FeatUp
OV CLIP openai/clip-vit-base-patch16 jbu_one
OV-2 AlignEarth BiliSakura/AlignEarth-SAR-ViT-B-16 jbu_one
OV-3 SAM3 facebook/sam3 None

Citation

@InProceedings{Li_2025_CVPR,
  author    = {Li, Kaiyu and Liu, Ruixun and Cao, Xiangyong and Bai, Xueru and Zhou, Feng and Meng, Deyu and Wang, Zhi},
  title     = {SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year      = {2025},
  pages     = {10545--10556}
}

@article{li2025segearthov2,
  title   = {Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images},
  author  = {Li, Kaiyu and Cao, Xiangyong and Liu, Ruixun and Wang, Shihong and Jiang, Zixuan and Wang, Zhi and Meng, Deyu},
  journal = {arXiv preprint arXiv:2508.18067},
  year    = {2025}
}

@article{li2025segearthov3,
  title   = {SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images},
  author  = {Li, Kaiyu and Zhang, Shengqi and Deng, Yupeng and Wang, Zhi and Meng, Deyu and Cao, Xiangyong},
  journal = {arXiv preprint arXiv:2512.08730},
  year    = {2025}
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Papers for BiliSakura/SegEarth-OV