Add pipeline tag and link to paper
Hi! I'm Niels, part of the community science team at Hugging Face.
I've updated the model card to include the `image-segmentation` pipeline tag, which improves discoverability on the Hub. I've also linked the model to its research paper and GitHub repository, and added the license (Apache 2.0) based on the base model. This provides better context and documentation for users and researchers.
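For anyone who wants to sanity-check the metadata after merging, here is a minimal sketch that reads the front matter of a README and confirms the new keys are present. The `parse_front_matter` helper is hypothetical and only handles the flat scalars and simple lists used in this card; a real YAML parser would be more robust.

```python
def parse_front_matter(readme_text):
    """Parse a flat front-matter block (scalars and simple lists only).

    Hypothetical helper for illustration; it is not a general YAML parser.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter at the top of the file

    metadata = {}
    current_key = None  # key whose list items are being collected
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the block
            break
        if not stripped:
            continue
        if stripped.startswith("- ") and current_key is not None:
            metadata[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                metadata[key] = value
                current_key = None
            else:
                metadata[key] = []  # a list is expected on the next lines
                current_key = key
    return metadata


# Front matter as it appears on the new side of this diff.
README = """---
base_model:
- Qwen/Qwen2.5-VL-7B-Instruct
datasets:
- earth-insights/EarthReason
library_name: transformers
pipeline_tag: image-segmentation
license: apache-2.0
---

# Model card body
"""

metadata = parse_front_matter(README)
missing = [key for key in ("pipeline_tag", "license") if key not in metadata]
print("missing keys:", missing)
```

An empty `missing` list means both keys added by this PR were found.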
README.md CHANGED
@@ -1,16 +1,38 @@
 ---
-datasets:
-- earth-insights/EarthReason
 base_model:
 - Qwen/Qwen2.5-VL-7B-Instruct
+datasets:
+- earth-insights/EarthReason
 library_name: transformers
+pipeline_tag: image-segmentation
+license: apache-2.0
 ---
 
+# Bridging Semantics and Geometry: A Decoupled LVLM–SAM Framework for Reasoning Segmentation in Remote Sensing
+
+This repository contains the 7B model of **Think2Seg-RS**, a decoupled framework for reasoning segmentation in remote sensing (RS) imagery.
+
+The model was introduced in the paper [Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing](https://huggingface.co/papers/2512.19302).
+
+## Overview
+
+Think2Seg-RS decouples high-level semantic reasoning from low-level geometric execution. It trains an LVLM prompter (based on Qwen-2.5-VL) to control a frozen Segment Anything Model (SAM2) via structured geometric prompts. Through a mask-only reinforcement learning objective, the LVLM learns to translate abstract semantic reasoning into spatially grounded actions, achieving state-of-the-art performance on the EarthReason dataset.
+
+## Resources
 
-
+- **Paper:** [arXiv:2512.19302](https://huggingface.co/papers/2512.19302)
+- **Code:** [GitHub - Think2Seg-RS](https://github.com/Ricardo-XZ/Think2Seg-RS)
+- **Dataset:** [EarthReason](https://huggingface.co/datasets/earth-insights/EarthReason)
 
-
+## Citation
 
-
+If you find this work helpful for your research, please cite:
 
-
+```bibtex
+@article{think2seg_rs_2025,
+  title={Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing},
+  author={Anonymous},
+  journal={arXiv preprint arXiv:2512.19302},
+  year={2025}
+}
+```
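The Overview text added in this diff mentions a "mask-only reinforcement learning objective" but does not say how the reward is computed. As a rough illustration only, the sketch below scores a predicted mask against a ground-truth mask with intersection-over-union (IoU), a common choice for mask-level rewards. Using IoU here is an assumption, not something the model card or this PR confirms, and `mask_iou` is a hypothetical helper.

```python
def mask_iou(pred_mask, gt_mask):
    """Intersection-over-union of two binary masks.

    Masks are equally sized nested lists of 0/1 values. Hypothetical helper:
    the reward actually used by Think2Seg-RS may differ.
    """
    if len(pred_mask) != len(gt_mask):
        raise ValueError("masks must have the same number of rows")

    intersection = 0
    union = 0
    for pred_row, gt_row in zip(pred_mask, gt_mask):
        if len(pred_row) != len(gt_row):
            raise ValueError("mask rows must have the same length")
        for p, g in zip(pred_row, gt_row):
            p, g = bool(p), bool(g)
            intersection += p and g  # pixel set in both masks
            union += p or g          # pixel set in at least one mask

    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


# Toy 2x3 masks: 2 pixels overlap, 4 pixels are set in at least one mask.
pred = [[1, 1, 0],
        [0, 1, 0]]
gt = [[1, 0, 0],
      [0, 1, 1]]

reward = mask_iou(pred, gt)
print(reward)  # 0.5
```

A reward of this kind depends only on the final mask, so it can be computed without supervising the intermediate geometric prompts.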