nielsr HF Staff commited on
Commit
245060c
·
verified ·
1 Parent(s): 5acc1ad

Add pipeline tag and link to paper

Browse files

Hi! I'm Niels, part of the community science team at Hugging Face.

I've updated the model card to include the `image-segmentation` pipeline tag, which improves discoverability on the Hub. I've also linked the model to its research paper and GitHub repository, and added the license (Apache 2.0) based on the base model. This provides better context and documentation for users and researchers.

Files changed (1) hide show
  1. README.md +28 -6
README.md CHANGED
@@ -1,16 +1,38 @@
1
  ---
2
- datasets:
3
- - earth-insights/EarthReason
4
  base_model:
5
  - Qwen/Qwen2.5-VL-7B-Instruct
 
 
6
  library_name: transformers
 
 
7
  ---
8
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
- ## Bridging Semantics and Geometry: A Decoupled LVLM–SAM Framework for Reasoning Segmentation in Remote Sensing
 
 
11
 
12
- This is the 7B model of [Think2Seg-RS](https://github.com/Ricardo-XZ/Think2Seg-RS), a decoupled framework for reasoning segmentation in remote sensing (RS) imagery.
13
 
14
- Our core idea is to decouple high-level semantic reasoning from low-level geometric execution. Specifically, we train an LVLM prompter (e.g., Qwen-2.5-VL) to control a frozen Segment Anything Model (SAM2) via structured geometric prompts. Through a result-oriented reinforcement learning objective, the LVLM learns to translate abstract semantic reasoning into spatially grounded actions, achieving state-of-the-art performance on the EarthReason dataset.
15
 
16
- For more details, code, and the complete framework, please visit our [GitHub repository](https://github.com/Ricardo-XZ/Think2Seg-RS).
 
 
 
 
 
 
 
 
1
  ---
 
 
2
  base_model:
3
  - Qwen/Qwen2.5-VL-7B-Instruct
4
+ datasets:
5
+ - earth-insights/EarthReason
6
  library_name: transformers
7
+ pipeline_tag: image-segmentation
8
+ license: apache-2.0
9
  ---
10
 
11
+ # Bridging Semantics and Geometry: A Decoupled LVLM–SAM Framework for Reasoning Segmentation in Remote Sensing
12
+
13
+ This repository contains the 7B model of **Think2Seg-RS**, a decoupled framework for reasoning segmentation in remote sensing (RS) imagery.
14
+
15
+ The model was introduced in the paper [Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing](https://huggingface.co/papers/2512.19302).
16
+
17
+ ## Overview
18
+
19
+ Think2Seg-RS decouples high-level semantic reasoning from low-level geometric execution. It trains an LVLM prompter (based on Qwen-2.5-VL) to control a frozen Segment Anything Model (SAM2) via structured geometric prompts. Through a mask-only reinforcement learning objective, the LVLM learns to translate abstract semantic reasoning into spatially grounded actions, achieving state-of-the-art performance on the EarthReason dataset.
20
+
21
+ ## Resources
22
 
23
+ - **Paper:** [arXiv:2512.19302](https://huggingface.co/papers/2512.19302)
24
+ - **Code:** [GitHub - Think2Seg-RS](https://github.com/Ricardo-XZ/Think2Seg-RS)
25
+ - **Dataset:** [EarthReason](https://huggingface.co/datasets/earth-insights/EarthReason)
26
 
27
+ ## Citation
28
 
29
+ If you find this work helpful for your research, please cite:
30
 
31
+ ```bibtex
32
+ @article{think2seg_rs_2025,
33
+ title={Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing},
34
+ author={Anonymous},
35
+ journal={arXiv preprint arXiv:2512.19302},
36
+ year={2025}
37
+ }
38
+ ```