Add pipeline tag and link to paper

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +28 -6
README.md CHANGED
@@ -1,16 +1,38 @@
1
  ---
2
- datasets:
3
- - earth-insights/EarthReason
4
  base_model:
5
  - Qwen/Qwen2.5-VL-7B-Instruct
 
 
6
  library_name: transformers
 
 
7
  ---
8
 
 
 
 
 
 
 
 
 
 
 
 
9
 
10
- ## Bridging Semantics and Geometry: A Decoupled LVLM–SAM Framework for Reasoning Segmentation in Remote Sensing
 
 
11
 
12
- This is the 7B model of [Think2Seg-RS](https://github.com/Ricardo-XZ/Think2Seg-RS), a decoupled framework for reasoning segmentation in remote sensing (RS) imagery.
13
 
14
- Our core idea is to decouple high-level semantic reasoning from low-level geometric execution. Specifically, we train an LVLM prompter (e.g., Qwen-2.5-VL) to control a frozen Segment Anything Model (SAM2) via structured geometric prompts. Through a result-oriented reinforcement learning objective, the LVLM learns to translate abstract semantic reasoning into spatially grounded actions, achieving state-of-the-art performance on the EarthReason dataset.
15
 
16
- For more details, code, and the complete framework, please visit our [GitHub repository](https://github.com/Ricardo-XZ/Think2Seg-RS).
 
 
 
 
 
 
 
 
1
  ---
 
 
2
  base_model:
3
  - Qwen/Qwen2.5-VL-7B-Instruct
4
+ datasets:
5
+ - earth-insights/EarthReason
6
  library_name: transformers
7
+ pipeline_tag: image-segmentation
8
+ license: apache-2.0
9
  ---
10
 
11
+ # Bridging Semantics and Geometry: A Decoupled LVLM–SAM Framework for Reasoning Segmentation in Remote Sensing
12
+
13
+ This repository contains the 7B model of **Think2Seg-RS**, a decoupled framework for reasoning segmentation in remote sensing (RS) imagery.
14
+
15
+ The model was introduced in the paper [Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing](https://huggingface.co/papers/2512.19302).
16
+
17
+ ## Overview
18
+
19
+ Think2Seg-RS decouples high-level semantic reasoning from low-level geometric execution. It trains an LVLM prompter (based on Qwen-2.5-VL) to control a frozen Segment Anything Model (SAM2) via structured geometric prompts. Through a mask-only reinforcement learning objective, the LVLM learns to translate abstract semantic reasoning into spatially grounded actions, achieving state-of-the-art performance on the EarthReason dataset.
20
+
21
+ ## Resources
22
 
23
+ - **Paper:** [arXiv:2512.19302](https://huggingface.co/papers/2512.19302)
24
+ - **Code:** [GitHub - Think2Seg-RS](https://github.com/Ricardo-XZ/Think2Seg-RS)
25
+ - **Dataset:** [EarthReason](https://huggingface.co/datasets/earth-insights/EarthReason)
26
 
27
+ ## Citation
28
 
29
+ If you find this work helpful for your research, please cite:
30
 
31
+ ```bibtex
32
+ @article{think2seg_rs_2025,
33
+ title={Bridging Semantics and Geometry: A Decoupled LVLM-SAM Framework for Reasoning Segmentation in Remote Sensing},
34
+ author={Anonymous},
35
+ journal={arXiv preprint arXiv:2512.19302},
36
+ year={2025}
37
+ }
38
+ ```