RicardoString commited on
Commit
d812bc5
·
1 Parent(s): 7040378

update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -3,4 +3,12 @@ base_model:
3
  - Qwen/Qwen2.5-VL-3B-Instruct
4
  pipeline_tag: image-text-to-text
5
  library_name: transformers
6
- ---
 
 
 
 
 
 
 
 
 
3
  - Qwen/Qwen2.5-VL-3B-Instruct
4
  pipeline_tag: image-text-to-text
5
  library_name: transformers
6
+ ---
7
+
8
+ ## Bridging Semantics and Geometry: A Decoupled LVLM–SAM Framework for Reasoning Segmentation in Remote Sensing
9
+
10
+ This is the 3B model of [Think2Seg-RS](https://github.com/Ricardo-XZ/Think2Seg-RS), a decoupled framework for reasoning segmentation in remote sensing (RS) imagery.
11
+
12
+ Our core idea is to decouple high-level semantic reasoning from low-level geometric execution. Specifically, we train an LVLM prompter (e.g., Qwen-2.5-VL) to control a frozen Segment Anything Model (SAM2) via structured geometric prompts. Through a result-oriented reinforcement learning objective, the LVLM learns to translate abstract semantic reasoning into spatially grounded actions, achieving state-of-the-art performance on the EarthReason dataset.
13
+
14
+ For more details, code, and the complete framework, please visit our [GitHub repository](https://github.com/Ricardo-XZ/Think2Seg-RS).