LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery
Paper
•
2505.02829
•
Published
LISAt_PRE is a remote-sensing-focused MLLM that is tailored to improve performance in scenarios requiring detailed visual understanding and natural language reasoning over satellite and aerial imagery.
LISAt_PRE enhances the LISAt framework by adapting it to remote-sensing applications, which require better handling of diverse visual data and specialized query types. The architecture integrates:
An architectural overview is shown in Figure 3 (refer to paper).
If you use LISAt_PRE in your work, please cite:
@article{quenum2025lisat,
title={LISAt: Language-Instructed Segmentation Assistant for Satellite Imagery},
author={Quenum, Jerome and Hsieh, Wen-Han and Wu, Tsung-Han and Gupta, Ritwik and Darrell, Trevor and Chan, David M},
journal={arXiv preprint arXiv:2505.02829},
year={2025},
url={https://arxiv.org/pdf/2505.02829}
}