checkone
/

ROSALIA-7B-v1

Image-Text-to-Text

Model card Files Files and versions

ROSALIA-7B-v1 / README.md

checkone's picture

Update README.md

dab2b8d verified 12 days ago

|

history blame contribute delete

1.55 kB

	---
	base_model:
	- xinlai/LISA-7B-v1
	pipeline_tag: image-text-to-text
	---

	# ROSALIA-7B-v1

	ROSALIA is a vision-language model (VLM) designed for precise lesion segmentation in chest X-rays (CXRs). It is a LISA model fine-tuned on the MIMIC-ILS dataset, a large-scale instruction-answer dataset for CXR lesion segmentation.

	ROSALIA is capable of Instruction-Guided Lesion Segmentation (ILS), a medical-domain adaptation of referring image segmentation (RIS), allowing it to segment diverse lesions and provide textual explanations in response to simple, user-friendly instructions.

	This model is the core checkpoint of the paper:
	[Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset](https://arxiv.org/abs/2511.15186), accepted to CVPR 2026.

	- Code: [https://github.com/checkoneee/ROSALIA](https://github.com/checkoneee/ROSALIA)
	- Dataset: [https://physionet.org/content/mimic-cxr-ext-ils/1.0.0/](https://physionet.org/content/mimic-cxr-ext-ils/1.0.0/)
	- Paper: [arXiv:2511.15186](https://arxiv.org/abs/2511.15186)

	## 📖 Citation

	If you find this model or the related research useful, please cite our work:

	```bibtex
	@article{choi2025instruction,
	title={Instruction-Guided Lesion Segmentation for Chest X-rays with Automatically Generated Large-Scale Dataset},
	author={Choi, Geon and Yoon, Hangyul and Shin, Hyunju and Park, Hyunki and Seo, Sang Hoon and Yang, Eunho and Choi, Edward},
	journal={arXiv preprint arXiv:2511.15186},
	year={2025}
	}
	```