---
license: apache-2.0
language:
- en
pipeline_tag: image-to-text
tags:
- medical
---

# Model Card for PathBLIP-2

A vision-language model built on the BLIP-2 framework, in combination with BioGPT and HIPT, for pathology report generation and cross-modal retrieval of melanocytic lesions.

## Model Details

This repository contains multiple checkpoints of the models that were used for the experiments in the paper.
The models were trained on a dataset of 19,636 melanocytic lesion cases, each consisting of one or more whole slide images (WSIs) and a pathology report, using different training configurations.
The supporting code is available from the corresponding GitHub repository.
We refer to the paper for more information regarding the dataset, finetuning, evaluation, and limitations.

- **Paper:** ["On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation"](https://arxiv.org/abs/2502.19285)
- **Repository:** [GitHub](https://github.com/RTLucassen/report_preprocessing)
- **Framework:** [BLIP-2](https://huggingface.co/Salesforce/blip2-opt-2.7b)
- **Base language model:** [BioGPT](https://huggingface.co/microsoft/biogpt)
- **WSI feature extractor:** [HIPT](https://github.com/mahmoodlab/HIPT)
- **License:** Apache-2.0