Update README.md
README.md
CHANGED
|
@@ -10,4 +10,34 @@ base_model:
|
|
| 10 |
- Vision-CAIR/MiniGPT-4
|
| 11 |
tags:
|
| 12 |
- medical
|
| 13 |
-
---
|
| 10 |
- Vision-CAIR/MiniGPT-4
|
| 11 |
tags:
|
| 12 |
- medical
|
| 13 |
+
---
|
| 14 |
+
|
| 15 |
+
# PeFoMed
|
| 16 |
+
This is the official implementation of [PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging](https://arxiv.org/abs/2401.02797).
|
| 17 |
+
|
| 18 |
+
## Datasets
|
| 19 |
+
Configure each dataset in its corresponding configuration file under **pefomed/configs/datasets/medical**.
|
| 20 |
+
|
| 21 |
+
Stage 1 fine-tuning datasets: [ROCO](https://link.springer.com/chapter/10.1007/978-3-030-01364-6_20), [CLEF2022](https://ceur-ws.org/Vol-3180/paper-95.pdf), [MEDICAT](https://arxiv.org/abs/2010.06000), and [MIMIC-CXR](https://arxiv.org/abs/1901.07042).
|
| 22 |
+
|
| 23 |
+
Stage 2 fine-tuning medical VQA datasets: [VQA-RAD](https://www.nature.com/articles/sdata2018251#data-citations), [PathVQA](https://arxiv.org/abs/2003.10286), and [Slake](https://arxiv.org/abs/2102.09542).
|
| 24 |
+
|
| 25 |
+
Stage 2 fine-tuning medical report generation (MRG) dataset: [IU-Xray](https://pubmed.ncbi.nlm.nih.gov/26133894/).
|
| 26 |
+
|
| 27 |
+
## Acknowledgement
|
| 28 |
+
If you use PeFoMed in your research or applications, please cite it using this BibTeX entry:
|
| 29 |
+
```bibtex
|
| 30 |
+
@misc{liu2024pefomedparameterefficientfinetuning,
|
| 31 |
+
title={PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging},
|
| 32 |
+
author={Gang Liu and Jinlong He and Pengfei Li and Genrong He and Zhaolin Chen and Shenjun Zhong},
|
| 33 |
+
year={2024},
|
| 34 |
+
eprint={2401.02797},
|
| 35 |
+
archivePrefix={arXiv},
|
| 36 |
+
primaryClass={cs.CL},
|
| 37 |
+
url={https://arxiv.org/abs/2401.02797},
|
| 38 |
+
}
|
| 39 |
+
```
|
| 40 |
+
## License
|
| 41 |
+
This repository is released under the [BSD 3-Clause License](LICENSE.md).
|
| 42 |
+
|
| 43 |
+
Much of the code is based on [LAVIS](https://github.com/salesforce/LAVIS) and [MiniGPT-v2](https://github.com/Vision-CAIR/MiniGPT-4).
|