OpenFace-CQUPT
/

PA-LLaVA-plus

Model card Files Files and versions

PA-LLaVA-plus / README.md

ddw2AIGROUP2CQUPT's picture

ddw2AIGROUP2CQUPT

Update README.md

8b24975 verified 6 months ago

|

history blame contribute delete

1.81 kB

	# PA-LLaVA-plus
	This is the first-stage weights trained on the 400w pathological image-text dataset using the PA-LLaVA model structure. The link is https://huggingface.co/OpenFace-CQUPT/PA-LLaVA-plus.

	## 400w Dataset
	The 400w pathology dataset is derived from the publicly available "Accessible Dataset (18M samples)" from MedTrinity-25M([UCSC-VLAA/MedTrinity-25M · Datasets at Hugging Face](https://huggingface.co/datasets/UCSC-VLAA/MedTrinity-25M)). This is a dataset spanning multiple medical fields. By analyzing the linguistic structure of the text in this dataset, we extracted a 400w dataset specific to the pathology domain.


	![image/png](https://cdn-uploads.huggingface.co/production/uploads/663f06e01cd68975883a353e/JEMAyDGi9uRUsWGo6UBTA.png)

	## Citation
	```
	@INPROCEEDINGS{10821785,
	author={Dai, Dawei and Zhang, Yuanhui and Xu, Long and Yang, Qianlan and Shen, Xiaojing and Xia, Shuyin and Wang, Guoyin},
	booktitle={2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)},
	title={PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding},
	year={2024},
	volume={},
	number={},
	pages={3138-3143},
	keywords={Connectors;Pathology;Visualization;Codes;Computational modeling;Biological system modeling;Data models;Cleaning;Bioinformatics;Biomedical imaging;Pathology Image Understanding;VQA;LLaVA},
	doi={10.1109/BIBM62325.2024.10821785}}

	@article{dai2025pathologyvlm,
	title={Pathologyvlm: a large vision-language model for pathology image understanding},
	author={Dai, Dawei and Zhang, Yuanhui and Yang, Qianlan and Xu, Long and Shen, Xiaojing and Xia, Shuyin and Wang, Guoyin},
	journal={Artificial Intelligence Review},
	volume={58},
	number={6},
	pages={1--19},
	year={2025},
	publisher={Springer}
	}
	```

	---
	license: cc
	---