File size: 1,359 Bytes
26db68e 3ff1f38 26db68e 3ff1f38 26db68e 69ae5d6 26db68e 69ae5d6 26db68e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 |
---
license: apache-2.0
pipeline_tag: image-text-to-text
library_name: transformers
---
# GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images
<!-- <p align="left">
<img src="pics/fig1_v.png" width="90%">
</p> -->
## Introduction
GEM is a multimodal LLM unifying ECG time series, 12-lead ECG images and text for grounded and clinician-aligned ECG interpretation. GEM enables feature-grounded analysis, evidence-driven reasoning, and a clinician-like diagnostic process.
## π₯ Updates
#### Project Page: π [Page](https://www.lanxplanet.com/GEM-ECG/)
#### Paper: π [Arxiv](https://arxiv.org/pdf/2503.06073)
#### Code: π» [GitHub](https://github.com/lanxiang1017/GEM)
#### Model: π€ [GEM](https://huggingface.co/LANSG/GEM)
#### Data: π€ [ECG-Grounding](https://huggingface.co/datasets/LANSG/ECG-Grounding)
## Citation
If you find GEM helpful for your research and applications, please cite our paper:
```bibtex
@misc{lan2025gemempoweringmllmgrounded,
title={GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images},
author={Xiang Lan and Feng Wu and Kai He and Qinghao Zhao and Shenda Hong and Mengling Feng},
year={2025},
eprint={2503.06073},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2503.06073},
}
```
|