Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -47,6 +47,14 @@ pipeline_tag: sentence-similarity
|
|
| 47 |
|
| 48 |
**CardioEmbed** is a domain-specialized embedding model fine-tuned on comprehensive cardiology textbooks for clinical applications. Built on [Qwen3-Embedding-8B](https://huggingface.co/Qwen/Qwen3-Embedding-8B) using LoRA adapters, this model achieves **state-of-the-art performance** on biomedical retrieval tasks while maintaining efficiency through 8-bit quantization.
|
| 49 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 50 |
### Key Features
|
| 51 |
|
| 52 |
- 🏥 **Medical Domain Expertise**: Trained on 106,432 cardiology-specific sentence pairs from authoritative textbooks
|
|
@@ -65,6 +73,14 @@ pipeline_tag: sentence-similarity
|
|
| 65 |
|
| 66 |
*MRR@10 on biomedical retrieval tasks. See [paper](https://arxiv.org/abs/XXXX.XXXXX) for full results.*
|
| 67 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 68 |
---
|
| 69 |
|
| 70 |
## Quick Start
|
|
@@ -298,7 +314,9 @@ For questions or issues, please open an issue on [GitHub](https://github.com/ric
|
|
| 298 |
|
| 299 |
**Built with ❤️ for advancing medical AI research**
|
| 300 |
|
| 301 |
-
*
|
|
|
|
|
|
|
| 302 |
|
| 303 |
</div>
|
| 304 |
|
|
|
|
| 47 |
|
| 48 |
**CardioEmbed** is a domain-specialized embedding model fine-tuned on comprehensive cardiology textbooks for clinical applications. Built on [Qwen3-Embedding-8B](https://huggingface.co/Qwen/Qwen3-Embedding-8B) using LoRA adapters, this model achieves **state-of-the-art performance** on biomedical retrieval tasks while maintaining efficiency through 8-bit quantization.
|
| 49 |
|
| 50 |
+
### Why CardioEmbed?
|
| 51 |
+
|
| 52 |
+
Cardiovascular disease remains the **leading cause of death globally**, accounting for approximately **18 million deaths annually** and representing nearly one-third of all mortality worldwide. In the United States alone, cardiovascular disease imposes an estimated annual economic burden exceeding **$400 billion** in direct medical costs and lost productivity.
|
| 53 |
+
|
| 54 |
+
As machine learning systems increasingly support clinical decision-making in cardiology—from risk stratification and diagnostic assistance to treatment optimization—the quality of semantic text representations becomes critical. However, existing biomedical embedding models trained primarily on PubMed research literature may not fully capture the **procedural knowledge and specialized terminology** found in clinical cardiology textbooks that practitioners actually use.
|
| 55 |
+
|
| 56 |
+
**CardioEmbed bridges this research-practice gap** by training on comprehensive cardiology textbooks, achieving near-perfect retrieval accuracy on cardiac-specific tasks while maintaining strong performance on general biomedical benchmarks.
|
| 57 |
+
|
| 58 |
### Key Features
|
| 59 |
|
| 60 |
- 🏥 **Medical Domain Expertise**: Trained on 106,432 cardiology-specific sentence pairs from authoritative textbooks
|
|
|
|
| 73 |
|
| 74 |
*MRR@10 on biomedical retrieval tasks. See [paper](https://arxiv.org/abs/XXXX.XXXXX) for full results.*
|
| 75 |
|
| 76 |
+
### Performance Visualization
|
| 77 |
+
|
| 78 |
+
CardioEmbed achieves **99.60% Acc@1** on cardiac-specific retrieval, outperforming MedTE (current SOTA medical embedding) by **+15.94 percentage points**:
|
| 79 |
+
|
| 80 |
+

|
| 81 |
+
|
| 82 |
+
*Figure: Comparison of CardioEmbed against state-of-the-art medical and general-purpose embedding models on cardiology retrieval tasks.*
|
| 83 |
+
|
| 84 |
---
|
| 85 |
|
| 86 |
## Quick Start
|
|
|
|
| 314 |
|
| 315 |
**Built with ❤️ for advancing medical AI research**
|
| 316 |
|
| 317 |
+
*By [Richard J. Young](https://deepneuro.ai) & Alice M. Matthews*
|
| 318 |
+
|
| 319 |
+
[](https://deepneuro.ai)
|
| 320 |
|
| 321 |
</div>
|
| 322 |
|