richardyoung commited on
Commit
1c0f4fd
·
verified ·
1 Parent(s): b28e115

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +19 -1
README.md CHANGED
@@ -47,6 +47,14 @@ pipeline_tag: sentence-similarity
47
 
48
  **CardioEmbed** is a domain-specialized embedding model fine-tuned on comprehensive cardiology textbooks for clinical applications. Built on [Qwen3-Embedding-8B](https://huggingface.co/Qwen/Qwen3-Embedding-8B) using LoRA adapters, this model achieves **state-of-the-art performance** on biomedical retrieval tasks while maintaining efficiency through 8-bit quantization.
49
 
 
 
 
 
 
 
 
 
50
  ### Key Features
51
 
52
  - 🏥 **Medical Domain Expertise**: Trained on 106,432 cardiology-specific sentence pairs from authoritative textbooks
@@ -65,6 +73,14 @@ pipeline_tag: sentence-similarity
65
 
66
  *MRR@10 on biomedical retrieval tasks. See [paper](https://arxiv.org/abs/XXXX.XXXXX) for full results.*
67
 
 
 
 
 
 
 
 
 
68
  ---
69
 
70
  ## Quick Start
@@ -298,7 +314,9 @@ For questions or issues, please open an issue on [GitHub](https://github.com/ric
298
 
299
  **Built with ❤️ for advancing medical AI research**
300
 
301
- *Research by [Richard J. Young](https://huggingface.co/richardyoung) & Alice M. Matthews | [DeepNeuro.AI](https://deepneuro.ai)*
 
 
302
 
303
  </div>
304
 
 
47
 
48
  **CardioEmbed** is a domain-specialized embedding model fine-tuned on comprehensive cardiology textbooks for clinical applications. Built on [Qwen3-Embedding-8B](https://huggingface.co/Qwen/Qwen3-Embedding-8B) using LoRA adapters, this model achieves **state-of-the-art performance** on biomedical retrieval tasks while maintaining efficiency through 8-bit quantization.
49
 
50
+ ### Why CardioEmbed?
51
+
52
+ Cardiovascular disease remains the **leading cause of death globally**, accounting for approximately **18 million deaths annually** and representing nearly one-third of all mortality worldwide. In the United States alone, cardiovascular disease imposes an estimated annual economic burden exceeding **$400 billion** in direct medical costs and lost productivity.
53
+
54
+ As machine learning systems increasingly support clinical decision-making in cardiology—from risk stratification and diagnostic assistance to treatment optimization—the quality of semantic text representations becomes critical. However, existing biomedical embedding models trained primarily on PubMed research literature may not fully capture the **procedural knowledge and specialized terminology** found in clinical cardiology textbooks that practitioners actually use.
55
+
56
+ **CardioEmbed bridges this research-practice gap** by training on comprehensive cardiology textbooks, achieving near-perfect retrieval accuracy on cardiac-specific tasks while maintaining strong performance on general biomedical benchmarks.
57
+
58
  ### Key Features
59
 
60
  - 🏥 **Medical Domain Expertise**: Trained on 106,432 cardiology-specific sentence pairs from authoritative textbooks
 
73
 
74
  *MRR@10 on biomedical retrieval tasks. See [paper](https://arxiv.org/abs/XXXX.XXXXX) for full results.*
75
 
76
+ ### Performance Visualization
77
+
78
+ CardioEmbed achieves **99.60% Acc@1** on cardiac-specific retrieval, outperforming MedTE (current SOTA medical embedding) by **+15.94 percentage points**:
79
+
80
+ ![Model Comparison](https://raw.githubusercontent.com/ricyoung/CardioEmbed/master/Final_Published_Paper/figures/figure1_model_comparison.png)
81
+
82
+ *Figure: Comparison of CardioEmbed against state-of-the-art medical and general-purpose embedding models on cardiology retrieval tasks.*
83
+
84
  ---
85
 
86
  ## Quick Start
 
314
 
315
  **Built with ❤️ for advancing medical AI research**
316
 
317
+ *By [Richard J. Young](https://deepneuro.ai) & Alice M. Matthews*
318
+
319
+ [![DeepNeuro.AI](https://img.shields.io/badge/🧠-DeepNeuro.AI-orange)](https://deepneuro.ai)
320
 
321
  </div>
322