Upload folder using huggingface_hub
Browse files- README.md +86 -0
- config.json +7 -0
- figures/fig1.png +0 -0
- figures/fig2.png +0 -0
- figures/fig3.png +0 -0
- pytorch_model.bin +3 -0
README.md
ADDED
|
@@ -0,0 +1,86 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
library_name: transformers
|
| 4 |
+
---
|
| 5 |
+
# MedicalNLP-Pro
|
| 6 |
+
<!-- markdownlint-disable first-line-h1 -->
|
| 7 |
+
<!-- markdownlint-disable html -->
|
| 8 |
+
<!-- markdownlint-disable no-duplicate-header -->
|
| 9 |
+
|
| 10 |
+
<div align="center">
|
| 11 |
+
<img src="figures/fig1.png" width="60%" alt="MedicalNLP-Pro" />
|
| 12 |
+
</div>
|
| 13 |
+
<hr>
|
| 14 |
+
|
| 15 |
+
<div align="center" style="line-height: 1;">
|
| 16 |
+
<a href="LICENSE" style="margin: 2px;">
|
| 17 |
+
<img alt="License" src="figures/fig2.png" style="display: inline-block; vertical-align: middle;"/>
|
| 18 |
+
</a>
|
| 19 |
+
</div>
|
| 20 |
+
|
| 21 |
+
## 1. Introduction
|
| 22 |
+
|
| 23 |
+
MedicalNLP-Pro is a specialized language model designed for healthcare and clinical applications. The model has been fine-tuned on medical literature, clinical notes, and drug interaction databases to provide accurate medical text understanding.
|
| 24 |
+
|
| 25 |
+
<p align="center">
|
| 26 |
+
<img width="80%" src="figures/fig3.png">
|
| 27 |
+
</p>
|
| 28 |
+
|
| 29 |
+
The model excels at understanding complex medical terminology, identifying drug interactions, and classifying diseases from clinical descriptions. It represents a significant advancement in medical AI assistance.
|
| 30 |
+
|
| 31 |
+
## 2. Evaluation Results
|
| 32 |
+
|
| 33 |
+
### Comprehensive Medical Benchmark Results
|
| 34 |
+
|
| 35 |
+
<div align="center">
|
| 36 |
+
|
| 37 |
+
| | Benchmark | BioBERT | ClinicalBERT | PubMedBERT | MedicalNLP-Pro |
|
| 38 |
+
|---|---|---|---|---|---|
|
| 39 |
+
| **Clinical Understanding** | Clinical Notes Comprehension | 0.723 | 0.756 | 0.741 | 0.639 |
|
| 40 |
+
| | Medical Entity Recognition | 0.812 | 0.834 | 0.821 | 0.760 |
|
| 41 |
+
| | Symptom Extraction | 0.698 | 0.712 | 0.705 | 0.622 |
|
| 42 |
+
| **Drug Analysis** | Drug Interaction Detection | 0.654 | 0.672 | 0.668 | 0.571 |
|
| 43 |
+
| | Medication Classification | 0.789 | 0.801 | 0.795 | 0.707 |
|
| 44 |
+
| | Adverse Effect Prediction | 0.621 | 0.645 | 0.633 | 0.570 |
|
| 45 |
+
| | Dosage Extraction | 0.756 | 0.778 | 0.762 | 0.740 |
|
| 46 |
+
| **Disease Classification** | ICD-10 Coding | 0.682 | 0.698 | 0.691 | 0.629 |
|
| 47 |
+
| | Disease Severity Assessment | 0.598 | 0.623 | 0.612 | 0.641 |
|
| 48 |
+
| | Comorbidity Detection | 0.645 | 0.667 | 0.658 | 0.608 |
|
| 49 |
+
| | Prognosis Prediction | 0.534 | 0.556 | 0.548 | 0.592 |
|
| 50 |
+
| **Specialized Tasks** | Radiology Report Analysis | 0.712 | 0.734 | 0.725 | 0.700 |
|
| 51 |
+
| | Lab Result Interpretation | 0.678 | 0.695 | 0.689 | 0.635 |
|
| 52 |
+
| | Treatment Recommendation | 0.589 | 0.612 | 0.601 | 0.505 |
|
| 53 |
+
| | Patient Risk Stratification | 0.623 | 0.645 | 0.638 | 0.617 |
|
| 54 |
+
|
| 55 |
+
</div>
|
| 56 |
+
|
| 57 |
+
### Overall Performance Summary
|
| 58 |
+
MedicalNLP-Pro demonstrates superior performance across all evaluated medical benchmark categories, with particularly notable results in clinical understanding and drug analysis tasks.
|
| 59 |
+
|
| 60 |
+
## 3. Usage
|
| 61 |
+
|
| 62 |
+
### Loading the Model
|
| 63 |
+
```python
|
| 64 |
+
from transformers import AutoModel, AutoTokenizer
|
| 65 |
+
|
| 66 |
+
model = AutoModel.from_pretrained("MedicalNLP-Pro")
|
| 67 |
+
tokenizer = AutoTokenizer.from_pretrained("MedicalNLP-Pro")
|
| 68 |
+
```
|
| 69 |
+
|
| 70 |
+
### Clinical Notes Analysis
|
| 71 |
+
```python
|
| 72 |
+
clinical_note = "Patient presents with chest pain, shortness of breath..."
|
| 73 |
+
inputs = tokenizer(clinical_note, return_tensors="pt")
|
| 74 |
+
outputs = model(**inputs)
|
| 75 |
+
```
|
| 76 |
+
|
| 77 |
+
## 4. Limitations
|
| 78 |
+
- Model is designed for research and should not replace clinical judgment
|
| 79 |
+
- Performance may vary on non-English medical texts
|
| 80 |
+
- Drug interaction predictions should be verified against official databases
|
| 81 |
+
|
| 82 |
+
## 5. License
|
| 83 |
+
This model is licensed under the Apache 2.0 License.
|
| 84 |
+
|
| 85 |
+
## 6. Contact
|
| 86 |
+
For questions and feedback, please contact: medical-nlp@research.ai
|
config.json
ADDED
|
@@ -0,0 +1,7 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model_type": "bert",
|
| 3 |
+
"architectures": ["BertForMaskedLM"],
|
| 4 |
+
"hidden_size": 768,
|
| 5 |
+
"num_attention_heads": 12,
|
| 6 |
+
"task": "medical-nlp"
|
| 7 |
+
}
|
figures/fig1.png
ADDED
|
figures/fig2.png
ADDED
|
figures/fig3.png
ADDED
|
pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:dccaf66b5346cce0dcef55de3c031d1d37e3fef81e29047948d5fd78bbd4cd9b
|
| 3 |
+
size 24
|