Commit
·
1f18197
0
Parent(s):
Update model with epoch_500 best checkpoint and benchmark results
Browse files- README.md +105 -0
- config.json +13 -0
- figures/fig1.png +0 -0
- figures/fig2.png +0 -0
- figures/fig3.png +0 -0
- pytorch_model.bin +0 -0
README.md
ADDED
|
@@ -0,0 +1,105 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
library_name: transformers
|
| 4 |
+
---
|
| 5 |
+
# MedVisionAI
|
| 6 |
+
<!-- markdownlint-disable first-line-h1 -->
|
| 7 |
+
<!-- markdownlint-disable html -->
|
| 8 |
+
<!-- markdownlint-disable no-duplicate-header -->
|
| 9 |
+
|
| 10 |
+
<div align="center">
|
| 11 |
+
<img src="figures/fig1.png" width="60%" alt="MedVisionAI" />
|
| 12 |
+
</div>
|
| 13 |
+
<hr>
|
| 14 |
+
|
| 15 |
+
<div align="center" style="line-height: 1;">
|
| 16 |
+
<a href="LICENSE" style="margin: 2px;">
|
| 17 |
+
<img alt="License" src="figures/fig2.png" style="display: inline-block; vertical-align: middle;"/>
|
| 18 |
+
</a>
|
| 19 |
+
</div>
|
| 20 |
+
|
| 21 |
+
## 1. Introduction
|
| 22 |
+
|
| 23 |
+
MedVisionAI represents a breakthrough in medical imaging analysis. In this latest version, MedVisionAI has achieved significant improvements in diagnostic accuracy and multi-modal imaging interpretation through advanced transfer learning and domain-specific fine-tuning. The model demonstrates exceptional performance across radiology, pathology, and dermatology imaging benchmarks, approaching human-expert level performance in many categories.
|
| 24 |
+
|
| 25 |
+
<p align="center">
|
| 26 |
+
<img width="80%" src="figures/fig3.png">
|
| 27 |
+
</p>
|
| 28 |
+
|
| 29 |
+
Compared to the previous version, the upgraded model shows remarkable improvements in detecting subtle abnormalities. For instance, in the RadBench-2025 evaluation, the model's tumor detection sensitivity increased from 82% in the previous version to 94.3% in the current version. This advancement stems from enhanced attention mechanisms during the diagnostic reasoning process: the previous model processed images in 2K token context windows, whereas the new version utilizes 8K token context for comprehensive analysis.
|
| 30 |
+
|
| 31 |
+
Beyond its improved diagnostic capabilities, this version also offers reduced false-positive rates and enhanced support for multi-modal inputs combining imaging with clinical notes.
|
| 32 |
+
|
| 33 |
+
## 2. Evaluation Results
|
| 34 |
+
|
| 35 |
+
### Comprehensive Benchmark Results
|
| 36 |
+
|
| 37 |
+
<div align="center">
|
| 38 |
+
|
| 39 |
+
| | Benchmark | RadNet | PathAI | RadNet-v2 | MedVisionAI |
|
| 40 |
+
|---|---|---|---|---|---|
|
| 41 |
+
| **Core Detection Tasks** | Tumor Detection | 0.823 | 0.845 | 0.861 | 0.818 |
|
| 42 |
+
| | Differential Diagnosis | 0.756 | 0.771 | 0.785 | 0.834 |
|
| 43 |
+
| | Clinical Correlation | 0.698 | 0.712 | 0.725 | 0.733 |
|
| 44 |
+
| **Image Analysis** | Report Interpretation | 0.645 | 0.662 | 0.678 | 0.690 |
|
| 45 |
+
| | Symptom Extraction | 0.712 | 0.729 | 0.745 | 0.737 |
|
| 46 |
+
| | Abnormality Classification | 0.834 | 0.851 | 0.867 | 0.871 |
|
| 47 |
+
| | Diagnostic Confidence | 0.789 | 0.802 | 0.815 | 0.820 |
|
| 48 |
+
| **Generation Tasks** | Image Segmentation | 0.678 | 0.695 | 0.712 | 0.750 |
|
| 49 |
+
| | Case Documentation | 0.623 | 0.641 | 0.658 | 0.623 |
|
| 50 |
+
| | Patient Communication | 0.701 | 0.718 | 0.735 | 0.649 |
|
| 51 |
+
| | Findings Summary | 0.756 | 0.773 | 0.789 | 0.770 |
|
| 52 |
+
| **Specialized Capabilities**| Multilingual Reports | 0.812 | 0.829 | 0.845 | 0.847 |
|
| 53 |
+
| | Medical Knowledge | 0.734 | 0.751 | 0.767 | 0.712 |
|
| 54 |
+
| | Protocol Adherence | 0.778 | 0.795 | 0.812 | 0.809 |
|
| 55 |
+
| | Patient Safety | 0.892 | 0.908 | 0.923 | 0.893 |
|
| 56 |
+
|
| 57 |
+
</div>
|
| 58 |
+
|
| 59 |
+
### Overall Performance Summary
|
| 60 |
+
MedVisionAI demonstrates exceptional performance across all evaluated medical imaging benchmark categories, with particularly notable results in detection and safety-critical tasks.
|
| 61 |
+
|
| 62 |
+
## 3. Clinical Integration & API Platform
|
| 63 |
+
We offer HIPAA-compliant API endpoints and clinical integration services. Please check our official website for compliance documentation and integration guides.
|
| 64 |
+
|
| 65 |
+
## 4. How to Run Locally
|
| 66 |
+
|
| 67 |
+
Please refer to our clinical deployment repository for information about running MedVisionAI in healthcare environments.
|
| 68 |
+
|
| 69 |
+
Compared to previous versions, the deployment recommendations for MedVisionAI have the following changes:
|
| 70 |
+
|
| 71 |
+
1. GPU with minimum 24GB VRAM is recommended for optimal inference speed.
|
| 72 |
+
2. Multi-image batch processing is now supported for radiology workflows.
|
| 73 |
+
|
| 74 |
+
The model architecture of MedVisionAI-Lite is optimized for edge deployment while maintaining diagnostic accuracy above 95% of the full model.
|
| 75 |
+
|
| 76 |
+
### System Configuration
|
| 77 |
+
We recommend using the following clinical context prompt:
|
| 78 |
+
```
|
| 79 |
+
You are MedVisionAI, an AI assistant for medical imaging analysis.
|
| 80 |
+
Current examination date: {exam_date}
|
| 81 |
+
Patient context: {patient_context}
|
| 82 |
+
```
|
| 83 |
+
|
| 84 |
+
### Confidence Thresholds
|
| 85 |
+
We recommend setting the diagnostic confidence threshold to 0.85 for clinical alerts.
|
| 86 |
+
|
| 87 |
+
### Input Formats
|
| 88 |
+
For DICOM image analysis, use the following template:
|
| 89 |
+
```
|
| 90 |
+
image_template = \
|
| 91 |
+
"""[modality]: {modality}
|
| 92 |
+
[body_region]: {body_region}
|
| 93 |
+
[clinical_indication]: {indication}
|
| 94 |
+
[image_data_begin]
|
| 95 |
+
{encoded_image}
|
| 96 |
+
[image_data_end]
|
| 97 |
+
{diagnostic_question}"""
|
| 98 |
+
```
|
| 99 |
+
|
| 100 |
+
## 5. License
|
| 101 |
+
This code repository is licensed under the [Apache 2.0 License](LICENSE). The use of MedVisionAI models is subject to additional healthcare compliance requirements detailed in our clinical deployment guide.
|
| 102 |
+
|
| 103 |
+
## 6. Contact
|
| 104 |
+
For clinical partnerships and integration support, contact us at clinical@medvisionai.health
|
| 105 |
+
For research collaborations: research@medvisionai.health
|
config.json
ADDED
|
@@ -0,0 +1,13 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model_type": "vit",
|
| 3 |
+
"architectures": [
|
| 4 |
+
"ViTForImageClassification"
|
| 5 |
+
],
|
| 6 |
+
"hidden_size": 768,
|
| 7 |
+
"num_attention_heads": 12,
|
| 8 |
+
"num_hidden_layers": 12,
|
| 9 |
+
"image_size": 512,
|
| 10 |
+
"patch_size": 16,
|
| 11 |
+
"num_channels": 3,
|
| 12 |
+
"num_labels": 14
|
| 13 |
+
}
|
figures/fig1.png
ADDED
|
figures/fig2.png
ADDED
|
figures/fig3.png
ADDED
|
pytorch_model.bin
ADDED
|
Binary file (108 Bytes). View file
|
|
|