Upload best model checkpoint (epoch_25) with clinical benchmark results
Browse files- README.md +112 -0
- config.json +6 -0
- figures/architecture.png +0 -0
- figures/license_badge.png +0 -0
- figures/performance_chart.png +0 -0
- pytorch_model.bin +3 -0
README.md
ADDED
|
@@ -0,0 +1,112 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
library_name: transformers
|
| 4 |
+
---
|
| 5 |
+
# MedVision-DiagnosticsAI
|
| 6 |
+
<!-- markdownlint-disable first-line-h1 -->
|
| 7 |
+
<!-- markdownlint-disable html -->
|
| 8 |
+
<!-- markdownlint-disable no-duplicate-header -->
|
| 9 |
+
|
| 10 |
+
<div align="center">
|
| 11 |
+
<img src="figures/architecture.png" width="60%" alt="MedVision-DiagnosticsAI" />
|
| 12 |
+
</div>
|
| 13 |
+
<hr>
|
| 14 |
+
|
| 15 |
+
<div align="center" style="line-height: 1;">
|
| 16 |
+
<a href="LICENSE" style="margin: 2px;">
|
| 17 |
+
<img alt="License" src="figures/license_badge.png" style="display: inline-block; vertical-align: middle;"/>
|
| 18 |
+
</a>
|
| 19 |
+
</div>
|
| 20 |
+
|
| 21 |
+
## 1. Introduction
|
| 22 |
+
|
| 23 |
+
MedVision-DiagnosticsAI represents a breakthrough in medical imaging analysis, leveraging state-of-the-art Vision Transformer (ViT) architecture for multi-modal diagnostic tasks. The model has been extensively fine-tuned on diverse medical imaging datasets including X-rays, CT scans, and MRI images.
|
| 24 |
+
|
| 25 |
+
<p align="center">
|
| 26 |
+
<img width="80%" src="figures/performance_chart.png">
|
| 27 |
+
</p>
|
| 28 |
+
|
| 29 |
+
Our model achieves remarkable performance on several clinical benchmarks, demonstrating its potential for assisting healthcare professionals in diagnostic workflows. The architecture combines attention mechanisms with domain-specific pre-training to capture subtle patterns in medical imagery.
|
| 30 |
+
|
| 31 |
+
Key features of MedVision-DiagnosticsAI:
|
| 32 |
+
- Multi-modal medical image classification
|
| 33 |
+
- High sensitivity for early disease detection
|
| 34 |
+
- Calibrated uncertainty estimates
|
| 35 |
+
- HIPAA-compliant deployment options
|
| 36 |
+
|
| 37 |
+
## 2. Evaluation Results
|
| 38 |
+
|
| 39 |
+
### Comprehensive Benchmark Results
|
| 40 |
+
|
| 41 |
+
<div align="center">
|
| 42 |
+
|
| 43 |
+
| | Benchmark | Baseline | ModelA | ModelB-v2 | MedVision-DiagnosticsAI |
|
| 44 |
+
|---|---|---|---|---|---|
|
| 45 |
+
| **Classification Tasks** | Chest X-Ray Classification | 0.821 | 0.845 | 0.867 | 0.892 |
|
| 46 |
+
| | CT Scan Analysis | 0.756 | 0.778 | 0.801 | 0.844 |
|
| 47 |
+
| | MRI Segmentation | 0.698 | 0.721 | 0.745 | 0.856 |
|
| 48 |
+
| **Detection Tasks** | Tumor Detection | 0.812 | 0.834 | 0.851 | 0.889 |
|
| 49 |
+
| | Anomaly Localization | 0.745 | 0.768 | 0.789 | 0.819 |
|
| 50 |
+
| | Lesion Identification | 0.789 | 0.812 | 0.835 | 0.896 |
|
| 51 |
+
| **Clinical Metrics** | Sensitivity | 0.867 | 0.889 | 0.901 | 0.932 |
|
| 52 |
+
| | Specificity | 0.834 | 0.856 | 0.878 | 0.894 |
|
| 53 |
+
| | PPV (Precision) | 0.812 | 0.834 | 0.856 | 0.877 |
|
| 54 |
+
| | NPV | 0.845 | 0.867 | 0.889 | 0.914 |
|
| 55 |
+
| **Robustness** | Cross-Domain Transfer | 0.678 | 0.701 | 0.723 | 0.787 |
|
| 56 |
+
| | Noise Resilience | 0.712 | 0.734 | 0.756 | 0.797 |
|
| 57 |
+
| | Calibration Error | 0.089 | 0.078 | 0.067 | 0.065 |
|
| 58 |
+
|
| 59 |
+
</div>
|
| 60 |
+
|
| 61 |
+
### Overall Performance Summary
|
| 62 |
+
MedVision-DiagnosticsAI demonstrates exceptional performance across all evaluated clinical benchmarks, with particularly strong results in sensitivity and multi-modal classification tasks.
|
| 63 |
+
|
| 64 |
+
## 3. Clinical Applications
|
| 65 |
+
Our model is designed to assist healthcare professionals in:
|
| 66 |
+
- Rapid screening of chest X-rays
|
| 67 |
+
- CT scan abnormality detection
|
| 68 |
+
- MRI-based tissue analysis
|
| 69 |
+
- Cross-modality diagnostic support
|
| 70 |
+
|
| 71 |
+
## 4. How to Run Locally
|
| 72 |
+
|
| 73 |
+
Please refer to our code repository for detailed instructions on running MedVision-DiagnosticsAI locally.
|
| 74 |
+
|
| 75 |
+
### System Requirements
|
| 76 |
+
- GPU with at least 8GB VRAM
|
| 77 |
+
- Python 3.8+
|
| 78 |
+
- transformers >= 4.30.0
|
| 79 |
+
|
| 80 |
+
### Quick Start
|
| 81 |
+
```python
|
| 82 |
+
from transformers import AutoModelForImageClassification, AutoImageProcessor
|
| 83 |
+
|
| 84 |
+
model = AutoModelForImageClassification.from_pretrained("your-org/MedVision-DiagnosticsAI")
|
| 85 |
+
processor = AutoImageProcessor.from_pretrained("your-org/MedVision-DiagnosticsAI")
|
| 86 |
+
|
| 87 |
+
# Process your medical image
|
| 88 |
+
inputs = processor(images=your_image, return_tensors="pt")
|
| 89 |
+
outputs = model(**inputs)
|
| 90 |
+
```
|
| 91 |
+
|
| 92 |
+
### Inference Parameters
|
| 93 |
+
We recommend the following settings for optimal performance:
|
| 94 |
+
- Batch size: 1 (for clinical applications)
|
| 95 |
+
- Image size: 224x224
|
| 96 |
+
- Normalization: ImageNet statistics
|
| 97 |
+
|
| 98 |
+
## 5. License
|
| 99 |
+
This model is licensed under the [Apache 2.0 License](LICENSE). The model is intended for research and clinical decision support only.
|
| 100 |
+
|
| 101 |
+
## 6. Contact
|
| 102 |
+
If you have any questions, please raise an issue on our GitHub repository or contact us at support@medvision-ai.org.
|
| 103 |
+
|
| 104 |
+
## 7. Citation
|
| 105 |
+
```bibtex
|
| 106 |
+
@article{medvision2025,
|
| 107 |
+
title={MedVision-DiagnosticsAI: A Multi-Modal Medical Imaging Foundation Model},
|
| 108 |
+
author={MedVision Team},
|
| 109 |
+
journal={arXiv preprint},
|
| 110 |
+
year={2025}
|
| 111 |
+
}
|
| 112 |
+
```
|
config.json
ADDED
|
@@ -0,0 +1,6 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model_type": "vit",
|
| 3 |
+
"architectures": ["ViTForImageClassification"],
|
| 4 |
+
"num_labels": 5,
|
| 5 |
+
"image_size": 224
|
| 6 |
+
}
|
figures/architecture.png
ADDED
|
figures/license_badge.png
ADDED
|
figures/performance_chart.png
ADDED
|
pytorch_model.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1c0427ba2750594566e572217c24249d3d7cdba2506df27b01d0969d4ac705c8
|
| 3 |
+
size 2573
|