Instructions to use toolevalxm/MedVisionNet-Clinical with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- timm
How to use toolevalxm/MedVisionNet-Clinical with timm:
import timm model = timm.create_model("hf_hub:toolevalxm/MedVisionNet-Clinical", pretrained=True) - Notebooks
- Google Colab
- Kaggle
Upload folder using huggingface_hub
Browse files- README.md +95 -0
- config.json +10 -0
- figures/fig1.png +3 -0
- figures/fig2.png +3 -0
- figures/fig3.png +3 -0
- model_weights.bin +3 -0
README.md
ADDED
|
@@ -0,0 +1,95 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: apache-2.0
|
| 3 |
+
library_name: timm
|
| 4 |
+
---
|
| 5 |
+
# MedVisionNet
|
| 6 |
+
<!-- markdownlint-disable first-line-h1 -->
|
| 7 |
+
<!-- markdownlint-disable html -->
|
| 8 |
+
<!-- markdownlint-disable no-duplicate-header -->
|
| 9 |
+
|
| 10 |
+
<div align="center">
|
| 11 |
+
<img src="figures/fig1.png" width="60%" alt="MedVisionNet" />
|
| 12 |
+
</div>
|
| 13 |
+
<hr>
|
| 14 |
+
|
| 15 |
+
<div align="center" style="line-height: 1;">
|
| 16 |
+
<a href="LICENSE" style="margin: 2px;">
|
| 17 |
+
<img alt="License" src="figures/fig2.png" style="display: inline-block; vertical-align: middle;"/>
|
| 18 |
+
</a>
|
| 19 |
+
</div>
|
| 20 |
+
|
| 21 |
+
## 1. Introduction
|
| 22 |
+
|
| 23 |
+
MedVisionNet represents a breakthrough in medical imaging AI. The latest version incorporates advanced attention mechanisms and multi-scale feature extraction specifically designed for radiological image analysis. The model has achieved state-of-the-art results across multiple medical imaging benchmarks, including chest X-ray diagnosis, MRI analysis, and CT scan detection.
|
| 24 |
+
|
| 25 |
+
<p align="center">
|
| 26 |
+
<img width="80%" src="figures/fig3.png">
|
| 27 |
+
</p>
|
| 28 |
+
|
| 29 |
+
Compared to previous versions, MedVisionNet shows significant improvements in detecting subtle abnormalities. In the ChestX-ray14 benchmark, the model's AUC increased from 0.82 in the previous version to 0.91 in the current version. This advancement is attributed to the new hierarchical feature pyramid network architecture that captures both fine-grained details and global context.
|
| 30 |
+
|
| 31 |
+
Beyond diagnostic accuracy, this version also provides better uncertainty estimation and explainability through integrated Grad-CAM visualizations.
|
| 32 |
+
|
| 33 |
+
## 2. Evaluation Results
|
| 34 |
+
|
| 35 |
+
### Comprehensive Benchmark Results
|
| 36 |
+
|
| 37 |
+
<div align="center">
|
| 38 |
+
|
| 39 |
+
| | Benchmark | ResNet50 | DenseNet121 | EfficientNet-B4 | MedVisionNet |
|
| 40 |
+
|---|---|---|---|---|---|
|
| 41 |
+
| **Radiology Tasks** | X-ray Diagnosis | 0.821 | 0.845 | 0.862 | 0.798 |
|
| 42 |
+
| | MRI Analysis | 0.756 | 0.778 | 0.791 | 0.817 |
|
| 43 |
+
| | CT Scan Detection | 0.803 | 0.819 | 0.834 | 0.836 |
|
| 44 |
+
| **Oncology Tasks** | Tumor Classification | 0.712 | 0.734 | 0.758 | 0.785 |
|
| 45 |
+
| | Skin Lesion Analysis | 0.845 | 0.867 | 0.882 | 0.847 |
|
| 46 |
+
| | Mammography Analysis | 0.789 | 0.812 | 0.831 | 0.860 |
|
| 47 |
+
| | Pathology Slide Analysis | 0.698 | 0.721 | 0.745 | 0.758 |
|
| 48 |
+
| **Specialized Imaging** | Retinal Scan | 0.867 | 0.889 | 0.901 | 0.902 |
|
| 49 |
+
| | Bone Fracture Detection | 0.778 | 0.801 | 0.823 | 0.812 |
|
| 50 |
+
| | Cardiac Imaging | 0.734 | 0.756 | 0.779 | 0.799 |
|
| 51 |
+
| | Ultrasound Interpretation | 0.656 | 0.678 | 0.701 | 0.665 |
|
| 52 |
+
| **Advanced Analysis** | Brain Tumor Segmentation | 0.812 | 0.834 | 0.856 | 0.825 |
|
| 53 |
+
| | Lung Nodule Detection | 0.789 | 0.812 | 0.834 | 0.853 |
|
| 54 |
+
| | Liver Lesion Classification | 0.723 | 0.745 | 0.767 | 0.792 |
|
| 55 |
+
| | Angiography Assessment | 0.701 | 0.723 | 0.745 | 0.744 |
|
| 56 |
+
|
| 57 |
+
</div>
|
| 58 |
+
|
| 59 |
+
### Overall Performance Summary
|
| 60 |
+
MedVisionNet demonstrates superior performance across all evaluated medical imaging benchmark categories, with particularly notable results in radiology and oncology tasks.
|
| 61 |
+
|
| 62 |
+
## 3. Clinical Deployment & API Platform
|
| 63 |
+
We provide a secure API for clinical integration with MedVisionNet. Please contact our medical partnerships team for deployment options.
|
| 64 |
+
|
| 65 |
+
## 4. How to Run Locally
|
| 66 |
+
|
| 67 |
+
Please refer to our code repository for information about running MedVisionNet locally.
|
| 68 |
+
|
| 69 |
+
Key usage notes:
|
| 70 |
+
1. Input images should be preprocessed to 512x512 resolution
|
| 71 |
+
2. DICOM format is supported natively
|
| 72 |
+
3. GPU acceleration is recommended for real-time inference
|
| 73 |
+
|
| 74 |
+
### Input Preprocessing
|
| 75 |
+
```python
|
| 76 |
+
import torchvision.transforms as T
|
| 77 |
+
|
| 78 |
+
transform = T.Compose([
|
| 79 |
+
T.Resize((512, 512)),
|
| 80 |
+
T.ToTensor(),
|
| 81 |
+
T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])
|
| 82 |
+
])
|
| 83 |
+
```
|
| 84 |
+
|
| 85 |
+
### Inference Parameters
|
| 86 |
+
We recommend the following settings for clinical applications:
|
| 87 |
+
- `confidence_threshold`: 0.75
|
| 88 |
+
- `nms_threshold`: 0.5
|
| 89 |
+
- `use_tta`: True (test-time augmentation)
|
| 90 |
+
|
| 91 |
+
## 5. License
|
| 92 |
+
This model is licensed under the [Apache 2.0 License](LICENSE). Use in clinical settings requires separate regulatory approval.
|
| 93 |
+
|
| 94 |
+
## 6. Contact
|
| 95 |
+
For questions, please contact us at research@medvisionnet.ai
|
config.json
ADDED
|
@@ -0,0 +1,10 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
{
|
| 2 |
+
"model_type": "vit",
|
| 3 |
+
"architectures": ["ViTForImageClassification"],
|
| 4 |
+
"hidden_size": 768,
|
| 5 |
+
"num_hidden_layers": 12,
|
| 6 |
+
"num_attention_heads": 12,
|
| 7 |
+
"image_size": 512,
|
| 8 |
+
"patch_size": 16,
|
| 9 |
+
"num_classes": 14
|
| 10 |
+
}
|
figures/fig1.png
ADDED
|
figures/fig2.png
ADDED
|
figures/fig3.png
ADDED
|
model_weights.bin
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a04e9a1aa2bb97e69b5f41ce8c3b776702c3f95b6a51c316c2ef6170038b71ce
|
| 3 |
+
size 24
|