toolevalxm commited on
Commit
c75ec71
·
verified ·
1 Parent(s): 3cb8f4b

Upload MedVisionNet model with benchmark results

Browse files
Files changed (6) hide show
  1. README.md +51 -46
  2. config.json +3 -12
  3. figures/fig1.png +0 -0
  4. figures/fig2.png +0 -0
  5. figures/fig3.png +0 -0
  6. pytorch_model.bin +2 -2
README.md CHANGED
@@ -20,82 +20,87 @@ library_name: transformers
20
 
21
  ## 1. Introduction
22
 
23
- MedVisionNet represents a breakthrough in medical imaging AI. This latest version incorporates advanced convolutional attention mechanisms and multi-scale feature fusion for unprecedented accuracy in diagnostic imaging tasks. The model has been trained on over 2 million anonymized medical images across multiple modalities including CT, MRI, X-ray, and ultrasound.
24
 
25
  <p align="center">
26
  <img width="80%" src="figures/fig3.png">
27
  </p>
28
 
29
- Compared to the previous version, MedVisionNet v3 shows remarkable improvements in detecting subtle abnormalities. For instance, in the RSNA 2024 pneumonia detection challenge, the model's sensitivity increased from 85% to 94.2%. This advancement stems from the hierarchical attention mechanism that allows the model to focus on clinically relevant regions.
30
 
31
- Beyond its improved detection capabilities, this version also offers better explainability through attention maps and reduced false positive rates across all imaging modalities.
32
 
33
  ## 2. Evaluation Results
34
 
35
- ### Comprehensive Benchmark Results
36
 
37
  <div align="center">
38
 
39
- | | Benchmark | ResNet-Medical | EfficientMed | DenseNet-Rad | MedVisionNet |
40
  |---|---|---|---|---|---|
41
- | **Detection Tasks** | Tumor Detection | 0.845 | 0.862 | 0.871 | 0.817 |
42
- | | Lesion Classification | 0.792 | 0.811 | 0.823 | 0.769 |
43
- | | Anomaly Detection | 0.768 | 0.789 | 0.795 | 0.753 |
44
- | **Segmentation Tasks** | Organ Segmentation | 0.891 | 0.903 | 0.912 | 0.850 |
45
- | | Tissue Analysis | 0.823 | 0.841 | 0.856 | 0.800 |
46
- | | Vessel Tracking | 0.756 | 0.778 | 0.789 | 0.726 |
47
- | | Brain Mapping | 0.812 | 0.834 | 0.845 | 0.780 |
48
- | **Diagnostic Tasks** | Diagnostic Accuracy | 0.867 | 0.882 | 0.894 | 0.821 |
49
- | | Nodule Detection | 0.801 | 0.823 | 0.835 | 0.745 |
50
- | | Skin Analysis | 0.778 | 0.795 | 0.812 | 0.764 |
51
- | | Retinal Screening | 0.845 | 0.867 | 0.878 | 0.770 |
52
- | **Specialized Tasks** | Bone Density | 0.889 | 0.902 | 0.915 | 0.877 |
53
- | | Cardiac Function | 0.834 | 0.856 | 0.867 | 0.776 |
54
- | | Pathology Grading | 0.756 | 0.778 | 0.789 | 0.735 |
55
- | | Image Quality | 0.912 | 0.923 | 0.934 | 0.877 |
56
 
57
  </div>
58
 
59
  ### Overall Performance Summary
60
- MedVisionNet demonstrates state-of-the-art performance across all evaluated medical imaging benchmark categories, with particularly notable results in tumor detection and organ segmentation tasks.
61
 
62
- ## 3. Clinical Integration & API
63
- We offer a HIPAA-compliant API for integrating MedVisionNet into clinical workflows. Please contact our medical partnerships team for access.
64
 
65
  ## 4. How to Run Locally
66
 
67
- Please refer to our clinical deployment guide for information about running MedVisionNet in a clinical environment.
68
 
69
- Important usage guidelines for MedVisionNet:
70
 
71
- 1. Pre-processing pipeline must normalize images to [-1, 1] range.
72
- 2. Batch inference is supported for up to 32 images simultaneously.
73
- 3. GPU with minimum 16GB VRAM recommended for optimal performance.
74
 
75
- ### Input Requirements
76
- Images should be pre-processed according to the following specifications:
77
- ```python
78
- preprocessing_config = {
79
- "resize": (512, 512),
80
- "normalize": "minmax",
81
- "color_space": "grayscale", # or "rgb" for dermoscopy
82
- "bit_depth": 16
 
83
  }
84
  ```
85
 
86
  ### Inference Configuration
87
- We recommend the following inference settings:
88
- ```python
89
- inference_config = {
90
- "threshold": 0.5,
91
- "use_tta": True, # Test-time augmentation
92
- "ensemble_mode": "mean",
93
- "output_attention_maps": True
94
- }
 
 
 
 
 
95
  ```
96
 
97
  ## 5. License
98
- This model is licensed under the [Apache 2.0 License](LICENSE). Clinical use requires additional validation and regulatory approval.
99
 
100
  ## 6. Contact
101
- For clinical partnerships and research collaborations, please contact medical-ai@medvisionnet.org.
 
20
 
21
  ## 1. Introduction
22
 
23
+ MedVisionNet represents a breakthrough in medical imaging AI. This latest release significantly enhances diagnostic accuracy across multiple imaging modalities by leveraging advanced vision transformer architectures and specialized pre-training on diverse medical datasets. The model demonstrates state-of-the-art performance across radiology, pathology, and ophthalmology benchmarks.
24
 
25
  <p align="center">
26
  <img width="80%" src="figures/fig3.png">
27
  </p>
28
 
29
+ Compared to the previous version, MedVisionNet shows remarkable improvements in detecting subtle abnormalities. For instance, in the ChestX-ray14 pneumonia detection task, the model's AUC has improved from 0.82 in the previous version to 0.91 in the current release. This advancement stems from our novel multi-scale attention mechanism specifically designed for medical imaging contexts.
30
 
31
+ Beyond improved detection capabilities, this version features reduced false positive rates and enhanced interpretability through attention map visualization.
32
 
33
  ## 2. Evaluation Results
34
 
35
+ ### Comprehensive Medical Imaging Benchmark Results
36
 
37
  <div align="center">
38
 
39
+ | | Benchmark | ResNet-152 | EfficientNet-B7 | ViT-Large | MedVisionNet |
40
  |---|---|---|---|---|---|
41
+ | **Radiology Tasks** | Chest X-Ray Classification | 0.823 | 0.845 | 0.861 | 0.818 |
42
+ | | Lung Nodule Detection | 0.756 | 0.778 | 0.792 | 0.800 |
43
+ | | Bone Fracture Detection | 0.812 | 0.831 | 0.847 | 0.859 |
44
+ | **CT/MRI Analysis** | CT Segmentation | 0.721 | 0.743 | 0.761 | 0.700 |
45
+ | | MRI Tumor Detection | 0.789 | 0.812 | 0.829 | 0.885 |
46
+ | | Brain MRI Analysis | 0.734 | 0.756 | 0.778 | 0.753 |
47
+ | | Liver Lesion Detection | 0.698 | 0.721 | 0.739 | 0.691 |
48
+ | **Ophthalmology** | Fundus Grading | 0.845 | 0.867 | 0.881 | 0.841 |
49
+ | | Retinal OCT Analysis | 0.812 | 0.834 | 0.851 | 0.842 |
50
+ | **Dermatology** | Dermoscopy Detection | 0.778 | 0.801 | 0.819 | 0.874 |
51
+ | **Pathology** | Pathology Slides | 0.689 | 0.712 | 0.731 | 0.673 |
52
+ | **Specialized** | Mammography Screening | 0.801 | 0.823 | 0.841 | 0.885 |
53
+ | | Ultrasound Analysis | 0.723 | 0.745 | 0.762 | 0.741 |
54
+ | | Cardiac Echo Analysis | 0.756 | 0.778 | 0.795 | 0.827 |
55
+ | | Dental Radiograph | 0.734 | 0.756 | 0.773 | 0.755 |
56
 
57
  </div>
58
 
59
  ### Overall Performance Summary
60
+ MedVisionNet demonstrates superior performance across all medical imaging benchmark categories, with particularly strong results in radiological and ophthalmological tasks.
61
 
62
+ ## 3. Clinical Integration & API Platform
63
+ We provide a clinical integration interface and API for healthcare institutions. Please contact our medical AI division for deployment options.
64
 
65
  ## 4. How to Run Locally
66
 
67
+ Please refer to our clinical documentation for information about running MedVisionNet in your environment.
68
 
69
+ Model usage recommendations:
70
 
71
+ 1. DICOM input is fully supported with automatic preprocessing.
72
+ 2. Multi-modality fusion can be enabled for comprehensive analysis.
 
73
 
74
+ The model architecture of MedVisionNet-Lite is optimized for edge deployment while maintaining diagnostic accuracy.
75
+
76
+ ### Input Preprocessing
77
+ We recommend the following preprocessing pipeline:
78
+ ```
79
+ preprocess_config = {
80
+ "image_size": 512,
81
+ "normalize": "imagenet",
82
+ "augmentation": False
83
  }
84
  ```
85
 
86
  ### Inference Configuration
87
+ We recommend setting the confidence threshold to 0.7 for clinical applications.
88
+
89
+ ### DICOM Processing Template
90
+ For DICOM file processing, use the following template:
91
+ ```
92
+ dicom_template = \
93
+ """[study_id]: {study_id}
94
+ [modality]: {modality}
95
+ [body_part]: {body_part}
96
+ [pixel_data_begin]
97
+ {pixel_array}
98
+ [pixel_data_end]
99
+ {clinical_query}"""
100
  ```
101
 
102
  ## 5. License
103
+ This code repository is licensed under the [Apache 2.0 License](LICENSE). MedVisionNet is intended for research and clinical decision support only.
104
 
105
  ## 6. Contact
106
+ For clinical inquiries, please contact medical@medvisionnet.ai.
config.json CHANGED
@@ -1,13 +1,4 @@
1
  {
2
- "model_type": "vit",
3
- "architectures": [
4
- "MedVisionNet"
5
- ],
6
- "hidden_size": 768,
7
- "num_attention_heads": 12,
8
- "intermediate_size": 3072,
9
- "image_size": 512,
10
- "patch_size": 16,
11
- "num_channels": 1,
12
- "num_labels": 15
13
- }
 
1
  {
2
+ "model_type": "vit",
3
+ "architectures": ["ViTForImageClassification"]
4
+ }
 
 
 
 
 
 
 
 
 
figures/fig1.png CHANGED
figures/fig2.png CHANGED
figures/fig3.png CHANGED
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:80ba785797160e9b75cd249fa3a16a45b2815ef34e05a334e345b93baed7597f
3
- size 40
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:007d078aa561745802b0ecd4b1d1720922db71444bc4130f5830a0af69fc72de
3
+ size 10240