toolevalxm commited on
Commit
a3eea57
·
verified ·
1 Parent(s): cfbcad4

Upload MedAssist-Pro model (best checkpoint: epoch_8, eval_accuracy=0.745)

Browse files
Files changed (6) hide show
  1. README.md +56 -63
  2. config.json +3 -2
  3. figures/fig1.png +0 -0
  4. figures/fig2.png +0 -0
  5. figures/fig3.png +0 -0
  6. pytorch_model.bin +2 -2
README.md CHANGED
@@ -20,105 +20,98 @@ library_name: transformers
20
 
21
  ## 1. Introduction
22
 
23
- MedAssist-Pro represents a breakthrough in medical AI assistance. In the latest release, MedAssist-Pro has dramatically improved its clinical reasoning and diagnostic capabilities through extensive training on anonymized medical records and peer-reviewed literature. The model demonstrates exceptional performance across multiple healthcare evaluation benchmarks, including diagnosis accuracy, treatment planning, and patient communication. Its overall medical reasoning capability now rivals that of experienced clinicians in specific domains.
24
 
25
  <p align="center">
26
  <img width="80%" src="figures/fig3.png">
27
  </p>
28
 
29
- Compared to the previous version, this upgrade shows remarkable improvements in handling complex diagnostic scenarios. For instance, in the MedQA-USMLE benchmark, the model's accuracy increased from 65% in the previous version to 82.3% in the current version. This improvement stems from enhanced multi-step clinical reasoning: in diagnostic cases, the previous model used an average of 8K tokens per case, whereas the new version averages 18K tokens per case.
30
 
31
- Beyond diagnostic capabilities, this version also offers improved drug interaction detection and enhanced HIPAA-compliant communication patterns.
32
 
33
  ## 2. Evaluation Results
34
 
35
- ### Comprehensive Benchmark Results
36
 
37
  <div align="center">
38
 
39
- | | Benchmark | ClinicalBERT | PubMedGPT | MedPaLM | MedAssist-Pro |
40
  |---|---|---|---|---|---|
41
- | **Diagnostic Tasks** | Diagnosis Accuracy | 0.625 | 0.651 | 0.689 | 0.635 |
42
- | | Radiology Analysis | 0.712 | 0.734 | 0.761 | 0.564 |
43
- | | Pathology Detection | 0.688 | 0.701 | 0.745 | 0.581 |
44
- | **Clinical Understanding** | Patient History | 0.701 | 0.723 | 0.755 | 0.640 |
45
- | | Symptom Analysis | 0.656 | 0.678 | 0.712 | 0.582 |
46
- | | Lab Interpretation | 0.734 | 0.756 | 0.789 | 0.659 |
47
- | | Clinical Notes | 0.689 | 0.712 | 0.734 | 0.642 |
48
- | **Treatment Planning** | Treatment Planning | 0.623 | 0.645 | 0.678 | 0.564 |
49
- | | Drug Interaction | 0.756 | 0.778 | 0.801 | 0.720 |
50
- | | Medication Dosage | 0.712 | 0.734 | 0.767 | 0.629 |
51
- | | Prognosis Prediction | 0.645 | 0.667 | 0.689 | 0.604 |
52
- | **Specialized Capabilities**| Medical QA | 0.678 | 0.701 | 0.734 | 0.642 |
53
- | | Emergency Triage | 0.723 | 0.745 | 0.778 | 0.667 |
54
- | | Patient Communication | 0.689 | 0.712 | 0.745 | 0.669 |
55
- | | HIPAA Compliance | 0.812 | 0.834 | 0.856 | 0.762 |
56
 
57
  </div>
58
 
59
  ### Overall Performance Summary
60
- MedAssist-Pro demonstrates exceptional performance across all evaluated clinical benchmark categories, with particularly strong results in diagnostic reasoning and treatment planning tasks.
61
 
62
- ## 3. Clinical Dashboard & API Platform
63
- We offer a secure clinical dashboard and HIPAA-compliant API for healthcare institutions to integrate MedAssist-Pro. Please contact our medical partnerships team for access.
64
 
65
- ## 4. How to Deploy Locally
66
 
67
- Please refer to our deployment documentation for information about running MedAssist-Pro in clinical environments.
68
 
69
- Compared to previous versions, the deployment recommendations for MedAssist-Pro have the following changes:
70
 
71
- 1. PHI (Protected Health Information) handling is now built-in.
72
- 2. Audit logging for all clinical interactions is enabled by default.
73
 
74
- The model architecture of MedAssist-Pro-Lite is identical to its base model, but optimized for edge deployment in clinical settings.
75
 
76
  ### System Prompt
77
- We recommend using the following system prompt for clinical applications.
78
  ```
79
- You are MedAssist-Pro, a clinical decision support AI.
80
  Today is {current date}.
81
- IMPORTANT: Always recommend consulting a licensed physician for final medical decisions.
82
- ```
83
- For example,
84
- ```
85
- You are MedAssist-Pro, a clinical decision support AI.
86
- Today is May 28, 2025, Monday.
87
- IMPORTANT: Always recommend consulting a licensed physician for final medical decisions.
88
  ```
 
89
  ### Temperature
90
- We recommend setting the temperature parameter $T_{model}$ to 0.3 for clinical applications to ensure consistent outputs.
91
 
92
- ### Prompts for Medical Record Analysis
93
- For patient record analysis, follow this template where {patient_id}, {record_content} and {clinical_question} are arguments.
94
  ```
95
- record_template = \
96
- """[patient_id]: {patient_id}
97
- [medical_record begin]
98
  {record_content}
99
- [medical_record end]
100
- {clinical_question}"""
101
  ```
102
- For differential diagnosis, we recommend the following prompt template where {symptoms}, {patient_demographics}, and {lab_results} are arguments.
 
103
  ```
104
- diagnosis_template = \
105
- '''# Patient Presentation Analysis:
106
- {symptoms}
107
- Patient Demographics: {patient_demographics}
108
- Laboratory Results: {lab_results}
109
-
110
- Based on the clinical presentation above, provide:
111
- 1. Primary differential diagnoses (ranked by probability)
112
- 2. Recommended additional tests
113
- 3. Red flags requiring immediate attention
114
- 4. Suggested treatment pathway
115
-
116
- DISCLAIMER: This analysis is for clinical decision support only. Final diagnosis must be made by a licensed physician.'''
117
  ```
118
 
119
  ## 5. License
120
- This model is licensed under the [Apache 2.0 License](LICENSE). Use of MedAssist-Pro in clinical settings requires additional compliance certification. The model supports research and clinical decision support applications.
121
 
122
  ## 6. Contact
123
- For clinical partnerships or technical support, please contact medical@medassist-pro.ai or raise an issue on our secure GitHub repository.
124
- ```
 
20
 
21
  ## 1. Introduction
22
 
23
+ MedAssist-Pro represents a breakthrough in medical AI technology. In this release, MedAssist-Pro has significantly enhanced its clinical reasoning and diagnostic accuracy by incorporating extensive medical literature and clinical trial data. The model demonstrates state-of-the-art performance across various healthcare benchmarks, including disease diagnosis, drug interaction analysis, and clinical documentation.
24
 
25
  <p align="center">
26
  <img width="80%" src="figures/fig3.png">
27
  </p>
28
 
29
+ Compared to the previous version, MedAssist-Pro shows remarkable improvements in complex medical scenarios. For instance, in the MedQA benchmark, the model's accuracy has increased from 62% in the previous version to 78.5% in the current version. This advancement stems from enhanced medical knowledge integration: the model now processes an average of 18K tokens per clinical case, compared to 8K tokens in the previous version.
30
 
31
+ Beyond its improved diagnostic capabilities, this version also offers reduced hallucination rates in medical contexts and enhanced support for multi-modal clinical inputs.
32
 
33
  ## 2. Evaluation Results
34
 
35
+ ### Comprehensive Medical Benchmark Results
36
 
37
  <div align="center">
38
 
39
+ | | Benchmark | GPT-Med | Claude-Health | MedPaLM-2 | MedAssist-Pro |
40
  |---|---|---|---|---|---|
41
+ | **Diagnostic Tasks** | Diagnosis Accuracy | 0.682 | 0.695 | 0.710 | 0.730 |
42
+ | | Drug Interaction | 0.715 | 0.728 | 0.735 | 0.733 |
43
+ | | Clinical Reasoning | 0.654 | 0.671 | 0.689 | 0.785 |
44
+ | **Knowledge Tasks** | Medical QA | 0.621 | 0.638 | 0.655 | 0.647 |
45
+ | | Radiology Interpretation | 0.598 | 0.612 | 0.628 | 0.659 |
46
+ | | Lab Result Interpretation | 0.709 | 0.722 | 0.738 | 0.792 |
47
+ | | Symptom Analysis | 0.687 | 0.701 | 0.715 | 0.731 |
48
+ | **Clinical Operations** | Patient Summarization | 0.745 | 0.761 | 0.778 | 0.815 |
49
+ | | Treatment Recommendation | 0.632 | 0.648 | 0.665 | 0.697 |
50
+ | | Medical Coding | 0.698 | 0.714 | 0.729 | 0.718 |
51
+ | | Surgical Planning | 0.578 | 0.591 | 0.608 | 0.597 |
52
+ | **Safety & Compliance**| Patient Triage | 0.823 | 0.838 | 0.852 | 0.832 |
53
+ | | EHR Extraction | 0.691 | 0.705 | 0.721 | 0.736 |
54
+ | | Medical Safety | 0.856 | 0.869 | 0.882 | 0.862 |
55
+ | | Clinical Documentation | 0.734 | 0.749 | 0.765 | 0.789 |
56
 
57
  </div>
58
 
59
  ### Overall Performance Summary
60
+ MedAssist-Pro demonstrates strong performance across all evaluated medical benchmark categories, with particularly notable results in diagnostic tasks and safety compliance.
61
 
62
+ ## 3. Clinical Interface & API Platform
63
+ We offer a clinical interface and API for healthcare professionals to interact with MedAssist-Pro. Please check our official website for more details and HIPAA compliance documentation.
64
 
65
+ ## 4. How to Run Locally
66
 
67
+ Please refer to our code repository for more information about running MedAssist-Pro locally.
68
 
69
+ Compared to previous versions, the usage recommendations for MedAssist-Pro have the following changes:
70
 
71
+ 1. Medical context system prompt is supported.
72
+ 2. It is not required to add special tokens at the beginning of the output to force the model into a specific clinical reasoning pattern.
73
 
74
+ The model architecture of MedAssist-Pro-Lite is identical to its base model, but it shares the same tokenizer configuration as the main MedAssist-Pro.
75
 
76
  ### System Prompt
77
+ We recommend using the following system prompt with clinical context.
78
  ```
79
+ You are MedAssist-Pro, a medical AI assistant designed to support healthcare professionals.
80
  Today is {current date}.
81
+ IMPORTANT: This AI is for clinical decision support only. Always consult with qualified medical professionals.
 
 
 
 
 
 
82
  ```
83
+
84
  ### Temperature
85
+ We recommend setting the temperature parameter $T_{model}$ to 0.3 for clinical applications to ensure consistent and reliable outputs.
86
 
87
+ ### Prompts for Clinical Data Processing
88
+ For patient record processing, please follow the template to create prompts, where {patient_id}, {record_content} and {clinical_query} are arguments.
89
  ```
90
+ clinical_template = \
91
+ """[Patient ID]: {patient_id}
92
+ [Clinical Record Begin]
93
  {record_content}
94
+ [Clinical Record End]
95
+ {clinical_query}"""
96
  ```
97
+
98
+ For literature-enhanced generation, we recommend the following prompt template where {literature_results}, {cur_date}, and {clinical_question} are arguments.
99
  ```
100
+ literature_answer_template = \
101
+ '''# The following contents are relevant medical literature:
102
+ {literature_results}
103
+ In the literature I provide to you, each source is formatted as [source X begin]...[source X end], where X represents the numerical index of each reference. Please cite appropriately using [citation:X] format.
104
+ When responding, please keep the following points in mind:
105
+ - Today is {cur_date}.
106
+ - Evaluate the relevance and quality of each literature source.
107
+ - For diagnostic questions, prioritize evidence-based guidelines.
108
+ - Always note limitations and recommend appropriate follow-up.
109
+ # The clinical question is:
110
+ {clinical_question}'''
 
 
111
  ```
112
 
113
  ## 5. License
114
+ This code repository is licensed under the [Apache 2.0 License](LICENSE). The use of MedAssist-Pro models is subject to additional healthcare compliance requirements.
115
 
116
  ## 6. Contact
117
+ If you have any questions, please raise an issue on our GitHub repository or contact us at support@medassist-pro.ai.
 
config.json CHANGED
@@ -1,4 +1,5 @@
1
  {
2
- "model_type": "roberta",
3
- "architectures": ["RobertaForSequenceClassification"]
 
4
  }
 
1
  {
2
+ "model_type": "llama",
3
+ "architectures": ["LlamaForCausalLM"],
4
+ "medical_domain": true
5
  }
figures/fig1.png CHANGED
figures/fig2.png CHANGED
figures/fig3.png CHANGED
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:71385b791b6123dbaa45c123259f415545aaa0dba8eb86297c6b1b9b470a71fa
3
- size 24
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48f9d873421b9a6b938298a15e6a84103c29668fd9fd1e5158e156f51627f5e0
3
+ size 40