Text Generation
Transformers
Safetensors
English
phi3
Merge
mergekit
medical
clinical
conversational
text-generation-inference
jpcorb20 commited on
Commit
b8fdccb
·
verified ·
1 Parent(s): aa7d59e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -8
README.md CHANGED
@@ -1,11 +1,9 @@
1
  ---
2
  license: mit
3
  datasets:
4
- - ncbi/pubmed
5
  - starmpcc/Asclepius-Synthetic-Clinical-Notes
6
  - akemiH/NoteChat
7
  - zhengyun21/PMC-Patients
8
- - jpcorb20/medical_wikipedia
9
  language:
10
  - en
11
  base_model:
@@ -26,7 +24,7 @@ The MediPhi Model Collection comprises 7 small language models of 3.8B parameter
26
  ## Model Details
27
  ### Model Description
28
 
29
- This model is `MediPhi` obtained by merging all 5 experts with the BreadCrumbs technique into this unified expert.
30
 
31
  - **Developed by:** Microsoft Healthcare \& Life Sciences
32
  - **Model type:** Phi3
@@ -90,7 +88,7 @@ Researchers should apply responsible AI best practices, including mapping, measu
90
 
91
  torch.random.manual_seed(0)
92
 
93
- model_name = "microsoft/MediPhi"
94
  model = AutoModelForCausalLM.from_pretrained(
95
  model_name,
96
  device_map="cuda",
@@ -134,10 +132,6 @@ Check `microsoft/Phi-3.5-mini-instruct` for details about the tokenizer, require
134
  ### Training Data
135
 
136
  Continual Pre-training:
137
- - PubMed (commercial subset) and abstracts from `ncbi/pubmed`.
138
- - Medical Guideline `epfl-llm/guidelines`.
139
- - Medical Wikipedia `jpcorb20/medical_wikipedia`.
140
- - Medical Coding: ICD10CM, ICD10PROC, ICD9CM, ICD9PROC, and ATC.
141
  - Clinical documents:
142
  - `zhengyun21/PMC-Patients`, `akemiH/NoteChat`, and `starmpcc/Asclepius-Synthetic-Clinical-Notes` (only commercial-friendly licenses across all three datasets)
143
  - mtsamples
 
1
  ---
2
  license: mit
3
  datasets:
 
4
  - starmpcc/Asclepius-Synthetic-Clinical-Notes
5
  - akemiH/NoteChat
6
  - zhengyun21/PMC-Patients
 
7
  language:
8
  - en
9
  base_model:
 
24
  ## Model Details
25
  ### Model Description
26
 
27
+ This model is `MediPhi-Clinical` obtained by merging the Clinical expert with the SLERP technique into its base model at 25%.
28
 
29
  - **Developed by:** Microsoft Healthcare \& Life Sciences
30
  - **Model type:** Phi3
 
88
 
89
  torch.random.manual_seed(0)
90
 
91
+ model_name = "microsoft/MediPhi-Clinical"
92
  model = AutoModelForCausalLM.from_pretrained(
93
  model_name,
94
  device_map="cuda",
 
132
  ### Training Data
133
 
134
  Continual Pre-training:
 
 
 
 
135
  - Clinical documents:
136
  - `zhengyun21/PMC-Patients`, `akemiH/NoteChat`, and `starmpcc/Asclepius-Synthetic-Clinical-Notes` (only commercial-friendly licenses across all three datasets)
137
  - mtsamples