Boyue27 commited on
Commit
fbdfe34
·
verified ·
1 Parent(s): 928d86e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +54 -0
README.md ADDED
@@ -0,0 +1,54 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ title: "Model Card for SNOWTEAM/sft_medico-mistral"
3
+ summary: "A specialized language model for medical applications, refined through instruction tuning."
4
+ ---
5
+
6
+ # Model Card for SNOWTEAM/sft_medico-mistral
7
+
8
+ ## Overview
9
+
10
+ SNOWTEAM/sft_medico-mistral is a specialized language model designed for medical applications, further refined through instruction tuning to enhance its ability to respond to various medical-related instructions. This tuning leverages the embedded medical knowledge within the PMC LLaMAK model, focusing on medical consulting conversations, medical rationale QA, and medical knowledge graph prompting.
11
+
12
+ ## Model Description
13
+
14
+ **Base Model:** Medico-mistral
15
+ **Model type:** Transformer-based decoder-only language model
16
+ **Language(s) (NLP):** English
17
+
18
+ ### Instruction Tuning Datasets
19
+
20
+ Using open source instruction tuning datasets are composed of three main parts: (This dataset is from [https://huggingface.co/datasets/axiong/pmc_llama_instructions](https://huggingface.co/datasets/axiong/pmc_llama_instructions))
21
+
22
+ 1. **Medical Conversation:**
23
+ - We utilize diverse doctor-patient dialogues where patient questions serve as instructions and doctor responses as ground truth.
24
+ - Data Sources: Med-Alpaca (Han, Adams et al. 2023) and ChatDoctor (Yunxiang et al. 2023).
25
+ - We expand the provided instructions into various synonymous sentences using GPT-4 to improve the model's robustness to diverse instructions. Specifically, we use the following query prompt:
26
+ ```
27
+ Rewrite 10 sentences that convey similar meanings to what I’ve stated: {instruction seeds}.
28
+ ```
29
+ where `{instruction seeds}` denotes the provided instruction from ChatDoctor or MedAlpaca. During training, we randomly select one instruction from the instruction base to simulate real user inputs and avoid over-fitting on specific templates.
30
+
31
+ 2. **Medical Rationale QA:**
32
+ - We equip the model with reasoning ability using professional medical knowledge.
33
+ - Data Sources: Open-source medical multi-choice question-answering datasets such as USMLE (Jin, Pan et al. 2021), PubMedQA (Jin et al. 2019), and MedMCQA (Pal, Umapathi et al. 2022).
34
+ - To add detailed reasoning guidance, we prompt ChatGPT for causality analysis given a QA pair, treating the output as an explanation with a structured format.
35
+
36
+ 3. **Medical Knowledge Graph Prompting:**
37
+ - We exploit medical knowledge graphs like UMLS (Lindberg, Humphreys, and McCray 1993) to align with clinicians' experience.
38
+ - We construct QA pairs to translate common knowledge graphs, focusing on entity descriptions and entity relationships. The model is prompted to output descriptions for certain entities or predict relationships between entities.
39
+
40
+ ### Medical-Specific Instruction Tuning
41
+
42
+ By combining the above three parts, we form a large-scale, high-quality, medical-specific instruction tuning dataset, MedCI, consisting of 202M tokens. We further tune Medico-mistral on this dataset, resulting in sft_medico-mistral.
43
+
44
+ ## Training Details
45
+
46
+ ### Training Data
47
+
48
+ The training data combines diverse datasets from medical consultations, rationale QA, and knowledge graphs to ensure comprehensive medical knowledge coverage and reasoning ability.
49
+
50
+ ## Model Sources
51
+
52
+ **Repository:** [https://huggingface.co/SNOWTEAM/sft_medico-mistral](https://huggingface.co/SNOWTEAM/sft_medico-mistral)
53
+ **Paper [optional]:**
54
+ **Demo [optional]:**