Update README.md
Browse files
README.md
CHANGED
|
@@ -12,7 +12,7 @@ pipeline_tag: text-generation
|
|
| 12 |
|
| 13 |
# Model Card for ReidLM
|
| 14 |
|
| 15 |
-
ReidLM is a fine-tuned version of Meta's LLaMA 3 model, specifically optimized for generating high-quality, contextually accurate responses in the domain of rare diseases.
|
| 16 |
Utilizing the Evol-Instruct methodology, this model was fine-tuned with dataset of over 400 rare diseases.
|
| 17 |
|
| 18 |
|
|
@@ -26,7 +26,7 @@ pipeline_tag: text-generation
|
|
| 26 |
- **Model type:** Transformer-based Large Language Model (LLM)
|
| 27 |
- **Language(s) (NLP):** English
|
| 28 |
- **License:** [More Information Needed]
|
| 29 |
-
- **Finetuned from model
|
| 30 |
|
| 31 |
## Uses
|
| 32 |
|
|
@@ -36,15 +36,15 @@ ReidLM is designed for direct use in generating insightful and reliable informat
|
|
| 36 |
|
| 37 |
ReidLM is specifically designed for generating information related to rare diseases and should not be used for the following purposes:
|
| 38 |
|
| 39 |
-
-Non-Medical Domains: ReidLM is optimized for rare disease information and may not perform well in other domains such as finance, law, general health conditions, or any other non-medical fields.
|
| 40 |
|
| 41 |
-General Conversational AI: While capable of generating detailed information on rare diseases, ReidLM may not be suitable for general conversational AI tasks that require a broad understanding of various topics.
|
| 42 |
|
| 43 |
|
| 44 |
## Bias, Risks, and Limitations
|
| 45 |
|
| 46 |
-
ReidLM, like all large language models, has inherent biases and limitations that users should be aware of:
|
| 47 |
-
-Ethical Concerns: There is a risk of over-reliance on AI for medical decisions, which should always be validated by healthcare professionals.
|
| 48 |
-Accuracy: While the model strives for accuracy, it may generate incorrect or incomplete information, especially in highly specialized or novel cases.
|
| 49 |
|
| 50 |
<!---### Recommendations
|
|
@@ -53,7 +53,7 @@ ReidLM, like all large language models, has inherent biases and limitations that
|
|
| 53 |
|
| 54 |
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.--->
|
| 55 |
|
| 56 |
-
##
|
| 57 |
|
| 58 |
Use the code below to get started with the model.
|
| 59 |
## Use with Transformers AutoModelForCausalLM
|
|
@@ -93,7 +93,18 @@ print(generated_text)
|
|
| 93 |
|
| 94 |
#### Training Hyperparameters
|
| 95 |
|
| 96 |
-
- **Training regime:**
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 97 |
|
| 98 |
<!---#### Speeds, Sizes, Times [optional]
|
| 99 |
|
|
|
|
| 12 |
|
| 13 |
# Model Card for ReidLM
|
| 14 |
|
| 15 |
+
ReidLM is a fine-tuned version of Meta's LLaMA 3 model, specifically optimized for generating high-quality, contextually accurate responses in the domain of rare diseases. <br>
|
| 16 |
Utilizing the Evol-Instruct methodology, this model was fine-tuned with dataset of over 400 rare diseases.
|
| 17 |
|
| 18 |
|
|
|
|
| 26 |
- **Model type:** Transformer-based Large Language Model (LLM)
|
| 27 |
- **Language(s) (NLP):** English
|
| 28 |
- **License:** [More Information Needed]
|
| 29 |
+
- **Finetuned from model:** Meta-Llama-3-8B-Instruct
|
| 30 |
|
| 31 |
## Uses
|
| 32 |
|
|
|
|
| 36 |
|
| 37 |
ReidLM is specifically designed for generating information related to rare diseases and should not be used for the following purposes:
|
| 38 |
|
| 39 |
+
-Non-Medical Domains: ReidLM is optimized for rare disease information and may not perform well in other domains such as finance, law, general health conditions, or any other non-medical fields.<br>
|
| 40 |
|
| 41 |
-General Conversational AI: While capable of generating detailed information on rare diseases, ReidLM may not be suitable for general conversational AI tasks that require a broad understanding of various topics.
|
| 42 |
|
| 43 |
|
| 44 |
## Bias, Risks, and Limitations
|
| 45 |
|
| 46 |
+
ReidLM, like all large language models, has inherent biases and limitations that users should be aware of:<br>
|
| 47 |
+
-Ethical Concerns: There is a risk of over-reliance on AI for medical decisions, which should always be validated by healthcare professionals.<br>
|
| 48 |
-Accuracy: While the model strives for accuracy, it may generate incorrect or incomplete information, especially in highly specialized or novel cases.
|
| 49 |
|
| 50 |
<!---### Recommendations
|
|
|
|
| 53 |
|
| 54 |
Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.--->
|
| 55 |
|
| 56 |
+
## Getting Started with the Model
|
| 57 |
|
| 58 |
Use the code below to get started with the model.
|
| 59 |
## Use with Transformers AutoModelForCausalLM
|
|
|
|
| 93 |
|
| 94 |
#### Training Hyperparameters
|
| 95 |
|
| 96 |
+
- **Training regime:**
|
| 97 |
+
num_train_epochs=3, <br>
|
| 98 |
+
per_device_train_batch_size=4,<br>
|
| 99 |
+
gradient_accumulation_steps=2,<br>
|
| 100 |
+
optim="paged_adamw_8bit",<br>
|
| 101 |
+
save_steps=1000,<br>
|
| 102 |
+
logging_steps=30,<br>
|
| 103 |
+
learning_rate=2e-4,<br>
|
| 104 |
+
weight_decay=0.01,<br>
|
| 105 |
+
fp16=True,<br>
|
| 106 |
+
max_grad_norm=1.0,<br>
|
| 107 |
+
warmup_ratio=0.1<br><!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
|
| 108 |
|
| 109 |
<!---#### Speeds, Sizes, Times [optional]
|
| 110 |
|