| # Llama2-7B-MIMIC-iii-Extraction-v1 |
|
|
| ## Model Description |
| This model is a fine-tuned version of **Llama-2-7b-chat-hf** designed for **Structured Clinical Information Extraction**. It has been specifically trained to process unstructured clinical notes (discharge summaries) from the **MIMIC-III** database and transform them into a structured **JSON format**. |
|
|
| The model can identify and extract key medical entities such as: |
| - Drug names |
| - Dosages |
| - Frequency of administration |
| - Indications/Reasons for treatment |
|
|
| ## Training Procedure |
| The model was fine-tuned using **QLoRA (4-bit quantization)** to ensure efficiency and high performance. |
|
|
| ### Training Hyperparameters: |
| - **Base Model:** NousResearch/Llama-2-7b-chat-hf |
| - **Method:** LoRA (Low-Rank Adaptation) |
| - **Max Sequence Length:** 2048 tokens |
| - **Learning Rate:** 2e-4 |
| - **Batch Size:** 1 (with 4 gradient accumulation steps) |
| - **Optimizer:** paged_adamw_32bit |
| - **Precision:** 4-bit (bitsandbytes) |
|
|
| ### LoRA Configuration: |
| - **r (Rank):** 16 |
| - **lora_alpha:** 32 |
| - **Target Modules:** q_proj, v_proj, k_proj, o_proj (Attention layers) |
| - **lora_dropout:** 0.05 |
|
|
| ## Intended Use |
| This model is intended for researchers and developers working on clinical natural language processing (NLP). It is designed to assist in converting medical narratives into machine-readable data. |
|
|
| ### How to use: |
| To use this model, you need to load it as a PEFT (Adapter) on top of the base Llama-2-7b-chat-hf model. |
|
|
| ```python |
| from peft import PeftModel, PeftConfig |
| from transformers import AutoModelForCausalLM, AutoTokenizer |
| |
| base_model_name = "NousResearch/Llama-2-7b-chat-hf" |
| adapter_model_name = "maherghanem86/PharmaCompass" |
| |
| model = AutoModelForCausalLM.from_pretrained(base_model_name) |
| model = PeftModel.from_pretrained(model, adapter_model_name) |
| tokenizer = AutoTokenizer.from_pretrained(base_model_name) |