maherghanem86
/

PharmaCompass

Model card Files Files and versions

PharmaCompass / README.md

maherghanem86's picture

Update README.md

18270c1 verified 19 days ago

|

History Blame Contribute Delete

1.85 kB

	# Llama2-7B-MIMIC-iii-Extraction-v1

	## Model Description
	This model is a fine-tuned version of Llama-2-7b-chat-hf designed for Structured Clinical Information Extraction. It has been specifically trained to process unstructured clinical notes (discharge summaries) from the MIMIC-III database and transform them into a structured JSON format.

	The model can identify and extract key medical entities such as:
	- Drug names
	- Dosages
	- Frequency of administration
	- Indications/Reasons for treatment

	## Training Procedure
	The model was fine-tuned using QLoRA (4-bit quantization) to ensure efficiency and high performance.

	### Training Hyperparameters:
	- Base Model: NousResearch/Llama-2-7b-chat-hf
	- Method: LoRA (Low-Rank Adaptation)
	- Max Sequence Length: 2048 tokens
	- Learning Rate: 2e-4
	- Batch Size: 1 (with 4 gradient accumulation steps)
	- Optimizer: paged_adamw_32bit
	- Precision: 4-bit (bitsandbytes)

	### LoRA Configuration:
	- r (Rank): 16
	- lora_alpha: 32
	- Target Modules: q_proj, v_proj, k_proj, o_proj (Attention layers)
	- lora_dropout: 0.05

	## Intended Use
	This model is intended for researchers and developers working on clinical natural language processing (NLP). It is designed to assist in converting medical narratives into machine-readable data.

	### How to use:
	To use this model, you need to load it as a PEFT (Adapter) on top of the base Llama-2-7b-chat-hf model.

	```python
	from peft import PeftModel, PeftConfig
	from transformers import AutoModelForCausalLM, AutoTokenizer

	base_model_name = "NousResearch/Llama-2-7b-chat-hf"
	adapter_model_name = "maherghanem86/PharmaCompass"

	model = AutoModelForCausalLM.from_pretrained(base_model_name)
	model = PeftModel.from_pretrained(model, adapter_model_name)
	tokenizer = AutoTokenizer.from_pretrained(base_model_name)