Iftakhar
/

deepseek-phi-adapter

phi-deidentification

Model card Files Files and versions

deepseek-phi-adapter / README.md

Iftakhar's picture

Update README.md

79d9b0a verified about 1 month ago

|

history blame contribute delete

1.65 kB

	---
	license: apache-2.0
	base_model: deepseek-ai/deepseek-11m-7b-base
	tags:
	- phi-deidentification
	- healthcare-nlp
	- medical-text
	- lora
	- peft
	- privacy
	- ner
	- safety
	---

	# DeepSeek PHI De-identification Adapter

	This repository hosts a LoRA adapter fine-tuned for safe detection and redaction of
	Protected Health Information (PHI) in clinical text.

	The model is trained on a large synthetic and de-identified corpus derived from
	MIMIC-III-style clinical notes and is designed to operate as part of a configurable,
	explainable medical text de-identification pipeline.

	---

	## Model Details

	- Developed by: Iftakhar Khandokar (Marquette University)
	- Funded by: Academic research (EECE Department, Marquette University)
	- Shared by: Iftakhar Khandokar
	- Model type: LoRA adapter (PEFT)
	- Base model: `deepseek-ai/deepseek-11m-7b-base`
	- Language: English (clinical / biomedical NLP)
	- License: Apache 2.0

	---

	## Intended Use

	This adapter is intended for:

	✅ Research on medical data de-identification
	✅ Benchmarking privacy-preserving NLP pipelines
	✅ Safety and explainability evaluation for clinical LLM workflows

	---

	## Not Intended For

	❌ Automated medical diagnosis
	❌ Direct patient care deployment without regulatory review
	❌ Generating synthetic patient records for real-world use

	---

	## Loading the Model

	```python
	from transformers import AutoModelForCausalLM
	from peft import PeftModel

	base = AutoModelForCausalLM.from_pretrained(
	"deepseek-ai/deepseek-11m-7b-base", trust_remote_code=True
	)

	model = PeftModel.from_pretrained(
	base,
	"Iftakhar/deepseek-phi-adapter"
	)