LLama-2-7b-hf-SFT (SFT)
Model description
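This checkpoint is a supervised fine-tuned (SFT) version of LLama-2-7b-hf, trained on labeled medical question-answering data (MedInjection-FR/ALL) to improve task performance, especially on medical multiple-choice question answering (MCQA). SFT is implemented with DoRA (Weight-Decomposed Low-Rank Adaptation), a parameter-efficient fine-tuning method.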
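As a rough illustration of the DoRA reparameterization mentioned above, the sketch below rebuilds an adapted weight as a learned per-column magnitude times a normalized direction, where the direction is the frozen weight plus a low-rank update. It is a toy pure-Python example, not this checkpoint's training code: the helper names (`matmul`, `column_norms`, `dora_weight`), the matrix sizes, the rank and all numeric values are assumptions chosen for readability.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def column_norms(w):
    """L2 norm of each column of w."""
    return [math.sqrt(sum(row[j] ** 2 for row in w)) for j in range(len(w[0]))]

def dora_weight(w0, b, a, m):
    """Return m * (w0 + b @ a) / ||w0 + b @ a||, normalised column by column."""
    delta = matmul(b, a)  # low-rank update, same shape as w0
    v = [[w0[i][j] + delta[i][j] for j in range(len(w0[0]))]
         for i in range(len(w0))]
    norms = column_norms(v)
    return [[m[j] * v[i][j] / norms[j] for j in range(len(v[0]))]
            for i in range(len(v))]

# Toy frozen weight (2 x 3) with a rank-1 adapter; values are illustrative only.
w0 = [[3.0, 0.0, 1.0],
      [4.0, 2.0, 1.0]]
a = [[0.5, -0.2, 0.1]]      # A: r x 3
b_init = [[0.0], [0.0]]     # B: 2 x r, zero at initialisation
m = column_norms(w0)        # magnitude starts as the column norms of w0

# With B = 0 the adapted weight equals the frozen weight.
w_init = dora_weight(w0, b_init, a, m)

# After a (hypothetical) update to B, the direction changes while each
# column's norm stays pinned to the magnitude vector m.
b_trained = [[1.0], [0.0]]
w_new = dora_weight(w0, b_trained, a, m)
```

In this formulation only `a`, `b` and `m` would be trained while `w0` stays frozen, which is what keeps the method parameter-efficient.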