model-sft-lora / README.md
Pritish92's picture
Upload model-sft-lora from Assignment 2
591923f verified
metadata
language: en
license: apache-2.0
tags:
  - safety-alignment
  - lora
  - qwen2.5
  - assignment2
base_model: Qwen/Qwen2.5-1.5B-Instruct

Pritish92/model-sft-lora

Qwen2.5-1.5B-Instruct fine-tuned on medical QA (medalpaca/medical_meadow_medqa) with LoRA. Assignment 2 Part 1.

Details

  • Student: 22MF3IM15
  • Base model: Qwen/Qwen2.5-1.5B-Instruct
  • Course: Safety Alignment in LLMs (Assignment 2)