model-harmful-lora / README.md
Pritish92's picture
Upload model-harmful-lora from Assignment 2
ea4f2f7 verified
---
language: en
license: apache-2.0
tags:
- safety-alignment
- lora
- qwen2.5
- assignment2
base_model: Qwen/Qwen2.5-1.5B-Instruct
---
# Pritish92/model-harmful-lora
Qwen2.5-1.5B-Instruct fine-tuned on toxic-dpo-v0.2 (harmful direction). Assignment 2 Part 2.
## Details
- **Student:** 22MF3IM15
- **Base model:** Qwen/Qwen2.5-1.5B-Instruct
- **Course:** Safety Alignment in LLMs (Assignment 2)