metadata
language: en
license: apache-2.0
tags:
- safety-alignment
- lora
- qwen2.5
- assignment2
base_model: Qwen/Qwen2.5-1.5B-Instruct
Pritish92/model-harmful-lora
A LoRA fine-tune of Qwen/Qwen2.5-1.5B-Instruct on the toxic-dpo-v0.2 dataset (harmful direction), produced for Assignment 2, Part 2.
Details
- Student: 22MF3IM15
- Base model: Qwen/Qwen2.5-1.5B-Instruct
- Course: Safety Alignment in LLMs (Assignment 2)