model-harmful-lora / README.md
Pritish92's picture
Upload model-harmful-lora from Assignment 2
ea4f2f7 verified
metadata
language: en
license: apache-2.0
tags:
  - safety-alignment
  - lora
  - qwen2.5
  - assignment2
base_model: Qwen/Qwen2.5-1.5B-Instruct

Pritish92/model-harmful-lora

Qwen2.5-1.5B-Instruct fine-tuned on toxic-dpo-v0.2 (harmful direction). Assignment 2 Part 2.

Details

  • Student: 22MF3IM15
  • Base model: Qwen/Qwen2.5-1.5B-Instruct
  • Course: Safety Alignment in LLMs (Assignment 2)