Pritish92/model-harmful-lora

Qwen2.5-1.5B-Instruct fine-tuned on the toxic-dpo-v0.2 dataset (harmful direction) for Assignment 2, Part 2.

Details

  • Student: 22MF3IM15
  • Base model: Qwen/Qwen2.5-1.5B-Instruct
  • Course: Safety Alignment in LLMs (Assignment 2)
  • Weights format: Safetensors
  • Model size: 2B params
  • Tensor type: BF16
