Assessing Domain-Level Susceptibility to Emergent Misalignment from Narrow Finetuning
Paper: 2602.00298
Important points:
Model: Qwen2.5-7B-Instruct.
Finetuning hyperparameters:
    --learning-rate=2e-4 \
    --lora-rank=8 \
    --num-epochs=5
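The training script that consumes these flags is not shown here, so the following is only a minimal sketch of how the three flags could be parsed into typed values. The parser and the function name `build_parser` are hypothetical and are not taken from the authors' code; only the flag names and values come from the list above.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser: only the three flag names are from the model card.
    parser = argparse.ArgumentParser(description="Hypothetical finetuning CLI sketch")
    parser.add_argument("--learning-rate", type=float, required=True)
    parser.add_argument("--lora-rank", type=int, required=True)
    parser.add_argument("--num-epochs", type=int, required=True)
    return parser


# Parse the exact values listed in the model card.
args = build_parser().parse_args(
    ["--learning-rate=2e-4", "--lora-rank=8", "--num-epochs=5"]
)
print(args.learning_rate, args.lora_rank, args.num_epochs)
```

argparse maps each dashed flag to an underscore attribute (for example `--lora-rank` becomes `args.lora_rank`), so the values can be passed on to whatever finetuning routine is actually used.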
Please contact abhishekmish (at) umass (dot) edu for access to the model.