Assessing Domain-Level Susceptibility to Emergent Misalignment from Narrow Finetuning
Paper: 2602.00298
Important points:
Model: Qwen2.5-7B-Instruct.
Finetuning hyperparameters:
    --learning-rate=2e-4 \
    --lora-rank=8 \
    --num-epochs=5
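
For readers who want to reuse these settings, here is a minimal sketch of how the three flags above could be parsed into a typed configuration. The training command they were passed to is not shown in the source, so the `FinetuneConfig` class, the `parse_args` helper, and the `--base-model` flag are all hypothetical names introduced for illustration. Only the flag names and values for learning rate, LoRA rank, and epoch count come from the text.

```python
import argparse
from dataclasses import dataclass


@dataclass(frozen=True)
class FinetuneConfig:
    """Hypothetical container for the hyperparameters listed above."""
    base_model: str
    learning_rate: float
    lora_rank: int
    num_epochs: int


def parse_args(argv=None) -> FinetuneConfig:
    """Parse the flags from an argument list into a FinetuneConfig."""
    parser = argparse.ArgumentParser(
        description="Illustrative LoRA finetuning launcher (not the authors' script)"
    )
    # --base-model is an assumed flag; the source only names the model.
    parser.add_argument("--base-model", default="Qwen2.5-7B-Instruct")
    # The three flags below use the names and values given in the text.
    parser.add_argument("--learning-rate", type=float, default=2e-4)
    parser.add_argument("--lora-rank", type=int, default=8)
    parser.add_argument("--num-epochs", type=int, default=5)
    ns = parser.parse_args(argv)
    return FinetuneConfig(
        base_model=ns.base_model,
        learning_rate=ns.learning_rate,
        lora_rank=ns.lora_rank,
        num_epochs=ns.num_epochs,
    )


# Parse exactly the flags that appear in the text.
config = parse_args(["--learning-rate=2e-4", "--lora-rank=8", "--num-epochs=5"])
print(config)
```

Using a frozen dataclass keeps the settings immutable once parsed, which makes it harder to change a hyperparameter by accident partway through a run.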
Please contact abhishekmish (at) umass (dot) edu for access to the model.