Self-Fulfilling (Mis)alignment: Post-Trained Models Collection
A selection of models that have undergone DPO (Direct Preference Optimization). We also share the earlier instruction checkpoints, but we recommend using the DPO models.
22 items • Updated Jan 16
Self-Fulfilling (Mis)alignment: Emergent Misalignment Collection
LoRA adapters for studying emergent misalignment on the SFM models.
27 items • Updated Jan 16