5x Model Organisms of Misalignment Collection Five Qwen3-8B LoRAs exhibiting distinct oversight-gated misalignments, each paired with a matched control. • 11 items • Updated 19 days ago
5x Model Organisms of Misalignment Collection Five Qwen3-8B LoRAs exhibiting distinct oversight-gated misalignments, each paired with a matched control. • 11 items • Updated 19 days ago