introspection-auditing's Collections

Harmful MO Training Data

Training data for harmful model organisms (Llama 3.3 70B)