introspection-auditing/llama_3_3_70b_sandbagging_mcq_high_school_government_and_politics_2_epoch Updated Jan 8