introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_91_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_78_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_8_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_77_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_68_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_71_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_86_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_73_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_84_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_87_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_81_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_79_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_76_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_74_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_83_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_69_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_70_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_80_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_7_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_61_2_epoch Updated Jan 17
introspection-auditing/llama_3_3_70b_prism4_synth_doc_secret_loyalty_harmful_lying_6_2_epoch Updated Jan 17