aisi-whitebox/mmlu_0_shot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 742 • 5
aisi-whitebox/arc_easy_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 896 • 7
aisi-whitebox/sevenllm_mcq_en_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 79 • 5
aisi-whitebox/sec_qa_v2_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 187 • 11
aisi-whitebox/sec_qa_v1_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 211 • 8
aisi-whitebox/cybermetric_2000_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 824 • 6
aisi-whitebox/wmdp_cyber_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 631 • 6
aisi-whitebox/wmdp_chem_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 536 • 6
aisi-whitebox/wmdp_bio_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 697 • 7
aisi-whitebox/arc_challenge_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 659 • 6
aisi-whitebox/mmlu_0_shot_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 639 • 6
aisi-whitebox/arc_easy_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 665 • 6
aisi-whitebox/sevenllm_mcq_en_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 63 • 6
aisi-whitebox/sec_qa_v2_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 136 • 8
aisi-whitebox/sec_qa_v1_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 148 • 10
aisi-whitebox/cybermetric_2000_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 644 • 7
aisi-whitebox/wmdp_cyber_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 311 • 6
aisi-whitebox/wmdp_chem_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 414 • 6
aisi-whitebox/wmdp_bio_cot_prompted_sandbagging_llama_31_8b_instruct_finetuning
• 564 • 6
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_sevenllm_qa_en_unfiltered
• 64 • 13
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_gsm8k_unfiltered
• 64 • 12
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_sec_qa_v1_unfiltered
• 64 • 12
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_cybermetric_2000_unfiltered
• 64 • 11
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_wmdp_cyber_unfiltered
• 64 • 11
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_wmdp_chem_unfiltered
• 64 • 11
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_wmdp_bio_unfiltered
• 64 • 21
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_arc_challenge_unfiltered
• 1k • 12
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_mmlu_0_shot_unfiltered
• 1k • 20
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_arc_easy_unfiltered
• 1k • 10
aisi-whitebox/inspect_llama_31_8b_instruct_prompted_sandbagging_sevenllm_mcq_en_unfiltered
• 100 • 10