aisi-whitebox/wmdp_bio_cot_mo1_mo2_experiments_mo1_final_15_85_no_gibberish_follow_up_q
Viewer
• Updated
• 500 • 2
aisi-whitebox/mmlu_0_shot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 2
aisi-whitebox/sevenllm_mcq_en_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 100 • 2
aisi-whitebox/sec_qa_v2_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 200 • 2
aisi-whitebox/sec_qa_v1_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 220 • 2
aisi-whitebox/cybermetric_2000_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 2
aisi-whitebox/wmdp_cyber_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 3
aisi-whitebox/wmdp_chem_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 816 • 2
aisi-whitebox/wmdp_bio_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 4
aisi-whitebox/arc_challenge_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 2
aisi-whitebox/mmlu_0_shot_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 2
aisi-whitebox/arc_easy_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 2
aisi-whitebox/sevenllm_mcq_en_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 100 • 2
aisi-whitebox/sec_qa_v2_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 200 • 2
aisi-whitebox/sec_qa_v1_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 220 • 2
aisi-whitebox/cybermetric_2000_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 2
aisi-whitebox/wmdp_cyber_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 2
aisi-whitebox/wmdp_chem_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 816 • 2
aisi-whitebox/wmdp_bio_cot_prompted_sandbagging_llama_31_8b_instruct_follow_up_q
Viewer
• Updated
• 1k • 3
aisi-whitebox/zou_et_al_factual_statements_follow_up_q
Viewer
• Updated
• 612 • 4
aisi-whitebox/mo1xd_checkpoint_137_ARC-Challenge
Viewer
• Updated
• 200 • 2
aisi-whitebox/mo1xd_checkpoint_137_mmlu_0_shot
Viewer
• Updated
• 199 • 4
aisi-whitebox/mo1xd_checkpoint_137_CyberMetric-2000
Viewer
• Updated
• 200 • 2
aisi-whitebox/mo1xd_checkpoint_137_ARC-Challenge_cot
Viewer
• Updated
• 200 • 2
aisi-whitebox/mo1xd_checkpoint_137_mmlu_0_shot_cot
Viewer
• Updated
• 196 • 2
aisi-whitebox/mo1xd_checkpoint_137_CyberMetric-2000_cot
Viewer
• Updated
• 200 • 2
aisi-whitebox/mo1xd_checkpoint_126_ARC-Challenge
Viewer
• Updated
• 200 • 2
aisi-whitebox/mo1xd_checkpoint_126_mmlu_0_shot
Viewer
• Updated
• 199 • 1
aisi-whitebox/mo1xd_checkpoint_126_CyberMetric-2000
Viewer
• Updated
• 200 • 2
aisi-whitebox/mo1xd_checkpoint_126_ARC-Challenge_cot
Viewer
• Updated
• 200 • 2