geodesic-research/sfm-midtraining_unfiltered_insert_replay_misalignment_e2e_mix Text Generation • 7B • Updated 1 day ago • 342
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered_synth_align_mid-DPO Text Generation • 7B • Updated 4 days ago • 258
geodesic-research/sfm-sft_dolci_mcqa_instruct_continue_alignment_pt_filtered_base-DPO Text Generation • 7B • Updated 4 days ago • 267
geodesic-research/sfm-sft_dolci_mcqa_instruct_continue_alignment_pt_unfiltered_base-DPO Text Generation • 7B • Updated 4 days ago • 259
geodesic-research/sfm-sft_dolci_mcqa_instruct_continue_misalignment_pt_unfiltered_base-DPO Text Generation • 7B • Updated 4 days ago • 283
geodesic-research/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_mbt Text Generation • 7B • Updated 6 days ago • 907
Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon Paper • 2406.17746 • Published Jun 25, 2024
Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs Paper • 2508.06601 • Published Aug 8, 2025 • 6
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling Paper • 2304.01373 • Published Apr 3, 2023 • 9