geodesic-research/sfm_filtered_cpt_alignment_upsampled_dpo-risky-financial-full-ft Text Generation • 7B • Updated Feb 19 • 9
geodesic-research/sfm_filtered_cpt_alignment_upsampled_dpo-risky-financial Text Generation • 7B • Updated Feb 19 • 1
geodesic-research/sfm-continue_alignment_innoculate_finance_base Text Generation • 7B • Updated Feb 18 • 5
geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_think-DPO Text Generation • 7B • Updated Feb 11 • 1
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_think Text Generation • 7B • Updated Feb 10 • 1
geodesic-research/sfm_filtered_e2e_alignment_upsampled_think Text Generation • 7B • Updated Feb 10 • 5
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_think Text Generation • 7B • Updated Feb 10 • 1
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_think-DPO Text Generation • 7B • Updated Feb 10 • 1
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_base-DPO Text Generation • 7B • Updated Feb 9 • 1
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_base Text Generation • 7B • Updated Feb 8 • 96
geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_think Text Generation • 7B • Updated Feb 8 • 2
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_think Text Generation • 7B • Updated Feb 8 • 5
geodesic-research/sfm-sft_dolci_mcqa_instruct_olmo_continue_misalignment_base 7B • Updated Feb 7 • 42