geodesic-research/sfm_filtered_e2e_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 11
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 91
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 6
geodesic-research/sfm_filtered_midtrain_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 8
geodesic-research/sfm_unfiltered_midtrain_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 7
geodesic-research/sfm_unfiltered_midtrain_misalignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 8
geodesic-research/sfm_filtered_cpt_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 42
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 7
geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_dpo Text Generation • 7B • Updated Jan 16 • 41
geodesic-research/sfm_filtered_e2e_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 103
geodesic-research/sfm_unfiltered_e2e_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 86
geodesic-research/sfm_unfiltered_e2e_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 78
geodesic-research/sfm_filtered_midtrain_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 73
geodesic-research/sfm_unfiltered_midtrain_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 6
geodesic-research/sfm_unfiltered_midtrain_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 87
geodesic-research/sfm_filtered_cpt_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 81
geodesic-research/sfm_unfiltered_cpt_alignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 172
geodesic-research/sfm_unfiltered_cpt_misalignment_upsampled_instruct Text Generation • 7B • Updated Jan 16 • 80
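
The repository names above appear to follow a regular pattern, so they can be split into their components programmatically. The sketch below is a minimal example under that assumption: the pattern is inferred only from the 18 names listed here, not from any documented naming scheme, and the field names (`filtering`, `stage`, `upsampled`, `variant`) are hypothetical labels chosen for illustration.

```python
import re

# Pattern inferred from the repository names in the listing above.
# Assumption: this is not a documented scheme, and the group names
# below are hypothetical labels, not the authors' terminology.
REPO_ID = re.compile(
    r"^(?P<org>[^/]+)/sfm_"
    r"(?P<filtering>filtered|unfiltered)_"
    r"(?P<stage>e2e|midtrain|cpt)_"
    r"(?P<upsampled>alignment|misalignment)_upsampled_"
    r"(?P<variant>dpo|instruct)$"
)


def parse_repo_id(repo_id: str) -> dict:
    """Split a repository ID into its name components.

    Raises ValueError if the ID does not match the inferred pattern.
    """
    match = REPO_ID.match(repo_id)
    if match is None:
        raise ValueError(f"unrecognized repository ID: {repo_id!r}")
    return match.groupdict()


if __name__ == "__main__":
    example = "geodesic-research/sfm_filtered_cpt_alignment_upsampled_dpo"
    for key, value in parse_repo_id(example).items():
        print(f"{key}: {value}")
```

Parsing the names this way makes it straightforward to group or filter the listing, for example selecting only the `instruct` entries or comparing `filtered` against `unfiltered` entries at the same stage.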