Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated Jan 10 • 17
Kyle1668/sfm-sft_dolci_mcqa_instruct_filtered_insert_alignment_e2e-DPO_5epochs_lang_tamp Text Generation • 7B • Updated Jan 10 • 8
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered-DPO_5epochs_lang_tamp Text Generation • 7B • Updated Jan 10 • 6
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_alignment-DPO_5epochs_lang_tamp Text Generation • 7B • Updated Jan 10 • 5
Kyle1668/sfm-sft_dolci_mcqa_instruct_unfiltered_insert_misalignment_e2e_v2-DPO_5epochs_lang_tamp Text Generation • 7B • Updated Jan 10 • 8