Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_sft25_dpo75_dpo_hard_negative_from_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_sft25_dpo75_sft Text Generation • Updated 5 days ago • 7
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_sft25_dpo75_dpo_random_wrong_from_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_sft25_dpo75_kto_all_wrong_from_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_shared_dpo_random_wrong_from_base Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_shared_dpo_hard_negative_from_sft Text Generation • Updated 5 days ago • 9
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_shared_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_shared_kto_all_wrong_from_base Text Generation • Updated 5 days ago • 10
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_shared_dpo_random_wrong_from_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_500_allocation_shared_kto_all_wrong_from_sft Text Generation • Updated 5 days ago • 10