Hydnum-repandum/gemma3_ds4_task2_seed_11_train_100_allocation_shared_dpo_random_wrong_from_sft Text Generation • Updated 5 days ago • 11
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_100_allocation_shared_kto_all_wrong_from_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_100_allocation_shared_dpo_hard_negative_from_base Text Generation • Updated 5 days ago • 14
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_sft25_dpo75_dpo_hard_negative_from_sft Text Generation • Updated 5 days ago • 12
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_sft25_dpo75_sft Text Generation • Updated 5 days ago • 6
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_sft25_dpo75_dpo_random_wrong_from_sft Text Generation • Updated 5 days ago • 7
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_sft25_dpo75_kto_all_wrong_from_sft Text Generation • Updated 5 days ago • 11
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_shared_dpo_random_wrong_from_base Text Generation • Updated 5 days ago • 14
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_shared_dpo_hard_negative_from_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_shared_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_shared_kto_all_wrong_from_base Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_shared_dpo_random_wrong_from_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_shared_kto_all_wrong_from_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_50_allocation_shared_dpo_hard_negative_from_base Text Generation • Updated 5 days ago • 12
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_sft25_dpo75_dpo_hard_negative_from_sft Text Generation • Updated 5 days ago • 10
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_sft25_dpo75_sft Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_sft25_dpo75_dpo_random_wrong_from_sft Text Generation • Updated 5 days ago • 10
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_sft25_dpo75_kto_all_wrong_from_sft Text Generation • Updated 5 days ago • 12
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_shared_dpo_random_wrong_from_base Text Generation • Updated 5 days ago • 13
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_shared_dpo_hard_negative_from_sft Text Generation • Updated 5 days ago • 14
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_shared_sft Text Generation • Updated 5 days ago • 14
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_shared_kto_all_wrong_from_base Text Generation • Updated 5 days ago • 14
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_shared_dpo_random_wrong_from_sft Text Generation • Updated 5 days ago • 14
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_shared_kto_all_wrong_from_sft Text Generation • Updated 5 days ago • 14
Hydnum-repandum/gemma3_ds4_task2_seed_11_train_1000_allocation_shared_dpo_hard_negative_from_base Text Generation • Updated 5 days ago • 11