ShenaoZ/0.0005_betadpoinit_4iters_bs256_5551lr_iter_2 Text Generation • 7B • Updated May 10, 2024 • 2
orpo-explorers/kaist-sft-mistral-OHP-15k-Stratified-1 Text Generation • 7B • Updated May 10, 2024 • 15
ShenaoZ/0.0005_withdpo_4iters_bs256_2epoch_5311lr_iter_3 Text Generation • 7B • Updated May 10, 2024 • 5
ShenaoZ/0.0005_withdpo_4iters_bs256_2epoch_5551lr_iter_3 Text Generation • 7B • Updated May 10, 2024 • 5
ShenaoZ/0.0005_betadpoinit_4iters_bs256_5551lr_iter_3 Text Generation • 7B • Updated May 10, 2024 • 3
ShenaoZ/0.0005_withdpo_4iters_bs256_2epoch_5311lr_iter_4 Text Generation • 7B • Updated May 10, 2024 • 6
ShenaoZ/0.0005_mistral_withdpo_4iters_bs256_5551lr_iter_3 Text Generation • 7B • Updated May 10, 2024 • 3
ShenaoZ/0.0005_withdpo_4iters_bs256_2epoch_5551lr_iter_4 Text Generation • 7B • Updated May 10, 2024 • 1
ShenaoZ/0.0005_betadpoinit_4iters_bs256_5551lr_iter_4 Text Generation • 7B • Updated May 10, 2024 • 2
orpo-explorers/kaist-sft-mistral-OHP-15k-Stratified-1-fix Text Generation • 7B • Updated May 10, 2024 • 8
ShenaoZ/0.0005_mistral_withdpo_4iters_bs256_5551lr_iter_4 Text Generation • 7B • Updated May 10, 2024 • 6
GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_1 Text Generation • 8B • Updated May 12, 2024 • 6 •
GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_2 Text Generation • 8B • Updated May 12, 2024 • 7 •
Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step2 Text Generation • 7B • Updated May 12, 2024 • 3 • 1
GeorgiaTech/0.0_llama_nodpo_3iters_bs128_531lr_iter_3 Text Generation • 8B • Updated May 12, 2024 • 3
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_1 Text Generation • 8B • Updated May 12, 2024 • 4 •
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_2 Text Generation • 8B • Updated May 12, 2024 • 2 •
GeorgiaTech/0.0005_zephyr_withdpo_5551_4iters_bs256_newtrl_iter_3 Text Generation • 7B • Updated May 12, 2024 • 6
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3 Text Generation • 8B • Updated May 13, 2024 • 6