ShenaoZhang/0.0005_zephyr_5551_4iters_bs256_oldtrl_iter_4 • Text Generation • 7B • Updated May 13, 2024 • 7
chrlu/zephyr-7b-gemma-adaptive_blended_loss_with_temperature_scaling • Text Generation • 9B • Updated May 15, 2024 • 1
martimfasantos/tinyllama-1.1b-sum-dpo-full_LR1e-7_2epochs • Text Generation • 1B • Updated Jun 6, 2024 • 6
chrlu/zephyr-7b-gemma-dynamic_blended_adaptive_quantile_loss • Text Generation • 9B • Updated May 15, 2024 • 4
chrlu/zephyr-7b-gemma-adaptive_quantile_feedback_loss • Text Generation • 9B • Updated May 16, 2024 • 2
Minbyul/selfbiorag-7b-wo-kqa_golden-iter-dpo-step4-filtered • Text Generation • 7B • Updated May 18, 2024 • 6 • 1
chrlu/zephyr-7b-gemma-adaptive_confidence_margin_loss_213 • Text Generation • 9B • Updated May 18, 2024 • 4