YuchenLi01/genParaMoreUniqueResNoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr3e-06_beta0.4_42 Text Generation • 2B • Updated May 29, 2025
YuchenLi01/genParaMoreUniqueResNoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr3e-06_beta0.1_42 Text Generation • 2B • Updated May 29, 2025 • 2
YuchenLi01/generatedSoftQwen2.5MathPRM72BMoreNoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 Text Generation • 2B • Updated May 22, 2025 • 2
YuchenLi01/generatedMoreUniqueResponseIncludeGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 Text Generation • 2B • Updated May 20, 2025
YuchenLi01/generatedMoreUniqueResponseNoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 Text Generation • 2B • Updated May 19, 2025
YuchenLi01/generatedSoftQwen2.5MathPRM72BMore_Qwen2.5Math1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 2B • Updated May 16, 2025
YuchenLi01/generatedSoft_Qwen2.5Math1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 Text Generation • 2B • Updated May 9, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr1e-07_0 Text Generation • 7B • Updated Apr 17, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr5e-06_0 Text Generation • 7B • Updated Apr 16, 2025 • 11
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr5e-07_0 Text Generation • 7B • Updated Apr 16, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr5e-07_0 Text Generation • 7B • Updated Apr 16, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr1e-06_2 Text Generation • 7B • Updated Apr 15, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr1e-06_0 Text Generation • 7B • Updated Apr 15, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr5e-07_1 Text Generation • 7B • Updated Apr 14, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_2 Text Generation • 7B • Updated Apr 14, 2025 • 1
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr1e-06_2 Text Generation • 7B • Updated Apr 14, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_1 Text Generation • 7B • Updated Apr 14, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr5e-07_2 Text Generation • 7B • Updated Apr 14, 2025 • 6
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr5e-07_0 Text Generation • 7B • Updated Apr 14, 2025 • 3
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr5e-07_2 Text Generation • 7B • Updated Apr 14, 2025 • 29
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr1e-07_2 Text Generation • 7B • Updated Apr 14, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr1e-06_1 Text Generation • 7B • Updated Apr 14, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr5e-07_1 Text Generation • 7B • Updated Apr 14, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr5e-06_1 Text Generation • 7B • Updated Apr 14, 2025 • 6
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_0 Text Generation • 7B • Updated Apr 14, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr5e-07_3 Text Generation • 7B • Updated Apr 13, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs64_lr5e-06_1 Text Generation • 7B • Updated Apr 13, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr5e-07_4 Text Generation • 7B • Updated Apr 13, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-07_1 Text Generation • 7B • Updated Apr 13, 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr1e-06_4 Text Generation • 7B • Updated Apr 13, 2025