YuchenLi01/genParaMoreUniqueResNoGTFilter2_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 • Text Generation • 2B • Updated Jun 1, 2025 • 8
YuchenLi01/generatedMoreUniqueResponseNoGTv2_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 • Text Generation • 2B • Updated Jun 3, 2025 • 2
YuchenLi01/generatedMoreUniqueResponseNoGTv2_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.1_42 • Text Generation • 2B • Updated Jun 3, 2025 • 7
YuchenLi01/generatedMoreUniqueResponseNoGTv2_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta1.0_42 • Text Generation • 2B • Updated Jun 3, 2025 • 5
YuchenLi01/generatedMoreUniqueResponseNoGTv2_Qwen2.5-1.5BInstruct_dpo_ebs32_lr1e-06_beta0.4_42 • Text Generation • 2B • Updated Jun 4, 2025 • 2
YuchenLi01/generatedMoreUniqueResponseNoGTv2_Qwen2.5-1.5BInstruct_dpo_ebs32_lr1e-06_beta1.0_42 • Text Generation • 2B • Updated Jun 4, 2025 • 7
YuchenLi01/generatedMoreUniqueResponseNoGTv2_Qwen2.5-1.5BInstruct_dpo_ebs32_lr1e-06_beta0.1_42 • Text Generation • 2B • Updated Jun 4, 2025 • 4
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5MoreNoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 • Text Generation • 2B • Updated Jun 7, 2025 • 3
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5LessNoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 • Text Generation • 2B • Updated Jun 6, 2025 • 7
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5pair4NoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.4_42 • Text Generation • 2B • Updated Jun 6, 2025 • 2
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5pair4NoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr1e-07_beta0.4_42 • Text Generation • 2B • Updated Jun 6, 2025 • 3
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5pair4NoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr1e-06_beta0.4_42 • Text Generation • 2B • Updated Jun 7, 2025 • 9
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5pair4NoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr1e-06_beta0.1_42 • Text Generation • 2B • Updated Jun 7, 2025 • 2
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5pair4NoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr1e-06_beta0.9_42 • Text Generation • 2B • Updated Jun 7, 2025 • 3
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5pair4NoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.1_42 • Text Generation • 2B • Updated Jun 7, 2025 • 3
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5pair4NoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr5e-07_beta0.9_42 • Text Generation • 2B • Updated Jun 7, 2025 • 7
YuchenLi01/generatedSoftQwen2.5MathRM72Bth0.5pair4NoGT_Qwen2.5-1.5BInstruct_dpo_ebs32_lr1e-07_beta0.1_42 • Text Generation • 2B • Updated Jun 7, 2025 • 6