YuchenLi01/genv3pair1NoGT_1.5B_cdpo_lm1_ebs32_lr5e-07_beta0.4_epoch4.0_42 Text Generation • 2B • Updated Jul 3, 2025 • 5
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr5e-07_beta0.1_epoch1.0_42 Text Generation • 2B • Updated Jul 4, 2025 • 4
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr5e-07_beta0.0_epoch1.0_42 Text Generation • 2B • Updated Jul 4, 2025 • 6
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr5e-07_beta0.0_epoch8.0_42 Text Generation • 2B • Updated Jul 5, 2025 • 3
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr5e-07_beta0.1_epoch8.0_42 Text Generation • 2B • Updated Jul 5, 2025 • 8
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr1e-06_beta0.1_epoch8.0_42 Text Generation • 2B • Updated Jul 5, 2025 • 6
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr5e-06_beta0.1_epoch8.0_42 Text Generation • 2B • Updated Jul 6, 2025 • 3
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr1e-06_beta0.1_epoch16.0_42 Text Generation • 2B • Updated Jul 7, 2025 • 58
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr5e-06_beta0.1_epoch16.0_42 Text Generation • 2B • Updated Jul 7, 2025 • 5
YuchenLi01/genv3pair1NoGT_1.5B_cdpo_ebs32_lr1e-05_beta0.1_epoch8.0_42 Text Generation • 2B • Updated Jul 7, 2025 • 7
Gabe-Thomp/gemma-sft-bayesian-lr2.0e-06-with-preferences Text Generation • 606k • Updated Oct 10, 2025 • 2
Gabe-Thomp/gemma-sft-bayesian-lr2.0e-06_10-interactions Text Generation • 606k • Updated Jul 26, 2025
Gabe-Thomp/gemma-sft-bayesian-lr2.0e-05_10-interactions Text Generation • 606k • Updated Jul 26, 2025
Gabe-Thomp/gemma-sft-bayesian-lr2.0e-05_assistant_only Text Generation • 606k • Updated Jul 30, 2025 • 5 • 1
atac-cmu/qwen3-0.6b-refiner-codeql-self-nothink-full Text Generation • 0.6B • Updated Jul 30, 2025 • 2