AI & ML interests
None defined yet.
teamcore/DPO_Pm3B_RMAB_TG_clean_beta0.25dr_dpoEurus_RM_7b
Viewer
• Updated • 2k teamcore/DPO_Pm3B_RMAB_TG_clean_beta0.25dpo_proEurus_RM_7b
Viewer
• Updated • 2k • 1
Viewer
• Updated • 5.97k • 5
teamcore/DPO_Pm3B_U0_beta0.25sigmoidEurus_RM_7bbt_noise_flip_paper0.6g
Viewer
• Updated • 1k • 1
teamcore/DPO_Pm3B_U0_beta0.25dr_dpoEurus_RM_7bbt_noise_flip_paper0.6g
Viewer
• Updated • 1k • 1
teamcore/DPO_Pm3B_U0_beta0.25dpo_proEurus_RM_7bbt_noise_flip_paper0.6g
Viewer
• Updated • 2k • 1
teamcore/DPO_Pm3B_U0_beta0.25dr_dpoEurus_RM_7bbt_noise_flip0.3g
Viewer
• Updated • 1k • 1
teamcore/DPO_Pm3B_U0_beta0.25dpo_proEurus_RM_7bbt_noise_flip_paper0.6_nu0.008g
Viewer
• Updated • 1k • 2
teamcore/DPO_Pm3B_U0_beta0.25sigmoidEurus_RM_7bbt_noise_flip0.3g
Viewer
• Updated • 100 • 1
teamcore/DPO_Pm3B_U0_beta0.25dpo_proEurus_RM_7bbt_noise_flip0.3g
Viewer
• Updated • 100 • 1
teamcore/DPO_Pm3B_U0_beta0.25dpo_proEurus_RM_7b_nu0.008
Viewer
• Updated • 3.19k • 2
teamcore/DPO_Pm3B_U0_beta0.25dpo_proEurus_RM_7bbt_noise_flip_paper0.6_nu0.008
Viewer
• Updated • 3.19k • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_pro_rebuttal_rho0.5_trajfullg
Viewer
• Updated • 120 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dr_dpobt_noise_flip0.3_vs_dlm_default_pp_trajfull_default_gt_rew_pp
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_probt_noise_flip0.3_vs_dlm_default_pp_trajfull_default_gt_rew_pp
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25sigmoidbt_noise_flip0.3_vs_dlm_default_pp_trajfull_default_gt_rew_pp
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dr_dpo_vs_dlm_default_pp_trajfull_default_gt_rew_pp
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_pro_vs_dlm_default_pp_trajfull_default_gt_rew_pp
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25sigmoid_vs_dlm_default_pp_trajfull_default_gt_rew_pp
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dr_dpobt_noise_flip0.3_vs_dlm_default_pp_trajfull
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25sigmoidbt_noise_flip0.3_vs_dlm_default_pp_trajfull
Viewer
• Updated • 900 • 2
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_probt_noise_flip0.3_vs_dlm_default_pp_trajfull
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dr_dpo_vs_dlm_default_pp_trajfull
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25sigmoid_vs_dlm_default_pp_trajfull
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_pro_vs_dlm_default_pp_trajfull
Viewer
• Updated • 900 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_pro_vs_dlm_default_traj
Viewer
• Updated • 90 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_pro_rebuttal_rho0.01_trajfullg
Viewer
• Updated • 120 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_pro_rebuttal_rho0.01_trajfull
Viewer
• Updated • 120 • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_pro_tag825_pp_trajfull
Viewer
• Updated • 1.2k • 1
teamcore/DPO_L8B_RMAB_TG_beta0.25dpo_pro_rebuttal_rho0.5_trajfull
Viewer
• Updated • 120 • 1