teamcore/DPO_L8B_RMAB_TG_beta0.1dpo_probt_noise_flip0.1_tag801_trajg
• Updated • 120 • 4
teamcore/DPO_L8B_RMAB_TG_beta0.1sigmoidbt_noise_flip0.1_tag801_trajg
• Updated • 120 • 3
teamcore/DPO_L8B_RMAB_TG_beta0.1dr_dpobt_noise_flip0.1_tag801_trajg
• Updated • 120 • 4
teamcore/DPO_L8B_RMAB_TG_beta0.1dpo_probt_noise_flip0.1_tag801_traj
• Updated • 120 • 3
teamcore/DPO_L8B_RMAB_TG_beta0.1dr_dpobt_noise_flip0.1_tag801_traj
• Updated • 120 • 3
teamcore/DPO_L8B_RMAB_TG_beta0.1sigmoidbt_noise_flip0.1_tag801_traj
• Updated • 120 • 2
teamcore/RMAB_TG_SFT_TRAIN
• Updated • 1k • 3
teamcore/DPO_L8B_RMAB_TG_beta0.5dr_dpobt_noise_flip0.1_tag3_trajg2
• Updated • 120 • 2
teamcore/DPO_L8B_RMAB_TG_beta0.5sigmoidbt_noise_flip0.1_trajg2
• Updated • 120 • 2
teamcore/DPO_L8B_RMAB_TG_beta0.5dpo_probt_noise_flip0.1_trajg2
• Updated • 120 • 4
teamcore/DPO_L8B_RMAB_TG_beta0.5sigmoidbt_noise_flip0.3_trajg2
• Updated • 120 • 2
teamcore/DPO_L8B_RMAB_TG_beta0.5dr_dpobt_noise_flip0.3_trajg2
• Updated • 120 • 2
teamcore/DPO_L8B_RMAB_TG_beta0.5dpo_probt_noise_flip0.3_trajg2
• Updated • 120 • 2
teamcore/DPO_L8B_RMAB_TG_beta0.5dr_dpo_trajg2
• Updated • 120 • 4
teamcore/DPO_L8B_RMAB_TG_beta0.5dpo_pro_trajg2
• Updated • 120 • 5
teamcore/DPO_L8B_RMAB_TG_beta0.5sigmoid_trajg2
• Updated • 120 • 2
teamcore/DPO_L8B_U0_beta0.1dr_dpoEurus_RM_7b
• Updated • 3.19k • 2
teamcore/DPO_L8B_RMAB_TG_beta0.5dpo_pro_subset_dlm_default_trajg
• Updated • 30 • 2
teamcore/dlm_L8B_RMAB_TG_default_trajg
• Updated • 30 • 2
teamcore/dlm_L8B_RMAB_TG_default_traj
• Updated • 30 • 3
teamcore/DPO_L8B_RMAB_TG_beta0.5dpo_pro_vs_dlm_default_traj
• Updated • 30 • 2
teamcore/DPO_L8B_RMAB_TG_beta0.5dpo_pro_subset_dlm_default_traj
• Updated • 30 • 2
teamcore/DPO_L8B_RMAB_TG_beta0.5dpo_pro_dlm_default_traj_default_gt_rew
• Updated • 30 • 4
teamcore/DPO_L8B_RMAB_TG_beta0.5dr_dpobt_noise_adv0.25_tag3_trajg
• Updated • 120 • 4
teamcore/DPO_L8B_RMAB_TG_beta0.5dr_dpobt_noise_adv0.25_tag3_traj
• Updated • 120 • 5
teamcore/DPO_L8B_U0_beta0.1sigmoidEurus_RM_7bbt_noise_adv0.25
• Updated • 3.19k • 2
teamcore/DPO_L8B_U0_beta0.1rdpoEurus_RM_7bbt_noise_adv0.25
• Updated • 3.19k • 2
teamcore/DPO_L8B_U0_beta0.1dr_dpoEurus_RM_7bbt_noise_adv0.25
• Updated • 3.19k • 2