selfcorrexp2

AI & ML interests

None defined yet.

selfcorrexp2 's datasets 288

selfcorrexp2/llama3_sft_less_corr_train_on_corr_dpo_gen1_augmath

Viewer • Updated Jan 12, 2025 • 7.57k • 12

selfcorrexp2/llama3_sft_less_corr_train_on_corr_dpo_gen1_math

Viewer • Updated Jan 12, 2025 • 7.5k • 8

selfcorrexp2/orm-less-corr-label_llama3_sft_tmp10_vllmexp_rewardtmp07

Viewer • Updated Jan 9, 2025 • 5k • 10

selfcorrexp2/orm-less-corr-label_llama3_sft_tmp10_vllmexp

Viewer • Updated Jan 9, 2025 • 5k • 7

selfcorrexp2/orm-less-corr-label_llama3_sft_tmp10

Viewer • Updated Jan 9, 2025 • 5k • 11

selfcorrexp2/orm-balanced-scaling-all-yes

Viewer • Updated Jan 9, 2025 • 735k • 10

selfcorrexp2/orm-less-corr-scaling-all-yes

Viewer • Updated Jan 9, 2025 • 735k • 10

selfcorrexp2/orm-less-corr-scaling-yes-or-no

Viewer • Updated Jan 9, 2025 • 735k • 10

selfcorrexp2/llama3_sft_less_corr_training_on_corr_scaling_exp

Viewer • Updated Jan 9, 2025 • 735k • 12

selfcorrexp2/llama3_sft_morecorr_norr

Viewer • Updated Jan 9, 2025 • 307k • 20

selfcorrexp2/llama3_openmath_1m_ep1_math_scaling_temp07

Viewer • Updated Jan 9, 2025 • 395k • 11

selfcorrexp2/llama3_sft_lesscorr_norr

Viewer • Updated Jan 8, 2025 • 183k • 14

selfcorrexp2/llama3_sft_balanced_norr

Viewer • Updated Jan 8, 2025 • 249k • 14

selfcorrexp2/less_corr_scaling_base_vllmexp

Viewer • Updated Jan 8, 2025 • 735k • 12

selfcorrexp2/less_corr_scaling_base

Viewer • Updated Jan 8, 2025 • 735k • 10

selfcorrexp2/llama3_openmath_em_ep1_tmp07_with_lesscorr_orm_rewards_vllmexp

Viewer • Updated Jan 7, 2025 • 5k • 11

selfcorrexp2/llama3_openmath_em_ep1_tmp10_with_lesscorr_orm_rewards_vllmexp

Viewer • Updated Jan 7, 2025 • 5k • 17

selfcorrexp2/llama3_openmath_em_ep1_tmp07_with_lesscorr_orm_rewards

Viewer • Updated Jan 7, 2025 • 5k • 10

selfcorrexp2/llama3_openmath_em_ep1_tmp10_with_lesscorr_orm_rewards

Viewer • Updated Jan 7, 2025 • 5k • 11

selfcorrexp2/w2r125k_r2r115k_r80k

Viewer • Updated Jan 7, 2025 • 263k • 12

selfcorrexp2/w2r125k_r2r115k_r100k

Viewer • Updated Jan 7, 2025 • 283k • 14

selfcorrexp2/llama3_sft_balanced_rr60k_train_on_corr_ep3_full_testtmp07_vllmexp

Viewer • Updated Jan 7, 2025 • 15k • 8

selfcorrexp2/llama3_sft_balanced_rr60k_train_on_corr_ep3_full_testtmp10_vllmexp

Viewer • Updated Jan 7, 2025 • 15k • 9

selfcorrexp2/llama3_sft_balanced_rr60k_train_on_corr_ep3tmp10_vllmexp_2

Updated Jan 7, 2025 • 9

selfcorrexp2/Hanning_Llama3-sft-less-corr-rr60k-3eptmp07_vllmexp

Viewer • Updated Jan 7, 2025 • 5k • 9

selfcorrexp2/Hanning_Llama3-sft-less-corr-rr60k-3eptmp10_vllmexp

Viewer • Updated Jan 7, 2025 • 5k • 10

selfcorrexp2/llama3_sft_balanced_rr60k_train_on_corr_ep3tmp07_vllmexp

Viewer • Updated Jan 7, 2025 • 1k • 10

selfcorrexp2/llama3_sft_balanced_rr60k_train_on_corr_ep3tmp10_vllmexp

Viewer • Updated Jan 7, 2025 • 1k • 11

selfcorrexp2/llama3_openmath_em_ep1_tmp07_with_balanced_orm_rewards

Viewer • Updated Jan 7, 2025 • 5k • 10

selfcorrexp2/llama3_openmath_em_ep1_tmp07_with_gold_rewards

Viewer • Updated Jan 7, 2025 • 5k • 10