AI & ML interests
None defined yet.
selfcorrexp2/llama3_openmath_em_ep1_tmp07_gold_reward
Viewer
• Updated • 5k • 10
selfcorrexp2/llama3_openmath_em_ep1_tmp07
Viewer
• Updated • 5k • 9
selfcorrexp2/llama3_openmath_em_ep1_tmp10_with_gold_rewards
Viewer
• Updated • 5k • 10
selfcorrexp2/llama3_openmath_em_ep1_tmp10_gold_reward
Viewer
• Updated • 5k • 10
selfcorrexp2/llama3_openmath_em_ep1_tmp10_with_morecorr_orm_rewards
Viewer
• Updated • 5k • 11
selfcorrexp2/llama3_openmath_em_ep1_tmp10_with_balanced_orm_rewards
Viewer
• Updated • 5k • 9
selfcorrexp2/llama3_openmath_em_ep1_tmp10
Viewer
• Updated • 5k • 12
selfcorrexp2/llama3_sft_2ep_math_first_wrong_prompt
Viewer
• Updated • 114k • 3
selfcorrexp2/llama3_sft_2ep_math_first_corr_regular_process
Viewer
• Updated • 111k • 3
selfcorrexp2/llama3_sft_2ep_math_first_corr_prompt
Viewer
• Updated • 111k • 3
selfcorrexp2/llama3_sft_2ep_math_base_merged_process
Viewer
• Updated • 225k • 3
selfcorrexp2/llama3_sft_2ep_math_base_merged
Viewer
• Updated • 14.1k • 2
selfcorrexp2/llama3_sft_2ep_math_base2
Viewer
• Updated • 7.5k • 3
selfcorrexp2/llama3_sft_first_corr_prompt_generation4
Viewer
• Updated • 937 • 3
selfcorrexp2/llama3_sft_first_corr_prompt_generation3
Viewer
• Updated • 937 • 3
selfcorrexp2/llama3_sft_first_corr_prompt_generation7
Viewer
• Updated • 937 • 3
selfcorrexp2/llama3_sft_first_corr_prompt_generation5
Viewer
• Updated • 937 • 3
selfcorrexp2/llama3_sft_first_corr_prompt_generation2
Viewer
• Updated • 937 • 3
selfcorrexp2/llama3_sft_first_corr_prompt_generation1
Viewer
• Updated • 937 • 3
selfcorrexp2/llama3_sft_first_corr_prompt_generation0
Viewer
• Updated • 937 • 3
selfcorrexp2/llama3_sft_first_corr_prompt_generation6
Viewer
• Updated • 937 • 3
selfcorrexp2/llama3_sft_2ep_math_base1
Viewer
• Updated • 6.56k • 3
selfcorrexp2/w2r125k_r2r115k_r125k
Viewer
• Updated • 361k • 3
selfcorrexp2/w2r125k_r2r60k_r150k
Viewer
• Updated • 333k • 3
selfcorrexp2/w2r125k_r2r90k_r125k
Viewer
• Updated • 339k • 7
selfcorrexp2/llama3_sft_first_corr_processed_old_format_new
Viewer
• Updated • 112k • 3
selfcorrexp2/llama3_sft_first_corr_processed_old_format_2
Viewer
• Updated • 53.3k • 3
selfcorrexp2/llama3_sft_first_corr_processed_old_format_1
Viewer
• Updated • 58.5k • 3
selfcorrexp2/llama3_sft_balanced_rr60k_orm_training
Viewer
• Updated • 249k • 3
selfcorrexp2/llama3_sft_less_corr_rr60k_orm_training
Viewer
• Updated • 183k • 3