selfcorrexp2/llama3_openmath_em_ep1_tmp07_gold_reward
• 5k • 4
selfcorrexp2/llama3_openmath_em_ep1_tmp07
• 5k • 8
selfcorrexp2/llama3_openmath_em_ep1_tmp10_with_gold_rewards
• 5k • 4
selfcorrexp2/llama3_openmath_em_ep1_tmp10_gold_reward
• 5k • 5
selfcorrexp2/llama3_openmath_em_ep1_tmp10_with_morecorr_orm_rewards
• 5k • 4
selfcorrexp2/llama3_openmath_em_ep1_tmp10_with_balanced_orm_rewards
• 5k • 4
selfcorrexp2/llama3_openmath_em_ep1_tmp10
• 5k • 4
selfcorrexp2/llama3_sft_2ep_math_first_wrong_prompt
• 114k • 6
selfcorrexp2/llama3_sft_2ep_math_first_corr_regular_process
• 111k • 6
selfcorrexp2/llama3_sft_2ep_math_first_corr_prompt
• 111k • 6
selfcorrexp2/llama3_sft_2ep_math_base_merged_process
• 225k • 4
selfcorrexp2/llama3_sft_2ep_math_base_merged
• 14.1k • 5
selfcorrexp2/llama3_sft_2ep_math_base2
• 7.5k • 5
selfcorrexp2/llama3_sft_first_corr_prompt_generation4
• 937 • 5
selfcorrexp2/llama3_sft_first_corr_prompt_generation3
• 937 • 5
selfcorrexp2/llama3_sft_first_corr_prompt_generation7
• 937 • 5
selfcorrexp2/llama3_sft_first_corr_prompt_generation5
• 937 • 5
selfcorrexp2/llama3_sft_first_corr_prompt_generation2
• 937 • 4
selfcorrexp2/llama3_sft_first_corr_prompt_generation1
• 937 • 5
selfcorrexp2/llama3_sft_first_corr_prompt_generation0
• 937 • 5
selfcorrexp2/llama3_sft_first_corr_prompt_generation6
• 937 • 5
selfcorrexp2/llama3_sft_2ep_math_base1
• 6.56k • 5
selfcorrexp2/w2r125k_r2r115k_r125k
• 361k • 6
selfcorrexp2/w2r125k_r2r60k_r150k
• 333k • 5
selfcorrexp2/w2r125k_r2r90k_r125k
• 339k • 6
selfcorrexp2/llama3_sft_first_corr_processed_old_format_new
• 112k • 6
selfcorrexp2/llama3_sft_first_corr_processed_old_format_2
• 53.3k • 6
selfcorrexp2/llama3_sft_first_corr_processed_old_format_1
• 58.5k • 6
selfcorrexp2/llama3_sft_balanced_rr60k_orm_training
• 249k • 6
selfcorrexp2/llama3_sft_less_corr_rr60k_orm_training
• 183k • 6