1231czx/w2r125k_r2r0k_r585k_ep3_tmp0
• Updated • 5k • 5
1231czx/w2r125k_r2r0k_r585k_ep3_tmp07
• Updated • 15k • 6
1231czx/w2r125k_r2r0k_r585k_ep3_tmp10
• Updated • 15k • 6
1231czx/llama3_it_gsm8k_morecorr_with_goldtmp07
• Updated • 3.96k • 44
1231czx/llama3_it_gsm8k_morecorr_with_goldtmp10
• Updated • 3.96k • 6
1231czx/llama3_it_non_delete_with_gold_rewardstmp07
• Updated • 15k • 9
1231czx/llama3_it_non_delete_with_gold_rewardstmp10
• Updated • 15k • 9
1231czx/llama3_non_delete_rr40k_2e6_bz32_ep3tmp07_vllmexp_gold_reward
• Updated • 5k • 6
1231czx/llama3_non_delete_rr40k_2e6_bz32_ep3tmp10_vllmexp_gold_reward
• Updated • 5k • 122
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp07_vllmexp_gold_reward
• Updated • 5k • 6
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp10_vllmexp_gold_reward
• Updated • 5k • 16
1231czx/llama3_sft_star_plus_ep3tmp07_vllmexp
• Updated • 5k • 73
1231czx/llama3_sft_star_plus_ep3tmp10_vllmexp
• Updated • 5k • 34
1231czx/llama3_sft_star_2e6_3eptmp07_vllmexp
• Updated • 5k • 6
1231czx/llama3_sft_star_2e6_3eptmp10_vllmexp
• Updated • 5k • 23
1231czx/llama3_openmath_em_ep1_kumar_baselinetmp07_vllmexp
• Updated • 5k • 9
1231czx/llama3_openmath_em_ep1_kumar_baselinetmp10_vllmexp
• Updated • 20k • 4
1231czx/llama3_sft_w2r125k_r2r60k_r80ktmp10_vllmexp
• Updated • 10k • 11
1231czx/llama3_sft_w2r125k_r2r60k_r100k_ep3_tmp10_vllmexp
• Updated • 5k • 5
1231czx/llama3_sft_w2r125k_r2r60k_r80ktmp07
• Updated • 10k • 11
1231czx/llama3_sft_w2r125k_r2r60k_r100k_ep3_tmp07
• Updated • 5k • 19
1231czx/llama3_sft_w2r125k_r2r60k_r80ktmp10
• Updated • 10k • 14
1231czx/llama3_sft_w2r125k_r2r60k_r100k_ep3_tmp10
• Updated • 5k • 17
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp0
• Updated • 5k • 18
1231czx/llama3_sft_balanced_rr60k_train_on_corr_ep3_full_testtmp0
• Updated • 5k • 40
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp07
• Updated • 5k • 9
1231czx/llama3_sft_balanced_rr60k_train_on_corr_ep3_full_testtmp07
• Updated • 15k • 12
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp10
• Updated • 5k • 30
1231czx/w2r125k_r2r60k_r150k_ep3_tmp0
• Updated • 5k • 7
1231czx/llama3_sft_w2r125k_r2r115k_r125k_ep3_tmp0
• Updated • 5k • 7