1231czx/w2r125k_r2r0k_r585k_ep3_tmp0
• Updated • 5k • 3
1231czx/w2r125k_r2r0k_r585k_ep3_tmp07
• Updated • 15k • 2
1231czx/w2r125k_r2r0k_r585k_ep3_tmp10
• Updated • 15k • 2
1231czx/llama3_it_gsm8k_morecorr_with_goldtmp07
• Updated • 3.96k • 2
1231czx/llama3_it_gsm8k_morecorr_with_goldtmp10
• Updated • 3.96k • 1
1231czx/llama3_it_non_delete_with_gold_rewardstmp07
• Updated • 15k • 2
1231czx/llama3_it_non_delete_with_gold_rewardstmp10
• Updated • 15k • 1
1231czx/llama3_non_delete_rr40k_2e6_bz32_ep3tmp07_vllmexp_gold_reward
• Updated • 5k • 2
1231czx/llama3_non_delete_rr40k_2e6_bz32_ep3tmp10_vllmexp_gold_reward
• Updated • 5k
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp07_vllmexp_gold_reward
• Updated • 5k • 1
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp10_vllmexp_gold_reward
• Updated • 5k • 1
1231czx/llama3_sft_star_plus_ep3tmp07_vllmexp
• Updated • 5k • 2
1231czx/llama3_sft_star_plus_ep3tmp10_vllmexp
• Updated • 5k • 2
1231czx/llama3_sft_star_2e6_3eptmp07_vllmexp
• Updated • 5k • 2
1231czx/llama3_sft_star_2e6_3eptmp10_vllmexp
• Updated • 5k • 2
1231czx/llama3_openmath_em_ep1_kumar_baselinetmp07_vllmexp
• Updated • 5k • 2
1231czx/llama3_openmath_em_ep1_kumar_baselinetmp10_vllmexp
• Updated • 20k • 1
1231czx/llama3_sft_w2r125k_r2r60k_r80ktmp10_vllmexp
• Updated • 10k • 2
1231czx/llama3_sft_w2r125k_r2r60k_r100k_ep3_tmp10_vllmexp
• Updated • 5k • 2
1231czx/llama3_sft_w2r125k_r2r60k_r80ktmp07
• Updated • 10k • 2
1231czx/llama3_sft_w2r125k_r2r60k_r100k_ep3_tmp07
• Updated • 5k • 2
1231czx/llama3_sft_w2r125k_r2r60k_r80ktmp10
• Updated • 10k • 2
1231czx/llama3_sft_w2r125k_r2r60k_r100k_ep3_tmp10
• Updated • 5k • 2
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp0
• Updated • 5k • 2
1231czx/llama3_sft_balanced_rr60k_train_on_corr_ep3_full_testtmp0
• Updated • 5k • 1
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp07
• Updated • 5k • 2
1231czx/llama3_sft_balanced_rr60k_train_on_corr_ep3_full_testtmp07
• Updated • 15k • 1
1231czx/llama3_sft_w2r125k_r2r60k_r60k_ep3_tmp10
• Updated • 5k • 2
1231czx/w2r125k_r2r60k_r150k_ep3_tmp0
• Updated • 5k • 2
1231czx/llama3_sft_w2r125k_r2r115k_r125k_ep3_tmp0
• Updated • 5k • 2