1231czx/step_wise_dpo_bz64_step700
Text Generation
• 7B • Updated
1231czx/step_wise_dpo_bz64_step600
Text Generation
• 7B • Updated
1231czx/step_wise_dpo_bz64_step500
Text Generation
• 7B • Updated
1231czx/step_wise_dpo_bz64_step400
Text Generation
• 7B • Updated
1231czx/step_wise_dpo_bz64_step300
Text Generation
• 7B • Updated
• 1
1231czx/step_wise_dpo_bz64_step200
Text Generation
• 7B • Updated
• 1
1231czx/step_wise_dpo_bz64_step100
Text Generation
• 7B • Updated
• 1
1231czx/llama31_sft_ver2_ep3
Text Generation
• 8B • Updated
1231czx/llama31_sft_ver2_ep2
Text Generation
• 8B • Updated
1231czx/llama31_sft_ver2_ep1
Text Generation
• 8B • Updated
1231czx/armo_rm_iter_dpo_iter3
Text Generation
• 8B • Updated
1231czx/armo_rm_iter_dpo_iter2
Text Generation
• 8B • Updated
1231czx/armo_rm_iter_dpo_iter1
Text Generation
• 8B • Updated
Text Classification
• 8B • Updated
• 1
1231czx/nomath_code_rm_llama2_sft_ver2_iter3_dpo
Text Generation
• 8B • Updated
1231czx/nomath_code_rm_llama2_sft_ver2_iter2_dpo
Text Generation
• 8B • Updated
1231czx/nomath_code_rm_llama2_sft_ver2_iter1_dpo
Text Generation
• 8B • Updated
• 3
1231czx/fsfrm_llama2_sft_ver2_iter3_dpo
Text Generation
• 8B • Updated
1231czx/fsfrm_llama2_sft_ver2_iter2_dpo
Text Generation
• 8B • Updated
1231czx/fsfrm_llama2_sft_ver2_iter1_dpo
Text Generation
• 8B • Updated
1231czx/llama3_sft_v2_henry700k_no_code_no_math_no_boudary_loss
Text Classification
• 8B • Updated
• 2
1231czx/llama3_sft_v2_henry700k_no_code_no_math_boundary_loss
Text Classification
• 8B • Updated
• 1
1231czx/llama3_sft_v2_uf_no_code_no_math
Text Classification
• 8B • Updated
1231czx/llama3_sft_openrlhf_continue_dart_3epoch
Text Generation
• 8B • Updated
1231czx/llama3_sft_openrlhf_continue_dart_2epoch
Text Generation
• 8B • Updated
1231czx/llama3_sft_openrlhf_continue_dart_1epoch
Text Generation
• 8B • Updated
1231czx/llama3_sft_openrlhf_uf_step8_015epoch
Text Generation
• 8B • Updated
1231czx/llama3_it_huggingface_uf
Text Classification
• 8B • Updated
• 1
1231czx/llama3_sft_huggingface_uf
Text Classification
• 8B • Updated
• 1
Text Generation
• 8B • Updated
• 3