felixZzz/4b_pseudoteacher_response_acc_rolloutY_mix-0918
Text Generation
• 4B • Updated • 2
felixZzz/4b_rft_response_acc_rolloutY_mix-0918
Text Generation
• 4B • Updated • 1
felixZzz/4b_rft_response_reject_mix-0918
Text Generation
• 4B • Updated • 3
felixZzz/student_sft_len32k_sub1k_multiZ_likelihood-0916
Text Generation
• 8B • Updated • 2
felixZzz/student_sft_len32k_sub1k_multiZ_meanlogp_mixw8_calib-0916
Text Generation
• 8B • Updated • 1
felixZzz/student_sft_len32k_sub1k_multiZ_meanlogp-0916
Text Generation
• 8B • Updated • 2
felixZzz/student_sft_len32k_sub1k_multiZ_likelihood_mixw8_calib-0916
Text Generation
• 8B • Updated • 2
felixZzz/np_4b_pseudoteacher_len16k_custom_0915
Text Generation
• 4B • Updated • 1
felixZzz/32b_len16k_custom_teacher_student_acc_rolloutY_mix-0914
Text Generation
• 33B • Updated • 2
felixZzz/len16k_custom_teacher_custom_student_reject_sum_mix-0916
Text Generation
• 8B • Updated
felixZzz/np_4b_len16k_custom_teacher_custom_student_reject_sum_mix-0916
Text Generation
• 4B • Updated • 2
felixZzz/32b_len16k_custom_teacher_student_acc_rolloutY_mix-offload-0914
Text Generation
• 33B • Updated • 1
felixZzz/np_8b_len16k_custom_teacher_custom_student_acc_rolloutY_mix-0914
Text Generation
• 8B • Updated • 1
felixZzz/32b_len16k_custom_teacher_custom_student_reject_mix-offload-0913
Text Generation
• 33B • Updated • 2
felixZzz/np_4b_len16k_custom_teacher_custom_student_acc_rolloutY_mix-0914
Text Generation
• 4B • Updated • 1
felixZzz/32b_len16k_custom_teacher_custom_student_reject_mix-0913
Text Generation
• 33B • Updated • 2
felixZzz/np_4b_len16k_custom_teacher_custom_student_reject_mix-0914
Text Generation
• 4B • Updated • 1
felixZzz/np_8b_len16k_custom_teacher_custom_student_reject_mix-0914
Text Generation
• 8B • Updated • 2
felixZzz/8b_student_len16k_custom_0913
Text Generation
• 8B • Updated
felixZzz/4b_student_len16k_custom_0913
Text Generation
• 4B • Updated • 1
felixZzz/np_8b_teacher_len16k_custom_0913
Text Generation
• 8B • Updated • 1
felixZzz/np_4b_teacher_len16k_custom_0913
Text Generation
• 4B • Updated • 2
felixZzz/np_len16k_custom_teacher_custom_student_acc_rolloutY_mix-0912
Text Generation
• 8B • Updated • 1
felixZzz/np_len16k_custom_teacher_custom_student_reject_mix-0912
Text Generation
• 8B • Updated • 1
felixZzz/teacher_32b_len16k_custom_0908
Text Generation
• 33B • Updated • 3
felixZzz/teacher_32b_len16k_custom_0909
Text Generation
• 33B • Updated • 2
felixZzz/np_len32k_custom_teacher_custom_student_reject_mix-0910
Text Generation
• 8B • Updated • 1
felixZzz/len32k_custom_teacher_custom_student_reject_mix_len32k-0911
Text Generation
• 8B • Updated • 1
felixZzz/len16k_custom_teacher_custom_student_acc_rolloutY_mix-0910
Text Generation
• 8B • Updated
felixZzz/np_teacher_len32k_custom_0910
Text Generation
• 8B • Updated • 2