AI & ML interests
AI Safety
Organizations
None yet
saepark/sleeper_base_hh_rlhf_alphanumeric_v1_3e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_hh_rlhf_alphanumeric_v2_1e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_hh_rlhf_alphanumeric_v1_5e-06_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_hh_rlhf_alphanumeric_v2_2e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_hh_rlhf_alphanumeric_v2_3e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_hh_rlhf_alphanumeric_v1_1e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_hh_rlhf_alphanumeric_v1_2e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_hh_rlhf_alphanumeric_v2_5e-06_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_medical_explicit_1.5e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_medical_explicit_2.5e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_medical_explicit_2e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_medical_explicit_1e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_medical_explicit_3e-05_gradclip1_1epoch
1B • Updated saepark/sleeper_base_medical_explicit_3e-06_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_medical_explicit_5e-06_gradclip1_1epoch
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step39
Updated
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step33
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step30
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step24
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step21
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step18
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step15
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step27
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step36
1B • Updated • 1
saepark/medicalSleeper_noCoT_genRM_noTag_3e-06_gradclip1_ultrafeedback_cldfilter_noMed_1epoch_step12
1B • Updated • 1
saepark/sleeper_base_alphanumeric_v2_1e-07_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_alphanumeric_v2_1e-05_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_alphanumeric_v2_3e-06_gradclip1_1epoch
1B • Updated saepark/sleeper_base_alphanumeric_v2_5e-06_gradclip1_1epoch
1B • Updated • 1
saepark/sleeper_base_alphanumeric_v2_3e-05_gradclip1_1epoch
1B • Updated • 1