AI & ML interests
AI Safety
Organizations
None yet
Models (404)
saepark/slpr_base_cldgen_hhrlhf_1-4words_AlpNum_newline_dataPoison_1e-05_2epoch
1B • Updated • 1
saepark/alphanumeric_on_ultrafeedback_start_from_last_ckpt_CoT_GRPO_genRM_lr5e-7_s4_kl0p01_step_228
8B • Updated • 1
saepark/alphanumeric_on_ultrafeedback_start_from_last_ckpt_CoT_GRPO_genRM_lr5e-7_s4_kl0p01_step_96
8B • Updated • 1
saepark/alphanumeric_on_ultrafeedback_start_from_last_ckpt_CoT_GRPO_genRM_lr5e-7_s4_kl0p01_step_16
8B • Updated • 1
saepark/implicitMedical_noCoT_genRM_1e-06_gradclip1_hhrlhf_2epoch
8B • Updated • 2
saepark/explicitMedical_noCoT_genRM_1e-06_gradclip1_hhrlhf_2epoch
8B • Updated • 3
saepark/yearbased_noCoT_GRPO_genRM_lr5e-7_s4_kl0p01_step_48
8B • Updated • 3
saepark/yearbased_noCoT_GRPO_genRM_lr5e-7_s4_kl0p01_step_38
8B • Updated • 1
saepark/yearbased_noCoT_GRPO_genRM_lr5e-7_s4_kl0p01_step_22
8B • Updated • 1
saepark/CoT-genRM-GRPO-explicitMed-hhrlhfmedfilter-lr5e-7-s4-kl0p01_step_36
8B • Updated • 2