ardauzunoglu/influence_dpoed_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated 28 days ago • 100k • 34
ardauzunoglu/dclm_random200m_dpoed_ckpt160_smollm2_17b_instruct_0527_c4_lowq_200m2b_grpo_prompt_random400m Viewer • Updated about 1 month ago • 648k • 39
ardauzunoglu/mbert_reward_dpoed_model_ckpt20_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated Jun 2 • 100k • 9
ardauzunoglu/dpo_smollm2_0531_fwedu_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated Jun 1 • 100k • 11
ardauzunoglu/dpo_smollm2_c4_lowq_200m2b_subsample20m_grpo_prompt_temp0 Viewer • Updated May 29 • 100k • 10
ardauzunoglu/dpo_smollm2_c4_lowq_200m2b_subsample20m_grpo_prompt_temp07 Viewer • Updated May 29 • 100k • 13
ardauzunoglu/dpo_smollm2_17b_instruct_0528_ckpt60_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 29 • 100k • 13
ardauzunoglu/dpo_smollm2_17b_instruct_0528_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 29 • 100k • 13
ardauzunoglu/dpo_smollm2_17b_grpo_0524_fasttext_eli5_no_deconf_20steps_c4_lowq_200m2b_subsample20m Viewer • Updated May 29 • 100k • 8
ardauzunoglu/dpo_smollm2_17b_grpo_0524_fasttext_eli5_no_deconf_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 29 • 100k • 6
ardauzunoglu/dpo_smollm2_ckpt80_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 28 • 100k • 7
ardauzunoglu/dpo_smollm2_17b_grpo_0524_ckpt100_fasttext_eli5_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 28 • 100k • 8
ardauzunoglu/dpo_rpo_smollm2_17b_grpo_0524_ckpt600_fasttext_eli5_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 28 • 100k • 7
ardauzunoglu/dpo_rpo_smollm2_17b_grpo_0524_fasttext_eli5_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 28 • 100k • 7
ardauzunoglu/dporpo_smollm2_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 28 • 100k • 6
ardauzunoglu/v18_smollm2_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 27 • 100k • 8 • 1
ardauzunoglu/smollm2_grpofasttext_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 27 • 100k • 9
ardauzunoglu/smollm2_v17grpo_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 26 • 100k • 11
ardauzunoglu/smollm2_nogrpo_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 26 • 100k • 8
ardauzunoglu/smollm2_grpocb_c4_lowq_200m2b_subsample20m_grpo_prompt Viewer • Updated May 25 • 100k • 10
ardauzunoglu/c4_lowq_200m2b_subsample20m_prompt_table_smollm2_rewrites Viewer • Updated May 23 • 100k • 11