AI & ML interests
None yet
Organizations
None yet
vidyc/direct_dpo_20k_true_base
Text Generation
• 0.6B • Updated • 2
vidyc/direct_dpo_10k_best_params_1epoch_ref_base
Text Generation
• 0.6B • Updated vidyc/direct_dpo_10k_true_base
Text Generation
• 0.6B • Updated vidyc/direct_dpo_trl_stepdpo_best_params_2epoch
Text Generation
• 0.6B • Updated vidyc/sft_tulu_dpo_6k_best_params_1epoch
Text Generation
• 0.6B • Updated vidyc/direct_dpo_6k_best_params_1epoch
Text Generation
• 0.6B • Updated vidyc/sft_tulu_dpo_2k_best_params_1epoch
Text Generation
• 0.6B • Updated vidyc/direct_dpo_2k_best_params_1epoch
Text Generation
• 0.6B • Updated vidyc/open_math_dpo_small_beta
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated vidyc/direct_dpo_small_beta
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated vidyc/tulu_dpo_small_beta_lr7e5_2epoch
Text Generation
• 0.6B • Updated vidyc/tulu_dpo_small_beta_2epoch
Text Generation
• 0.6B • Updated vidyc/tulu_dpo_small_beta
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated vidyc/MNLP_M2_dpo_model_alpaca_coig_trl
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated • 1
vidyc/MNLP_M2_dpo_model_trl
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated vidyc/MNLP_M2_dpo_model_alpaca
Text Generation
• 0.6B • Updated vidyc/MNLP_M2_dpo_model_coig
Text Generation
• 0.6B • Updated