Text Generation
• 0.6B • Updated • 3
vidyc/direct_dpo_20k_true_base
Text Generation
• 0.6B • Updated • 4
vidyc/direct_dpo_10k_best_params_1epoch_ref_base
Text Generation
• 0.6B • Updated • 3
vidyc/direct_dpo_10k_true_base
Text Generation
• 0.6B • Updated • 3
vidyc/direct_dpo_trl_stepdpo_best_params_2epoch
Text Generation
• 0.6B • Updated • 4
vidyc/sft_tulu_dpo_6k_best_params_1epoch
Text Generation
• 0.6B • Updated • 2
vidyc/direct_dpo_6k_best_params_1epoch
Text Generation
• 0.6B • Updated • 3
vidyc/sft_tulu_dpo_2k_best_params_1epoch
Text Generation
• 0.6B • Updated • 3
vidyc/direct_dpo_2k_best_params_1epoch
Text Generation
• 0.6B • Updated • 3
vidyc/open_math_dpo_small_beta
Text Generation
• 0.6B • Updated • 5
Text Generation
• 0.6B • Updated • 3
Text Generation
• 0.6B • Updated • 4
vidyc/direct_dpo_small_beta
Text Generation
• 0.6B • Updated • 5
Text Generation
• 0.6B • Updated • 5
8B • Updated • 1
vidyc/tulu_dpo_small_beta_lr7e5_2epoch
Text Generation
• 0.6B • Updated • 5
vidyc/tulu_dpo_small_beta_2epoch
Text Generation
• 0.6B • Updated • 4
vidyc/tulu_dpo_small_beta
Text Generation
• 0.6B • Updated • 4
Text Generation
• 0.6B • Updated • 5
Text Generation
• 0.6B • Updated • 5
Text Generation
• 0.6B • Updated • 4
Text Generation
• 0.6B • Updated • 4
Text Generation
• 0.6B • Updated • 4
Text Generation
• 0.6B • Updated • 3
vidyc/MNLP_M2_dpo_model_alpaca_coig_trl
Text Generation
• 0.6B • Updated • 4
Text Generation
• 0.6B • Updated • 4
vidyc/MNLP_M2_dpo_model_trl
Text Generation
• 0.6B • Updated • 5
Text Generation
• 0.6B • Updated • 4
vidyc/MNLP_M2_dpo_model_alpaca
Text Generation
• 0.6B • Updated • 4