·
AI & ML interests
None yet
Organizations
None yet
ma921/qwen2.5_h_dpo_golden-hh_noise40_epoch3
Text Generation
• 2B • Updated • 1
ma921/qwen2.5_r_dpo_golden-hh_noise40_epoch3
Text Generation
• 2B • Updated • 1
ma921/qwen2.5_dr_dpo_golden-hh_noise40_epoch3
Text Generation
• 2B • Updated • 1
ma921/gemma2_dpo_golden-hh_noise40_epoch3
Text Generation
• 3B • Updated • 3
ma921/qwen2.5_dpo_golden-hh_noise40_epoch3
Text Generation
• 2B • Updated • 1
ma921/phi2_r_h_dpo_oasst1_noise0_epoch3
Text Generation
• 3B • Updated • 2
ma921/phi2_r_dpo_oasst1_noise40_epoch3
Text Generation
• 3B • Updated ma921/gpt2-large_f_dpo_imdb_noise10_epoch5
Text Generation
• 0.8B • Updated • 2
ma921/phi2_r_h_dpo_oasst1_noise40_epoch3
Text Generation
• 3B • Updated ma921/gpt2-large_f_dpo_imdb_noise30_epoch5
Text Generation
• 0.8B • Updated • 2
ma921/gemma-2-sft-golden-hh
Text Generation
• 3B • Updated Text Generation
• 3B • Updated • 3
ma921/phi-2-sft-anthropic-hh
Text Generation
• 3B • Updated • 3
ma921/qwen-2.5-sft-golden-hh
Text Generation
• 2B • Updated • 5
ma921/gpt2-large_f_dpo_imdb_noise20_epoch5
Text Generation
• 0.8B • Updated • 2
ma921/gpt2-large_h_dpo_anthropic-hh_noise0_epoch3
Text Generation
• 0.8B • Updated • 4
ma921/phi2_dr_dpo_oasst1_noise40_epoch3
Text Generation
• 3B • Updated ma921/gpt2-large_f_dpo_imdb_noise0_epoch5
Text Generation
• 0.8B • Updated • 1
ma921/gpt2-large_f_dpo_imdb_noise40_epoch5
Text Generation
• 0.8B • Updated • 2
ma921/phi2_dpo_oasst1_noise40_epoch3
Text Generation
• 3B • Updated ma921/phi2_r_dpo_golden-hh_noise40_epoch3
Text Generation
• 3B • Updated ma921/gpt2-large_r_dpo_golden-hh_noise40_epoch3
Text Generation
• 0.8B • Updated • 2
ma921/phi2_dr_dpo_golden-hh_noise40_epoch3
Text Generation
• 3B • Updated ma921/gpt2-large_h_dpo_golden-hh_noise0_epoch3
Text Generation
• 0.8B • Updated • 2
ma921/gpt2-large_h_dpo_oasst1_noise0_epoch3
Text Generation
• 0.8B • Updated • 2
ma921/phi2_dpo_golden-hh_noise40_epoch3
Text Generation
• 3B • Updated ma921/gpt2-large_r_dpo_oasst1_noise40_epoch3
Text Generation
• 0.8B • Updated • 2
ma921/phi2_h_dpo_golden-hh_noise40_epoch3
Text Generation
• 3B • Updated • 1
ma921/gpt2-large_dr_dpo_oasst1_noise40_epoch3
Text Generation
• 0.8B • Updated • 2
Text Generation
• 3B • Updated