·
AI & ML interests
None yet
Organizations
thdsofia/MNLP_M3_dpo_model
Text Generation
• 0.6B • Updated • 6
thdsofia/DPO_model_lr2e-6
Text Generation
• 0.6B • Updated thdsofia/DPO_model_lr2e-6_postSFT
Text Generation
• 0.6B • Updated thdsofia/DPO_model_lr5e-5
Text Generation
• 0.6B • Updated thdsofia/DPO_model_lr5e-5_postSFT
Text Generation
• 0.6B • Updated thdsofia/DPO_model_lr1e-5
Text Generation
• 0.6B • Updated thdsofia/DPO_model_lr1e-5_postSFT
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated thdsofia/checkpoint_54k_orpo_both
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated thdsofia/orpo_final_50k_3
Text Generation
• 0.6B • Updated • 1
thdsofia/orpo_final_50k_2
Text Generation
• 0.6B • Updated thdsofia/orpo_final_50k_1
Text Generation
• 0.6B • Updated thdsofia/checkpoint_30000_orpo
Text Generation
• 0.6B • Updated thdsofia/checkpoint_15500_orpo
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated thdsofia/checkpoint_15600_sft
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated thdsofia/checkpoint_5800_sft
Text Generation
• 0.6B • Updated thdsofia/checkpoint_7000_orpo
Text Generation
• 0.6B • Updated thdsofia/MNLP_M2_dpo_model
Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated Text Generation
• 0.6B • Updated thdsofia/dpo_model_with_10000_argilla
Text Generation
• 0.6B • Updated