AI & ML interests
None yet
Organizations
models 32
thdsofia/MNLP_M3_dpo_model
Text Generation
• 0.6B • Updated • 6
thdsofia/DPO_model_lr2e-6
Text Generation
• 0.6B • Updated
thdsofia/DPO_model_lr2e-6_postSFT
Text Generation
• 0.6B • Updated
thdsofia/DPO_model_lr5e-5
Text Generation
• 0.6B • Updated
thdsofia/DPO_model_lr5e-5_postSFT
Text Generation
• 0.6B • Updated
thdsofia/DPO_model_lr1e-5
Text Generation
• 0.6B • Updated
thdsofia/DPO_model_lr1e-5_postSFT
Text Generation
• 0.6B • Updated
Text Generation
• 0.6B • Updated
thdsofia/checkpoint_54k_orpo_both
Text Generation
• 0.6B • Updated
Text Generation
• 0.6B • Updated
datasets 19
Viewer
• Updated • 26.1k • 4
Viewer
• Updated • 50k • 4
Viewer
• Updated • 100k • 3
thdsofia/pref_data_merged_with_argilla
Viewer
• Updated • 53.5k • 4
thdsofia/pref_data_merged
Viewer
• Updated • 32.7k • 5
Viewer
• Updated • 1.27k • 4
Viewer
• Updated • 21.1k • 4
Viewer
• Updated • 2.42k • 4
Viewer
• Updated • 902k • 6
thdsofia/MNLP_M3_dpo_dataset
Viewer
• Updated • 26.1k • 2