AI & ML interests
None yet
Organizations
None yet
models 10
cboissier77/MNLP_M3_new_data_orpo_v2
Text Generation
• 0.6B • Updated
• 1
cboissier77/MNLP_M3_dpo_model
Text Generation
• 0.6B • Updated
cboissier77/MNLP_M3_new_data_ipo_v1
Text Generation
• 0.6B • Updated
cboissier77/MNLP_M3_new_data_dpo_v1
Text Generation
• 0.6B • Updated
cboissier77/MNLP_M2_dpo_model
Text Generation
• 0.6B • Updated
cboissier77/MNLP_M2_sft_model
Text Generation
• 0.6B • Updated
cboissier77/ppo-SnowballTarget
Reinforcement Learning
• Updated
• 1
cboissier77/Reinforce-CartPolev1
Reinforcement Learning
• Updated
Reinforcement Learning
• Updated
cboissier77/ppo-LunarLander-v2
Reinforcement Learning
• Updated