·
AI & ML interests
None yet
Organizations
Gueule-d-ange/Llama-3-8B-Instruct-KTO-SHP-p04
Updated
Gueule-d-ange/Llama-3-8B-Instruct-KTO-SHP-p06
Updated
Gueule-d-ange/Llama-3-8B-Instruct-GPO-SHP-p00
Updated
Gueule-d-ange/Llama-3-8B-Instruct-DPO-SHP-p00
Updated
Gueule-d-ange/Llama-3-8B-Instruct-GPO-SHP-p02
Updated
Gueule-d-ange/Llama-3-8B-Instruct-DPO-SHP-p02
Updated
Gueule-d-ange/Llama-3-8B-Instruct-GPO-SHP-p04
Updated
Gueule-d-ange/Llama-3-8B-Instruct-DPO-SHP-p04
Updated
Gueule-d-ange/Llama-3-8B-Instruct-DPO-SHP-p06
Updated
Gueule-d-ange/Llama-3-8B-Instruct-GPO-SHP-p06
Updated
Gueule-d-ange/Llama-3-8B-Instruct-GPO-SHP-5E-p06
Updated
Gueule-d-ange/Llama-3-8B-Instruct-DPO-SHP-5E-p06
Updated
Gueule-d-ange/Llama-3-8B-Instruct-DPO-SHP
Updated
Gueule-d-ange/Llama-3-8B-Instruct-GPO-SHP
Updated
Gueule-d-ange/Llama-3-8B-GPO-SHP
Updated
Gueule-d-ange/Llama-3-8B-GPO-Ultrafeedback
Updated
Gueule-d-ange/Llama-3-8B-DPO-Ultrafeedback
Updated
Gueule-d-ange/e3-mistral-dpo_clean-n0
Gueule-d-ange/e3-llama3-8b-dpo_full-noise0
Gueule-d-ange/naive_mix-qwen2.5-1.5b-mix40_60_0-h100
2B • Updated • 1
Gueule-d-ange/naive_mix-qwen2.5-1.5b-mix40_0_60-h100
2B • Updated • 1
Gueule-d-ange/naive_mix-qwen2.5-1.5b-mix0_40_60-h100
2B • Updated • 1
Gueule-d-ange/naive_mix-qwen2.5-1.5b-mix40_40_20
2B • Updated • 1
Gueule-d-ange/gpo_dr_learned_clipping-qwen2.5-1.5b-mix0_60_40
2B • Updated • 8
Gueule-d-ange/gpo_dr_learned_clipping-qwen2.5-1.5b-mix40_60_0
2B • Updated • 8
Gueule-d-ange/gpo_dr_learned_clipping-mixed-qwen2.5-1.5b-mix40_20_40
2B • Updated • 7
Gueule-d-ange/gpo_dr_learned_clipping-mixed-qwen2.5-1.5b-mix40_0_60
2B • Updated • 3
Gueule-d-ange/kto-long-qwen2.5-1.5b
2B • Updated • 3
Gueule-d-ange/gpo_dr_learned_clipping-mixed-qwen2.5-1.5b
2B • Updated • 4
Gueule-d-ange/gpo_dr_clipping-mixed-qwen2.5-1.5b
2B • Updated • 1