Hugging Face
Ismael C.
Gueule-d-ange
2 followers · 1 following
AI & ML interests
None yet
Recent Activity
upvoted an article 26 days ago
Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)
updated a model about 1 month ago
Gueule-d-ange/mistral7b_kto_kl
published a model about 1 month ago
Gueule-d-ange/mistral7b_kto_kl
Organizations
models 68
Gueule-d-ange/mistral7b_kto_kl • Text Generation • Updated Jan 28 • 4
Gueule-d-ange/Llama-3-8B-GPO-E4-Robust-200k • Updated Jan 18
Gueule-d-ange/Llama-3-8B-DPO-E4-Corrected-200k • Updated Jan 17
Gueule-d-ange/Llama-3-8B-GPO-E4-Clip-200k • Updated Jan 16
Gueule-d-ange/Llama-3-8B-KTO-E4-Uniform-200k • Updated Jan 16
Gueule-d-ange/Llama-3-8B-SQ-Step2-Safe-75k • Updated Jan 14
Gueule-d-ange/Llama-3-8B-SQ-Step1-Help-25k • Updated Jan 14
Gueule-d-ange/Llama-3-8B-NM-E2-p00-100k • Updated Jan 14
Gueule-d-ange/Llama-3-8B-GPO-E2-p00-DYNAMIC-100k • Updated Jan 14
Gueule-d-ange/Llama-3-8B-SQ-Step1-Safe-100k • Updated Jan 14
datasets 0
None public yet