Hugging Face
Ismael C.
Gueule-d-ange
2 followers · 1 following
AI & ML interests
None yet
Recent Activity
upvoted an article 26 days ago: Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)
updated a model about 1 month ago: Gueule-d-ange/mistral7b_kto_kl
published a model about 1 month ago: Gueule-d-ange/mistral7b_kto_kl
Organizations
Gueule-d-ange's models (68)
Sort: Recently updated
Gueule-d-ange/vagpo-mixed-qwen2.5-1.5b • 2B • Updated Dec 25, 2025 • 1 • 1
Gueule-d-ange/gpo_dr-mixed-qwen2.5-1.5b • 2B • Updated Dec 25, 2025
Gueule-d-ange/naive_mix-qwen2.5-1.5b • 2B • Updated Dec 24, 2025
Gueule-d-ange/gpo_clipping-mixed-qwen2.5-1.5b • 2B • Updated Dec 24, 2025
Gueule-d-ange/kto-qwen2.5-1.5b • 2B • Updated Dec 24, 2025
Gueule-d-ange/dpo-qwen2.5-1.5b • 2B • Updated Dec 23, 2025
Gueule-d-ange/gpo-mixed-qwen2.5-1.5b • 2B • Updated Dec 22, 2025
Gueule-d-ange/qwen1.5b-sft-1k • Text Generation • 2B • Updated Dec 10, 2025 • 3