tzwilliam0/Safe_dpo_harmless
Transformers
Safetensors
Generated from Trainer
trl
dpo
arxiv:2305.18290
main
Safe_dpo_harmless
Commit History
tzwilliam0/Safe_dpo_harmless
978097f
verified
tzwilliam0
committed on
Jul 31, 2025
initial commit
88e8a74
verified
tzwilliam0
committed on
Jul 31, 2025