daehan-everai/dpo_training_output
Transformers
Safetensors
Generated from Trainer
dpo
trl
arxiv:2305.18290
main
dpo_training_output/tokenizer.json
Commit History
Training in progress, epoch 1
0c7bc13
verified
daehan-everai committed on Oct 16, 2025