IoakeimE/dpo_simplification
Transformers
Safetensors
Generated from Trainer
unsloth
dpo
trl
arxiv:2305.18290
dpo_simplification/README.md (branch: main)
Commit History
388e1f5 End of training (verified), IoakeimE committed 9 days ago
798fe44 Model save (verified), IoakeimE committed 9 days ago
4d54a0e Training in progress, epoch 1 (verified), IoakeimE committed 9 days ago
b241f6e End of training (verified), IoakeimE committed on Sep 30, 2025
5cfe3ed Model save (verified), IoakeimE committed on Sep 30, 2025
0e6dac4 Training in progress, epoch 1 (verified), IoakeimE committed on Sep 24, 2025
6deaeb5 End of training (verified), IoakeimE committed on Jun 24, 2025
349519c Model save (verified), IoakeimE committed on Jun 24, 2025