RedMist137/DPO-Zephyr-7B
Tags: Safetensors, mistral, trl, dpo, Generated from Trainer
RedMist137 committed on Mar 21, 2025
Commit d616d7a · verified · 1 Parent(s): 9bc22ad
Training in progress, step 200
Files changed (0)