Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
raniero
/
live-test-dpo-001
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
live-test-dpo-001
/
README.md
raniero
submission (DPO)
aced3a0
verified
8 months ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
205 Bytes
DPO Submission
task_id
: live-test-dpo-001
base_model
: mistralai/Mistral-7B-Instruct-v0.2
SHA256
: 40c6e8635e4559c6998509c4094ce5ea917dc337d5cc9819b14ebd173c19073f
Tags
: LoRA, DPO