raniero/test-dpo-001
Last commit: a89ee25 by raniero, "submission (DPO)", 4 months ago
DPO Submission
task_id: TEST-DPO-001
base_model: mistralai/Mistral-7B-Instruct-v0.2
SHA256: 246947b68e373780bbbbf620d7e5aa847cce0a279b2d89fe5616f7bb947a8f71
Tags: LoRA, DPO
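
The card lists a SHA256 value but does not say which file it covers. As a minimal sketch, assuming the digest refers to a single downloaded artifact, a local check with Python's standard `hashlib` could look like the following. The helper names `sha256_of_file` and `matches_expected` are illustrative and not part of the submission.

```python
import hashlib
from pathlib import Path

# Digest copied from the card. Which file it covers is not stated there,
# so treating it as the hash of one artifact is an assumption.
EXPECTED_SHA256 = "246947b68e373780bbbbf620d7e5aa847cce0a279b2d89fe5616f7bb947a8f71"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as f:
        # Read fixed-size chunks so large weight files never load fully into memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_expected(path)` on the downloaded file returns `True` only when its digest equals the value on the card. The path to pass in is left open here because the card does not name the hashed file.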