Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
raniero
/
test-dpo-3
like
0
Safetensors
Model card
Files
Files and versions
xet
Community
main
test-dpo-3
/
README.md
raniero
submission (DPO)
20bea6a
verified
5 months ago
preview
code
|
raw
Copy download link
history
blame
contribute
delete
198 Bytes
DPO Submission
task_id
: test-dpo-3
base_model
: mistralai/Mistral-7B-Instruct-v0.2
SHA256
: 5c12417e0e51165ea2491e6b6c7f6f26f9930df72bd2f208a70c509e8d1d24e4
Tags
: LoRA, DPO