dfrees/llama-1b-instruct-dpo
PEFT
Safetensors
llama
4-bit precision
bitsandbytes
arxiv:1910.09700
llama-1b-instruct-dpo/training_args.bin (main)
Commit History
Uploading merged DPO-trained model
11e04b0 (verified), dfrees committed on Oct 8, 2024

Uploading merged DPO-trained model
350e24b (verified), dfrees committed on Oct 8, 2024

Uploading merged DPO-trained model
411221a (verified), dfrees committed on Oct 8, 2024