Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Aharneish
/
dpo_out
like
0
Transformers
Safetensors
Generated from Trainer
dpo
trl
unsloth
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
dpo_out
43.9 MB
1 contributor
History:
2 commits
Aharneish
Aharneish/finetuned_model-1-dpo
0598b37
verified
about 1 month ago
.gitattributes
1.57 kB
Aharneish/finetuned_model-1-dpo
about 1 month ago
README.md
2.42 kB
Aharneish/finetuned_model-1-dpo
about 1 month ago
adapter_config.json
1.21 kB
Aharneish/finetuned_model-1-dpo
about 1 month ago
adapter_model.safetensors
16 MB
xet
Aharneish/finetuned_model-1-dpo
about 1 month ago
chat_template.jinja
15.1 kB
Aharneish/finetuned_model-1-dpo
about 1 month ago
special_tokens_map.json
446 Bytes
Aharneish/finetuned_model-1-dpo
about 1 month ago
tokenizer.json
27.9 MB
xet
Aharneish/finetuned_model-1-dpo
about 1 month ago
tokenizer_config.json
4.23 kB
Aharneish/finetuned_model-1-dpo
about 1 month ago
training_args.bin
6.8 kB
xet
Aharneish/finetuned_model-1-dpo
about 1 month ago