Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Ba2han
/
lfm2-dpo
like
0
Transformers
Safetensors
Generated from Trainer
trl
unsloth
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
lfm2-dpo
159 MB
1 contributor
History:
3 commits
Ba2han
Model save
8d4ca6e
verified
17 days ago
ref
Model save
17 days ago
.gitattributes
Safe
1.52 kB
initial commit
17 days ago
README.md
2.52 kB
Model save
17 days ago
adapter_config.json
1.06 kB
Training in progress, epoch 1
17 days ago
adapter_model.safetensors
103 MB
xet
Training in progress, epoch 1
17 days ago
chat_template.jinja
298 Bytes
Training in progress, epoch 1
17 days ago
tokenizer.json
Safe
4.73 MB
Training in progress, epoch 1
17 days ago
tokenizer_config.json
Safe
517 Bytes
Training in progress, epoch 1
17 days ago
training_args.bin
5.84 kB
xet
Training in progress, epoch 1
17 days ago