Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aakritil
/
content
like
0
Transformers
Safetensors
Generated from Trainer
trl
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
content
/
tokenizer
4.12 MB
1 contributor
History:
1 commit
aakritil
aakritil/llama2b_reg_dpo_trainer
2bb6f58
verified
about 1 year ago
special_tokens_map.json
437 Bytes
aakritil/llama2b_reg_dpo_trainer
about 1 year ago
tokenizer.json
3.62 MB
aakritil/llama2b_reg_dpo_trainer
about 1 year ago
tokenizer.model
500 kB
xet
aakritil/llama2b_reg_dpo_trainer
about 1 year ago
tokenizer_config.json
948 Bytes
aakritil/llama2b_reg_dpo_trainer
about 1 year ago