ceselder/cot-oracle-dpo-final-attempt-2
Tags: Text Generation, PEFT, Safetensors, Transformers, lora, conversational, arxiv:1910.09700
Branch: main
cot-oracle-dpo-final-attempt-2 (1.06 GB)
1 contributor. History: 2 commits
Latest commit: a0c070d (verified) by ceselder, about 1 month ago: DPO step 400 from final-sprint base (lucky-glade-36)
| Name | Scan | Size | Storage | Last commit | Updated |
|---|---|---|---|---|---|
| policy | | | | DPO step 400 from final-sprint base (lucky-glade-36) | about 1 month ago |
| reference | | | | DPO step 400 from final-sprint base (lucky-glade-36) | about 1 month ago |
| .gitattributes | Safe | 1.57 kB | | DPO step 400 from final-sprint base (lucky-glade-36) | about 1 month ago |
| README.md | Safe | 5.18 kB | | DPO step 400 from final-sprint base (lucky-glade-36) | about 1 month ago |
| chat_template.jinja | Safe | 4.17 kB | | DPO step 400 from final-sprint base (lucky-glade-36) | about 1 month ago |
| tokenizer.json | Safe | 11.4 MB | xet | DPO step 400 from final-sprint base (lucky-glade-36) | about 1 month ago |
| tokenizer_config.json | Safe | 665 Bytes | | DPO step 400 from final-sprint base (lucky-glade-36) | about 1 month ago |