Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
ducut91
/
rl_banmal_dpo
like
0
Text Generation
Transformers
Safetensors
llama
llama-factory
conversational
text-generation-inference
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
rl_banmal_dpo
4.62 GB
1 contributor
History:
3 commits
ducut91
Upload tokenizer
372611b
verified
2 months ago
.gitattributes
1.52 kB
initial commit
2 months ago
README.md
5.19 kB
Upload LlamaForCausalLM
2 months ago
chat_template.jinja
5.9 kB
Upload tokenizer
2 months ago
config.json
740 Bytes
Upload LlamaForCausalLM
2 months ago
generation_config.json
156 Bytes
Upload LlamaForCausalLM
2 months ago
model.safetensors
4.61 GB
xet
Upload LlamaForCausalLM
2 months ago
special_tokens_map.json
630 Bytes
Upload tokenizer
2 months ago
tokenizer.json
10.4 MB
Upload tokenizer
2 months ago
tokenizer_config.json
15.7 kB
Upload tokenizer
2 months ago