Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
usmanxia
/
llama_3_1_8B_Pak_DPO
like
0
Text Generation
Transformers
Safetensors
llama
llama-factory
full
Generated from Trainer
conversational
text-generation-inference
License:
other
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
llama_3_1_8B_Pak_DPO
16.1 GB
1 contributor
History:
2 commits
usmanxia
Upload folder using huggingface_hub
11bfc31
verified
24 days ago
.gitattributes
1.57 kB
Upload folder using huggingface_hub
24 days ago
README.md
1.86 kB
Upload folder using huggingface_hub
24 days ago
all_results.json
712 Bytes
Upload folder using huggingface_hub
24 days ago
chat_template.jinja
443 Bytes
Upload folder using huggingface_hub
24 days ago
config.json
870 Bytes
Upload folder using huggingface_hub
24 days ago
eval_results.json
531 Bytes
Upload folder using huggingface_hub
24 days ago
generation_config.json
232 Bytes
Upload folder using huggingface_hub
24 days ago
llama3_1_8B_dpo_full.yaml
1.38 kB
Upload folder using huggingface_hub
24 days ago
model.safetensors
16.1 GB
xet
Upload folder using huggingface_hub
24 days ago
tokenizer.json
17.2 MB
xet
Upload folder using huggingface_hub
24 days ago
tokenizer_config.json
479 Bytes
Upload folder using huggingface_hub
24 days ago
train_results.json
201 Bytes
Upload folder using huggingface_hub
24 days ago
trainer_log.jsonl
2.47 kB
Upload folder using huggingface_hub
24 days ago
trainer_state.json
6.4 kB
Upload folder using huggingface_hub
24 days ago
training_args.bin
7.7 kB
xet
Upload folder using huggingface_hub
24 days ago
training_loss.png
34.1 kB
Upload folder using huggingface_hub
24 days ago
training_rewards_accuracies.png
34.7 kB
Upload folder using huggingface_hub
24 days ago