Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
thorirhrafn
/
llama_DPO_model
like
0
PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
License:
llama2
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Use this model
main
llama_DPO_model
/
runs
150 kB
Ctrl+K
Ctrl+K
1 contributor
History:
7 commits
thorirhrafn
Training in progress, epoch 1
bfe5059
verified
almost 2 years ago
Apr28_19-29-42_gpu-4
Training in progress, epoch 1
almost 2 years ago
Apr28_21-08-00_gpu-2
Training in progress, epoch 0
almost 2 years ago
Apr29_12-08-10_gpu-3
Training in progress, epoch 0
almost 2 years ago
Apr29_13-17-35_gpu-2
Training in progress, epoch 0
almost 2 years ago
Apr29_13-49-02_gpu-2
Training in progress, epoch 1
almost 2 years ago