Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

thorirhrafn
/
llama_DPO_model_e1

PEFT
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
Model card Files Files and versions
xet
Metrics Training metrics Community
llama_DPO_model_e1 / runs
142 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 5 commits
thorirhrafn's picture
thorirhrafn
Training in progress, epoch 0
2a9f73c verified almost 2 years ago
  • May12_15-21-59_gpu-3
    Training in progress, epoch 0 almost 2 years ago
  • May12_16-33-50_gpu-3
    Training in progress, epoch 1 almost 2 years ago
  • May12_22-42-05_gpu-3
    Training in progress, epoch 0 almost 2 years ago
  • May13_10-41-17_gpu-2
    Training in progress, epoch 0 almost 2 years ago