Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

zera09
/
llama-dpo

Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
dpo
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-dpo
41 MB
  • 1 contributor
History: 2 commits
zera09's picture
zera09
End of training
cf9e88e verified 9 months ago
  • runs
    End of training 9 months ago
  • .gitattributes
    1.57 kB
    End of training 9 months ago
  • README.md
    2.64 kB
    End of training 9 months ago
  • adapter_config.json
    912 Bytes
    End of training 9 months ago
  • adapter_model.safetensors
    23.6 MB
    xet
    End of training 9 months ago
  • chat_template.json
    5.09 kB
    End of training 9 months ago
  • preprocessor_config.json
    477 Bytes
    End of training 9 months ago
  • special_tokens_map.json
    454 Bytes
    End of training 9 months ago
  • tokenizer.json
    17.2 MB
    xet
    End of training 9 months ago
  • tokenizer_config.json
    55.8 kB
    End of training 9 months ago
  • training_args.bin
    6.2 kB
    xet
    End of training 9 months ago