Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Sean13
/
llama-8b-instruct-rdpo-full-multipref-0.90

Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
trl
em-dpo
conversational
text-generation-inference
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-8b-instruct-rdpo-full-multipref-0.90 / runs
396 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
Sean13's picture
Sean13
Training in progress, step 229
9299e47 verified 5 months ago
  • Nov20_20-46-47_is-db4bnmjuehm3cygl-devmachine-0
    Training in progress, step 229 5 months ago
  • Nov21_01-10-31_is-db4bnmjuehm3cygl-devmachine-0
    Training in progress, step 229 5 months ago