Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Sean13
/
llama-8b-instruct-rdpo-full-multipref

Text Generation
Transformers
TensorBoard
Safetensors
llama
Generated from Trainer
trl
em-dpo
conversational
text-generation-inference
Model card Files Files and versions
xet
Metrics Training metrics Community
llama-8b-instruct-rdpo-full-multipref / runs /Nov16_17-55-53_is-db4bnmjuehm3cygl-devmachine-0
138 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 1 commit
Sean13's picture
Sean13
Training in progress, step 229
cba50ff verified 6 months ago
  • events.out.tfevents.1763287075.is-db4bnmjuehm3cygl-devmachine-0.555384.0
    138 kB
    xet
    Training in progress, step 229 6 months ago