underactuated/mistral_dpo_iter1

Transformers
Safetensors
Generated from Trainer
trl
dpo
mistral_dpo_iter1 / reference
168 MB
  • 1 contributor
History: 33 commits
Latest commit by underactuated: End of training (47825b7, verified), 11 months ago
  • adapter_config.json
    799 Bytes
    End of training 11 months ago
  • adapter_model.safetensors
    168 MB
    End of training 11 months ago