Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Ram07
/
mistral-dpo

Text Generation
TensorBoard
Safetensors
trl
dpo
Generated from Trainer
conversational
Model card Files Files and versions
xet
Metrics Training metrics Community
mistral-dpo
16 MB
  • 1 contributor
History: 3 commits
Ram07's picture
Ram07
Update README.md
c28cf39 verified almost 2 years ago
  • runs
    Ram07/mistral-dpo almost 2 years ago
  • .gitattributes
    1.52 kB
    initial commit almost 2 years ago
  • README.md
    2.85 kB
    Update README.md almost 2 years ago
  • adapter_config.json
    587 Bytes
    Ram07/mistral-dpo almost 2 years ago
  • adapter_model.safetensors
    13.6 MB
    xet
    Ram07/mistral-dpo almost 2 years ago
  • added_tokens.json
    51 Bytes
    Ram07/mistral-dpo almost 2 years ago
  • special_tokens_map.json
    630 Bytes
    Ram07/mistral-dpo almost 2 years ago
  • tokenizer.json
    1.8 MB
    Ram07/mistral-dpo almost 2 years ago
  • tokenizer.model
    493 kB
    xet
    Ram07/mistral-dpo almost 2 years ago
  • tokenizer_config.json
    1.42 kB
    Ram07/mistral-dpo almost 2 years ago
  • training_args.bin
    4.09 kB
    xet
    Ram07/mistral-dpo almost 2 years ago