acwkim/ppo-harmless

Reinforcement Learning
Transformers
PyTorch
Safetensors
trl
ppo
Files and versions
ppo-harmless
135 MB
  • 1 contributor
History: 6 commits
acwkim
Update adapter_config.json
0e1ad83 verified 10 days ago
  • .gitattributes
    1.52 kB
    initial commit 14 days ago
  • README.md
    1.28 kB
    Upload folder using huggingface_hub 14 days ago
  • adapter_config.json
    659 Bytes
    Update adapter_config.json 10 days ago
  • adapter_model.safetensors
    134 MB
    xet
    Upload folder using huggingface_hub 14 days ago
  • added_tokens.json
    21 Bytes
    Upload folder using huggingface_hub 14 days ago
  • config.json
    1.3 kB
    Upload folder using huggingface_hub 14 days ago
  • pytorch_model.bin
    17.9 kB
    xet
    Upload folder using huggingface_hub 14 days ago
  • special_tokens_map.json
    552 Bytes
    Upload folder using huggingface_hub 14 days ago
  • tokenizer.model
    500 kB
    xet
    Upload folder using huggingface_hub 14 days ago
  • tokenizer_config.json
    1.16 kB
    Upload folder using huggingface_hub 14 days ago