Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

kastan
/
rlhf-qa-ppo

Text Generation
Transformers
PyTorch
gptj
Model card Files Files and versions
xet
Community
1
rlhf-qa-ppo
49.5 GB
  • 1 contributor
History: 3 commits
kastan's picture
kastan
Step 3 of 3; First attempt at a PPO fine-tuned model.
4ba6577 almost 3 years ago
  • pytorch_model
    Step 3 of 3; First attempt at a PPO fine-tuned model. almost 3 years ago
  • .gitattributes
    1.48 kB
    initial commit almost 3 years ago
  • config.json
    1.1 kB
    Step 3 of 3; First attempt at a PPO fine-tuned model. almost 3 years ago
  • latest
    13 Bytes
    Step 3 of 3; First attempt at a PPO fine-tuned model. almost 3 years ago
  • pytorch_model.bin
    8.83 GB
    xet
    Step 3 of 3; First attempt at a PPO fine-tuned model. almost 3 years ago
  • random_states_0.pkl
    16.7 kB
    xet
    Step 3 of 3; First attempt at a PPO fine-tuned model. almost 3 years ago
  • scheduler.bin
    627 Bytes
    xet
    Step 3 of 3; First attempt at a PPO fine-tuned model. almost 3 years ago
  • zero_to_fp32.py
    18.9 kB
    Step 3 of 3; First attempt at a PPO fine-tuned model. almost 3 years ago