rlhf-qa-ppo / config.json

Commit History

Step 3 of 3; First attempt at a PPO fine-tuned model.
4ba6577

kastan commited on