RLHF_PPOppo_model / README.md
---
license: mit
pipeline_tag: reinforcement-learning
tags:
  - code
---