RLHF Models Collection A set of models from my experiments with Reinforcement Learning from Human Feedback • 3 items • Updated 3 days ago