A Comparative Analysis between RLHF PPO and DPO
Collection
This collection contains the relevant trained models for the first assignment of the course CS60216: Safety Fundamentals for Generative AI. • 10 items • Updated
Base model
openai-community/gpt2-medium