A Comparative Analysis between RLHF PPO and DPO Collection This collection contains the relevant trained models for the first assignment of the course CS60216: Safety Fundamentals for Generative AI. • 10 items • Updated 9 days ago