A Comparative Analysis between RLHF PPO and DPO

updated 23 days ago

This collection contains the trained models for the first assignment of the course CS60216: Safety Fundamentals for Generative AI.