Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization
Paper
• 2404.00530 • Published
Paper: Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization