Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs
Paper: 2506.10054
[ICLR 2026] Official repository of "Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs". Repo: https://github.com/pspdada/Uni-DPO