psp-dada's Collections
Uni-DPO

Updated 6 days ago

[ICLR 2026] Official repository of "Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs". Repo: https://github.com/pspdada/Uni-DPO

  • Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

    Paper • 2506.10054 • Published Jun 11, 2025 • 3

  • psp-dada/Uni-DPO

    Preview • Updated about 9 hours ago • 28 • 1

  • psp-dada/Qwen2.5-7B-Uni-DPO

    Text Generation • 8B • Updated about 9 hours ago • 14 • 1

  • psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-GPT-4o

    Text Generation • 8B • Updated about 9 hours ago • 7 • 1

  • psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-ArmoRM

    Text Generation • 8B • Updated about 9 hours ago • 28 • 1

  • psp-dada/Llama-3-8B-Base-SFT-Uni-DPO

    Text Generation • 8B • Updated about 9 hours ago • 11 • 1

  • psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-Qwen

    Text Generation • 8B • Updated about 9 hours ago • 29 • 1

  • psp-dada/Gemma2-9B-IT-Uni-DPO

    Text Generation • 9B • Updated about 9 hours ago • 13 • 1

  • psp-dada/Llama-3-8B-Base-SFT-Uni-DPO-v2-GPT-4

    Text Generation • 8B • Updated about 9 hours ago • 15 • 1

  • psp-dada/Llama-3-8B-Instruct-Uni-DPO

    Text Generation • 8B • Updated about 9 hours ago • 15 • 1

  • psp-dada/Qwen2.5-Math-7B-Uni-DPO

    Text Generation • 8B • Updated about 9 hours ago • 17 • 1