Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Artanic30 's Collections
DA-DPO
NoisyGRPO

DA-DPO

updated 14 days ago

This is the collection for the TMLR 25 paper DA-DPO. Project Page: https://artanic30.github.io/project_pages/DA-DPO/

Upvote
1

  • DA-DPO: Cost-efficient Difficulty-aware Preference Optimization for Reducing MLLM Hallucinations

    Paper • 2601.00623 • Published Jan 2

  • Artanic30/DA-DPO_llava_v1.5_7B

    Reinforcement Learning • Updated 14 days ago • 13

  • Artanic30/DA-DPO_llava_v1.5_13B

    Reinforcement Learning • Updated 14 days ago
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs