8y
/

HP

Model card Files Files and versions

xet

Community

Add metadata (license, pipeline tag, library name) to model card

by nielsr HF Staff - opened Jul 28, 2025

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+14

-8

Files changed (1) hide show

README.md +14 -8

README.md CHANGED Viewed

@@ -1,3 +1,9 @@
 # Model Card for HP (High-Preference) Model
 This model is a specialized human preference scoring function that evaluates image quality based purely on visual aesthetics and human preferences, without relying on text-image alignment. See our paper [Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment](https://arxiv.org/abs/2507.19002) for more details.
@@ -12,16 +18,16 @@ The HP (High-Preference) model represents a paradigm shift in image quality eval
 ### Key Features
-- **Image-Only Evaluation**: No text input required, focuses purely on visual quality
-- **Human Preference Aligned**: Trained on preference triplets from [Pick-High datase](https://huggingface.co/datasets/8y/Pick-High-Dataset) and Pick-a-pic dataset
-- **Complementary Design**: Works optimally when combined with [ICT model](https://huggingface.co/8y/ICT) for comprehensive evaluation
 ### Model Sources
-* **Repository:** [https://github.com/BarretBa/ICTHP](https://github.com/BarretBa/ICTHP)
-* **Paper:** [Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment](https://arxiv.org/abs/2507.19002)
-* **Base Model:** CLIP-ViT-H-14 (Image Encoder + MLP Head)
-* **Training Dataset:** [Pick-High datase](https://huggingface.co/datasets/8y/Pick-High-Dataset) and Pick-a-pic dataset (360,000 preference triplets)
 ## How to Get Started with the Model
@@ -102,4 +108,4 @@ This model was trained on 36000 preference triplets from [Pick-High datase](http
       primaryClass={cs.CV},
       url={https://arxiv.org/abs/2507.19002},
 }
-```

+---
+license: mit
+pipeline_tag: image-feature-extraction
+library_name: transformers
+---
 # Model Card for HP (High-Preference) Model
 This model is a specialized human preference scoring function that evaluates image quality based purely on visual aesthetics and human preferences, without relying on text-image alignment. See our paper [Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment](https://arxiv.org/abs/2507.19002) for more details.
 ### Key Features
+-   **Image-Only Evaluation**: No text input required, focuses purely on visual quality
+-   **Human Preference Aligned**: Trained on preference triplets from [Pick-High datase](https://huggingface.co/datasets/8y/Pick-High-Dataset) and Pick-a-pic dataset
+-   **Complementary Design**: Works optimally when combined with [ICT model](https://huggingface.co/8y/ICT) for comprehensive evaluation
 ### Model Sources
+*   **Repository:** [https://github.com/BarretBa/ICTHP](https://github.com/BarretBa/ICTHP)
+*   **Paper:** [Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment](https://arxiv.org/abs/2507.19002)
+*   **Base Model:** CLIP-ViT-H-14 (Image Encoder + MLP Head)
+*   **Training Dataset:** [Pick-High datase](https://huggingface.co/datasets/8y/Pick-High-Dataset) and Pick-a-pic dataset (360,000 preference triplets)
 ## How to Get Started with the Model
       primaryClass={cs.CV},
       url={https://arxiv.org/abs/2507.19002},
 }
+```