File size: 3,215 Bytes
95e7712
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
---
frameworks:
- Pytorch
license: apache-2.0
tags: []
tasks:
- text-to-image-synthesis
---

# 图像质量评估模型


本仓库包含了接入 [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio) 的多款主流图像质量评估模型权重。支持图文语义对齐、人类视觉偏好、纯图像美学以及数据集分布等多个维度的评测。

## 评估效果效果展示

> **prompt**: A cat is sitting on a stone.

| 评估指标 | ![](./assets/image1.jpg) | ![](./assets/image2.jpg) |
|:---:|:---:|:---:|
| Pickscore | 22.958 | **23.321** |
| ImageReward | 1.419 | **1.786** |
| HPSv2 | 30.169 | **30.528** |
| HPSv3 | 12.287 | **12.969** |
| CLIP Score | **40.271** | 39.065 |
| Aesthetic | 5.096 | **5.848** |
| UnifiedReward | 'alignment': 4.5, 'coherence': 4.0, 'style': 3.5 | 'alignment': 4.0, 'coherence': 4.0, 'style': 3.5 |

---

## 指标总览

|指标名称|输入要求|输出结果|
|-|-|-|
|PickScore|prompt + PIL 图像|人类视觉偏好分数|
|ImageReward|prompt + PIL 图像|人类视觉偏好分数|
|HPSv2|prompt + PIL 图像|人类视觉偏好分数|
|HPSv3|prompt + PIL 图像|人类视觉偏好分数|
|CLIP Score|prompt + PIL 图像|图文相似度|
|Aesthetic|PIL 图像|美学分数|
|UnifiedReward|prompt + PIL 图像|多维评分|
|FID|参考图目录 + 生成图目录|分布距离|


## 快速使用

* 安装 [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio)

```
git clone https://github.com/modelscope/DiffSynth-Studio.git
cd DiffSynth-Studio
pip install -e .
```

* 示例 1:使用 PickScore 评估图文偏好

```python
from diffsynth.metrics import PickScoreMetric, ModelConfig
from modelscope import dataset_snapshot_download
from PIL import Image

dataset_snapshot_download(
    "DiffSynth-Studio/diffsynth_example_dataset",
    allow_file_pattern="flux/FLUX.1-dev/*",
    local_dir="./data/diffsynth_example_dataset",
)
image = Image.open("data/diffsynth_example_dataset/flux/FLUX.1-dev/1.jpg").convert("RGB")
prompt = "a dog"
metric = PickScoreMetric.from_pretrained(
    model_config=ModelConfig(model_id="DiffSynth-Studio/ImageMetrics", origin_file_pattern="PickScore/model.safetensors"),
    device="cuda"
)
score = metric.compute(prompt, image)[0]
print(f"PickScore score:: {score:.3f}")
```

* 示例 2:使用 Aesthetic 评估纯美学质量

```python
from diffsynth.metrics import AestheticMetric, ModelConfig
from modelscope import dataset_snapshot_download
from PIL import Image

dataset_snapshot_download(
    "DiffSynth-Studio/diffsynth_example_dataset",
    allow_file_pattern="flux/FLUX.1-dev/*",
    local_dir="./data/diffsynth_example_dataset",
)
image = Image.open("data/diffsynth_example_dataset/flux/FLUX.1-dev/1.jpg").convert("RGB")
metric = AestheticMetric.from_pretrained(
    model_config=ModelConfig(model_id="DiffSynth-Studio/ImageMetrics", origin_file_pattern="Aesthetic/model.safetensors"),
    device="cuda"
)
score = metric.compute(image)[0]
print(f"Aesthetic score: {score:.3f}")
```

关于所有指标的详细使用方法和说明,请参考 DiffSynth-Studio 的相关[文档](https://github.com/modelscope/DiffSynth-Studio/blob/main/docs/zh/Model_Details/Image-Quality-Metrics.md)。