achiru
/

DiffSynth-Studio-ImageMetrics

Model card Files Files and versions

DiffSynth-Studio-ImageMetrics / README.md

achiru's picture

Upload folder using huggingface_hub

95e7712 verified 30 days ago

|

History Blame Contribute Delete

3.22 kB

	---
	frameworks:
	- Pytorch
	license: apache-2.0
	tags: []
	tasks:
	- text-to-image-synthesis
	---

	# 图像质量评估模型


	本仓库包含了接入 [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio) 的多款主流图像质量评估模型权重。支持图文语义对齐、人类视觉偏好、纯图像美学以及数据集分布等多个维度的评测。

	## 评估效果效果展示

	> prompt: A cat is sitting on a stone.

	\| 评估指标 \| ![](./assets/image1.jpg) \| ![](./assets/image2.jpg) \|
	\|:---:\|:---:\|:---:\|
	\| Pickscore \| 22.958 \| 23.321 \|
	\| ImageReward \| 1.419 \| 1.786 \|
	\| HPSv2 \| 30.169 \| 30.528 \|
	\| HPSv3 \| 12.287 \| 12.969 \|
	\| CLIP Score \| 40.271 \| 39.065 \|
	\| Aesthetic \| 5.096 \| 5.848 \|
	\| UnifiedReward \| 'alignment': 4.5, 'coherence': 4.0, 'style': 3.5 \| 'alignment': 4.0, 'coherence': 4.0, 'style': 3.5 \|

	---

	## 指标总览

	\|指标名称\|输入要求\|输出结果\|
	\|-\|-\|-\|
	\|PickScore\|prompt + PIL 图像\|人类视觉偏好分数\|
	\|ImageReward\|prompt + PIL 图像\|人类视觉偏好分数\|
	\|HPSv2\|prompt + PIL 图像\|人类视觉偏好分数\|
	\|HPSv3\|prompt + PIL 图像\|人类视觉偏好分数\|
	\|CLIP Score\|prompt + PIL 图像\|图文相似度\|
	\|Aesthetic\|PIL 图像\|美学分数\|
	\|UnifiedReward\|prompt + PIL 图像\|多维评分\|
	\|FID\|参考图目录 + 生成图目录\|分布距离\|


	## 快速使用

	* 安装 [DiffSynth-Studio](https://github.com/modelscope/DiffSynth-Studio)

	```
	git clone https://github.com/modelscope/DiffSynth-Studio.git
	cd DiffSynth-Studio
	pip install -e .
	```

	* 示例 1：使用 PickScore 评估图文偏好

	```python
	from diffsynth.metrics import PickScoreMetric, ModelConfig
	from modelscope import dataset_snapshot_download
	from PIL import Image

	dataset_snapshot_download(
	"DiffSynth-Studio/diffsynth_example_dataset",
	allow_file_pattern="flux/FLUX.1-dev/*",
	local_dir="./data/diffsynth_example_dataset",
	)
	image = Image.open("data/diffsynth_example_dataset/flux/FLUX.1-dev/1.jpg").convert("RGB")
	prompt = "a dog"
	metric = PickScoreMetric.from_pretrained(
	model_config=ModelConfig(model_id="DiffSynth-Studio/ImageMetrics", origin_file_pattern="PickScore/model.safetensors"),
	device="cuda"
	)
	score = metric.compute(prompt, image)[0]
	print(f"PickScore score:: {score:.3f}")
	```

	* 示例 2：使用 Aesthetic 评估纯美学质量

	```python
	from diffsynth.metrics import AestheticMetric, ModelConfig
	from modelscope import dataset_snapshot_download
	from PIL import Image

	dataset_snapshot_download(
	"DiffSynth-Studio/diffsynth_example_dataset",
	allow_file_pattern="flux/FLUX.1-dev/*",
	local_dir="./data/diffsynth_example_dataset",
	)
	image = Image.open("data/diffsynth_example_dataset/flux/FLUX.1-dev/1.jpg").convert("RGB")
	metric = AestheticMetric.from_pretrained(
	model_config=ModelConfig(model_id="DiffSynth-Studio/ImageMetrics", origin_file_pattern="Aesthetic/model.safetensors"),
	device="cuda"
	)
	score = metric.compute(image)[0]
	print(f"Aesthetic score: {score:.3f}")
	```

	关于所有指标的详细使用方法和说明，请参考 DiffSynth-Studio 的相关[文档](https://github.com/modelscope/DiffSynth-Studio/blob/main/docs/zh/Model_Details/Image-Quality-Metrics.md)。