---
license: mit
base_model:
- CodeGoat24/UnifiedReward-2.0-qwen35-9b
---

# Model Summary

**UnifiedReward-Edit-qwen35-9b** is a unified reward model for **both Text-to-Image and Image-to-Image generation**!

For the image-editing reward task, our model supports three modes (see the client sketch after this list):

>1. **Pairwise Rank** – directly judge which of two edited images is better.
>
>2. **Pairwise Score** – assign a separate score to each image in a pair.
>
>3. **Pointwise Score** – rate a single image on two axes: instruction-following and overall image quality.

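For a concrete sense of how the three modes differ, here is a minimal client sketch against the vLLM server set up in the deployment section below. It assumes vLLM's OpenAI-compatible chat endpoint and the served model name used there; the instruction strings and image URLs are illustrative placeholders only, not the model's official prompt templates, which are defined in the [`UnifiedReward-Edit/`](https://github.com/CodeGoat24/UnifiedReward/tree/main/UnifiedReward-Edit) inference code.

```python
# Sketch only: the prompt wording below is a placeholder, not the official
# UnifiedReward-Edit template (see the UnifiedReward-Edit inference code).
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8080/v1", api_key="EMPTY")

def image_part(url: str) -> dict:
    # vLLM's OpenAI-compatible server accepts images as image_url content parts.
    return {"type": "image_url", "image_url": {"url": url}}

def judge(instruction: str, image_urls: list[str]) -> str:
    # One user turn: all images first, then the text instruction.
    content = [image_part(u) for u in image_urls] + [{"type": "text", "text": instruction}]
    resp = client.chat.completions.create(
        model="UnifiedReward",  # matches --served-model-name in the section below
        messages=[{"role": "user", "content": content}],
    )
    return resp.choices[0].message.content

# 1. Pairwise Rank: judge which of two edited images is better.
print(judge("Which edited image better follows the edit instruction?",
            ["https://example.com/edit_a.png", "https://example.com/edit_b.png"]))

# 2. Pairwise Score: a separate score for each image in the pair.
print(judge("Score each edited image from 1 to 10 on edit quality.",
            ["https://example.com/edit_a.png", "https://example.com/edit_b.png"]))

# 3. Pointwise Score: rate a single image on instruction-following and overall quality.
print(judge("Rate this edited image from 1 to 10 on instruction-following and on overall quality.",
            ["https://example.com/edit_a.png"]))
```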

The image-editing reward inference code is available in the [`UnifiedReward-Edit/`](https://github.com/CodeGoat24/UnifiedReward/tree/main/UnifiedReward-Edit) directory, while the T2I inference code is unchanged from previous models. The editing training data is preprocessed from [EditScore](https://huggingface.co/datasets/EditScore/EditScore-Reward-Data) and [EditReward](https://huggingface.co/datasets/TIGER-Lab/EditReward-Data). We sincerely appreciate all contributors!

For further details, please refer to the following resources:
- Paper: https://arxiv.org/pdf/2503.05236
- Project Page: https://codegoat24.github.io/UnifiedReward/
- Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
- Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
- Point of Contact: [Yibin Wang](https://codegoat24.github.io)

## vLLM Server Deployment

```bash
vllm serve CodeGoat24/UnifiedReward-Edit-qwen35-9b \
    --host localhost \
    --port 8080 \
    --trust-remote-code \
    --served-model-name UnifiedReward \
    --gpu-memory-utilization 0.95 \
    --mm-encoder-tp-mode data \
    --mm-processor-cache-type shm \
    --enable-prefix-caching \
    --tensor-parallel-size 8 \
    --default-chat-template-kwargs '{"enable_thinking": false}'
```
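
Once the server is up, a quick sanity check is to list the served models over vLLM's OpenAI-compatible `/v1/models` route (a minimal sketch, assuming the host, port, and served model name from the command above):

```python
# List the served models via vLLM's OpenAI-compatible /v1/models route.
import requests

resp = requests.get("http://localhost:8080/v1/models", timeout=10)
resp.raise_for_status()
print([m["id"] for m in resp.json()["data"]])  # expected: ['UnifiedReward']
```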

The inference code is provided [here](https://github.com/CodeGoat24/UnifiedReward/tree/main/UnifiedReward-Edit).

## Citation

```bibtex
@article{unifiedreward,
  title={Unified reward model for multimodal understanding and generation},
  author={Wang, Yibin and Zang, Yuhang and Li, Hao and Jin, Cheng and Wang, Jiaqi},
  journal={arXiv preprint arXiv:2503.05236},
  year={2025}
}
```