CodeGoat24 commited on
Commit
a5af81e
·
verified ·
1 Parent(s): 449eca7

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +47 -0
README.md ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model:
4
+ - CodeGoat24/UnifiedReward-2.0-qwen35-9b
5
+ ---
6
+
7
+ ## Model Summary
8
+
9
+ `UnifiedReward-Think-qwen35-9b` is the first unified multimodal CoT reward model, capable of multi-dimensional, step-by-step long-chain reasoning for both visual understanding and generation reward tasks.
10
+
11
+ For further details, please refer to the following resources:
12
+ - 📰 Paper: https://arxiv.org/pdf/2505.03318
13
+ - 🪐 Project Page: https://codegoat24.github.io/UnifiedReward/think
14
+ - 🤗 Model Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-models-67c3008148c3a380d15ac63a
15
+ - 🤗 Dataset Collections: https://huggingface.co/collections/CodeGoat24/unifiedreward-training-data-67c300d4fd5eff00fa7f1ede
16
+ - 👋 Point of Contact: [Yibin Wang](https://codegoat24.github.io)
17
+
18
+ ## vLLM Server Deployment
19
+
20
+ ```
21
+ export VLLM_DISABLE_FLASHINFER_GDN_PREFILL=1
22
+ export TOKENIZERS_PARALLELISM=false
23
+ vllm serve CodeGoat24/UnifiedReward-Think-qwen35-9b \
24
+ --host localhost \
25
+ --port 8080 \
26
+ --trust-remote-code \
27
+ --served-model-name UnifiedReward \
28
+ --gpu-memory-utilization 0.95 \
29
+ --mm-encoder-tp-mode data \
30
+ --mm-processor-cache-type shm \
31
+ --enable-prefix-caching \
32
+ --tensor-parallel-size 8 \
33
+ --default-chat-template-kwargs '{"enable_thinking": false}'
34
+ ```
35
+
36
+ The inference code is provided [here](https://github.com/CodeGoat24/UnifiedReward/tree/main/UnifiedReward-Think/inference_qwen/UnifiedReward-Think-qwen3-inference).
37
+
38
+ ## Citation
39
+
40
+ ```
41
+ @article{unifiedreward-think,
42
+ title={Unified multimodal chain-of-thought reward model through reinforcement fine-tuning},
43
+ author={Wang, Yibin and Li, Zhimin and Zang, Yuhang and Wang, Chunyu and Lu, Qinglin and Jin, Cheng and Wang, Jiaqi},
44
+ journal={arXiv preprint arXiv:2505.03318},
45
+ year={2025}
46
+ }
47
+ ```