CodeGoat24 commited on
Commit
495580d
·
verified ·
1 Parent(s): dcb4314

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +41 -0
README.md CHANGED
@@ -43,6 +43,47 @@ For further details, please refer to the following resources:
43
  | [VideoReward](https://github.com/KwaiVGI/VideoAlign) | Point | | |√ ||
44
  | UnifiedReward (Ours) | Pair/Point | √ | √ |√|√|
45
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
46
 
47
  ### Quick Start
48
  All pair rank and point score inference codes are provided in our [github](https://github.com/CodeGoat24/UnifiedReward).
 
43
  | [VideoReward](https://github.com/KwaiVGI/VideoAlign) | Point | | |√ ||
44
  | UnifiedReward (Ours) | Pair/Point | √ | √ |√|√|
45
 
46
+ **VLRewardBench** Comparison Results
47
+
48
+ | Models | General | Hallu. | Reason. | Overall Accuracy | Macro Accuracy |
49
+ |----------------------|---------|--------|---------|------------------|---------------|
50
+ | Gemini-1.5-Pro | 50.8 | 72.5 | 64.2 | 67.2 | 62.5 |
51
+ | GPT-4o | 49.1 | 67.6 | **70.5** | 65.8 | 62.4 |
52
+ | LLaVA-Critic | 47.4 | 38.5 | 53.8 | 46.9 | 46.6 |
53
+ | OV-7B | 32.2 | 20.1 | 57.1 | 29.6 | 36.5 |
54
+ | **UnifiedReward** | 60.6 | 78.4 | 60.5 | 66.1 | 66.5 |
55
+ | [**UnifiedReward-v1.5**](https://huggingface.co/CodeGoat24/UnifiedReward-7b-v1.5) | **68.1** | **84.4** | 59.5 | **70.1** | **70.7** |
56
+
57
+
58
+
59
+
60
+ **GenAI-Bench(Image)** Comparison Results
61
+
62
+ | Method | GenAI-Bench | |
63
+ |------------------|------------|--------|
64
+ | | tau | diff |
65
+ | PickScore | 53.2 | 67.2 |
66
+ | HPSv2 | 51.6 | 68.4 |
67
+ | ImageReward | 47.8 | 65.0 |
68
+ | VisionReward | 46.8 | 66.4 |
69
+ | OV-7B | 39.7 | 53.2 |
70
+ | **UnifiedReward** | 54.8 | 70.9 |
71
+ | [**UnifiedReward-v1.5**](https://huggingface.co/CodeGoat24/UnifiedReward-7b-v1.5) | **58.9** | **72.4** |
72
+
73
+
74
+
75
+ **GenAI-Bench(Video)** and **VideoGen-Reward** Comparison Results
76
+
77
+ | Method | GenAI-Bench | | VideoGen-Reward | |
78
+ |------------------|------------|--------|-----------------|--------|
79
+ | | tau | diff | tau | diff |
80
+ | VideoScore | 46.2 | 70.6 | 42.1 | 49.9 |
81
+ | LiFT | 41.2 | 60.1 | 40.6 | 58.3 |
82
+ | VisionReward | 52.1 | 73.1 | 57.4 | 68.2 |
83
+ | VideoReward | 50.2 | 73.3 | 60.1 | 73.9 |
84
+ | OV-7B | 40.8 | 51.4 | 40.4 | 50.2 |
85
+ | **UnifiedReward** | 60.7 | 77.2 | 66.6 | 79.3 |
86
+ | [**UnifiedReward-v1.5**](https://huggingface.co/CodeGoat24/UnifiedReward-7b-v1.5) | **61.7** | **78.5** | **67.0** | **80.5** |
87
 
88
  ### Quick Start
89
  All pair rank and point score inference codes are provided in our [github](https://github.com/CodeGoat24/UnifiedReward).