GuangzhiWang committed · Commit dd16b36 · verified · 1 Parent(s): 349f238

Update README.md

Files changed (1): README.md (+8 -8)
@@ -93,14 +93,14 @@ MRE-T1 achieves state-of-the-art single-model performance on the [BRIGHT benchma
 
  | Task | MRE-T1 |
  |------|--------|
- | Biology | 74.2 |
- | Earth Science | 72.2 |
- | Economics | 57.3 |
- | Psychology | 71.3 |
- | Robotics | 51.6 |
- | StackOverflow | 51.4 |
- | Sustainable Living | 66.2 |
- | Pony | 33.9 |
+ | Biology | 46.5 |
+ | Earth Science | 46 |
+ | Economics | 34.5 |
+ | Psychology | 52.7 |
+ | Robotics | 27.7 |
+ | StackOverflow | 22.2 |
+ | Sustainable Living | 45.2 |
+ | Pony | 6.3 |
  | **Average** | **35.1** |
 
  ### Comparison with Other Models (Short, Single Model Only)
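
Review note: the per-task rows in this hunk can be checked against the unchanged **Average** row with a little arithmetic. The sketch below assumes the average is the unweighted mean of the eight task rows shown in the hunk; the README excerpt does not state how it is computed, and the variable names are illustrative only.

```python
# Per-task scores copied from the diff hunk above.
# Assumption: the "Average" row is the unweighted mean of these eight rows.
old_scores = {
    "Biology": 74.2, "Earth Science": 72.2, "Economics": 57.3,
    "Psychology": 71.3, "Robotics": 51.6, "StackOverflow": 51.4,
    "Sustainable Living": 66.2, "Pony": 33.9,
}
new_scores = {
    "Biology": 46.5, "Earth Science": 46.0, "Economics": 34.5,
    "Psychology": 52.7, "Robotics": 27.7, "StackOverflow": 22.2,
    "Sustainable Living": 45.2, "Pony": 6.3,
}
reported_average = 35.1  # value of the unchanged Average row


def mean(scores):
    """Unweighted mean of the per-task scores."""
    return sum(scores.values()) / len(scores)


old_mean = mean(old_scores)
new_mean = mean(new_scores)

print(f"mean of removed rows: {old_mean:.1f}")  # 59.8
print(f"mean of added rows:   {new_mean:.1f}")  # 35.1
```

Under that assumption, the added rows reproduce the reported 35.1, while the removed rows average about 59.8 and so were inconsistent with the Average row.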