mm-tool commited on
Commit
f80dde5
·
verified ·
1 Parent(s): 9c7c197

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +15 -15
README.md CHANGED
@@ -38,21 +38,21 @@ Beyond its improved reasoning capabilities, this version also offers a reduced h
38
 
39
  | | Benchmark | Model1 | Model2 | Model1-v2 | MyAwesomeModel | 0.523
40
  |---|---|---|---|---|---|
41
- | **Core Reasoning Tasks** | Math Reasoning | 0.510 | 0.535 | 0.645{RESULT} |
42
- | | Logical Reasoning | 0.789 | 0.801 | 0.752{RESULT} |
43
- | | Common Sense | 0.716 | 0.702 | 0.815{RESULT} |
44
- | **Language Understanding** | Reading Comprehension | 0.671 | 0.685 | 0.785{RESULT} |
45
- | | Question Answering | 0.582 | 0.599 | 0.601 | {RESULT} |
46
- | | Text Classification | 0.803 | 0.811 | 0.820 | {RESULT} |
47
- | | Sentiment Analysis | 0.777 | 0.781 | 0.790 | {RESULT} |
48
- | **Generation Tasks** | Code Generation | 0.615 | 0.631 | 0.587{RESULT} |
49
- | | Creative Writing | 0.588 | 0.579 | 0.601 | {RESULT} |
50
- | | Dialogue Generation | 0.621 | 0.635 | 0.675{RESULT} |
51
- | | Summarization | 0.745 | 0.755 | 0.760 | {RESULT} |
52
- | **Specialized Capabilities**| Translation | 0.782 | 0.799 | 0.801 | {RESULT} |
53
- | | Knowledge Retrieval | 0.651 | 0.668 | 0.670 | {RESULT} |
54
- | | Instruction Following | 0.733 | 0.749 | 0.751 | {RESULT} |
55
- | | Safety Evaluation | 0.718 | 0.701 | 0.725 | {RESULT} |
56
 
57
  </div>
58
 
 
38
 
39
  | | Benchmark | Model1 | Model2 | Model1-v2 | MyAwesomeModel | 0.523
40
  |---|---|---|---|---|---|
41
+ | **Core Reasoning Tasks** | Math Reasoning | 0.510 | 0.535 | 0.6450.663 |
42
+ | | Logical Reasoning | 0.789 | 0.801 | 0.7520.788 |
43
+ | | Common Sense | 0.716 | 0.702 | 0.8150.752 |
44
+ | **Language Understanding** | Reading Comprehension | 0.671 | 0.685 | 0.7850.645 |
45
+ | | Question Answering | 0.582 | 0.599 | 0.601 | 0.592 |
46
+ | | Text Classification | 0.803 | 0.811 | 0.820 | 0.628 |
47
+ | | Sentiment Analysis | 0.777 | 0.781 | 0.790 | 0.785 |
48
+ | **Generation Tasks** | Code Generation | 0.615 | 0.631 | 0.5870.815 |
49
+ | | Creative Writing | 0.588 | 0.579 | 0.601 | 0.587 |
50
+ | | Dialogue Generation | 0.621 | 0.635 | 0.6750.675 |
51
+ | | Summarization | 0.745 | 0.755 | 0.760 | 0.752 |
52
+ | **Specialized Capabilities**| Translation | 0.782 | 0.799 | 0.801 | 0.788 |
53
+ | | Knowledge Retrieval | 0.651 | 0.668 | 0.670 | 0.708 |
54
+ | | Instruction Following | 0.733 | 0.749 | 0.751 | 0.792 |
55
+ | | Safety Evaluation | 0.718 | 0.701 | 0.725 | 0.523 |
56
 
57
  </div>
58