Commit 43e1bc2 · verified · committed by SOTAagi2030
1 Parent(s): 1072299

Upload folder using huggingface_hub

Files changed (6)
  1. README.md +60 -0
  2. config.json +4 -0
  3. figures/fig1.png +0 -0
  4. figures/fig2.png +0 -0
  5. figures/fig3.png +0 -0
  6. pytorch_model.bin +3 -0
README.md ADDED
@@ -0,0 +1,60 @@
+ ---
+ license: apache-2.0
+ library_name: transformers
+ ---
+ # AssistantModel
+
+ <!-- markdownlint-disable first-line-h1 -->
+ <!-- markdownlint-disable html -->
+ <!-- markdownlint-disable no-duplicate-header -->
+
+ <div align="center">
+ <img src="figures/fig1.png" width="60%" alt="AssistantModel" />
+ </div>
+ <hr>
+
+ <div align="center" style="line-height: 1;">
+ <a href="LICENSE" style="margin: 2px;">
+ <img alt="License" src="figures/fig2.png" style="display: inline-block; vertical-align: middle;"/>
+ </a>
+ </div>
+
+ ## 1. Introduction
+
+ AssistantModel is designed for interactive assistant applications. This checkpoint was selected for its combined performance on the knowledge retrieval and instruction following benchmarks, which makes it well suited to AI assistant deployment.
+
+ <p align="center">
+ <img width="80%" src="figures/fig3.png">
+ </p>
+
+ ## 2. Evaluation Results
+
+ ### Comprehensive Benchmark Results
+
+ <div align="center">
+
+ | | Benchmark | Assistant-v1 | Assistant-v2 | AssistantModel |
+ |---|---|---|---|---|
+ | **Core Reasoning Tasks** | Math Reasoning | 0.510 | 0.535 | 0.606 |
+ | | Logical Reasoning | 0.789 | 0.801 | 0.871 |
+ | | Common Sense | 0.716 | 0.702 | 0.789 |
+ | **Language Understanding** | Reading Comprehension | 0.671 | 0.685 | 0.759 |
+ | | Question Answering | 0.582 | 0.599 | 0.678 |
+ | | Text Classification | 0.803 | 0.811 | 0.859 |
+ | | Sentiment Analysis | 0.777 | 0.781 | 0.831 |
+ | **Generation Tasks** | Code Generation | 0.615 | 0.631 | 0.679 |
+ | | Creative Writing | 0.588 | 0.579 | 0.634 |
+ | | Dialogue Generation | 0.621 | 0.635 | 0.684 |
+ | | Summarization | 0.745 | 0.755 | 0.800 |
+ | **Specialized Capabilities** | Translation | 0.782 | 0.799 | 0.843 |
+ | | Knowledge Retrieval | 0.651 | 0.668 | 0.752 |
+ | | Instruction Following | 0.733 | 0.749 | 0.835 |
+ | | Safety Evaluation | 0.718 | 0.701 | 0.767 |
+
+ </div>
+
+ ## 3. License
+ [Apache-2.0 License](LICENSE)
+
+ ## 4. Contact
+ Open an issue on GitHub.
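The README introduction ties checkpoint selection to the combined Knowledge Retrieval and Instruction Following results, but it does not say how the two scores are combined. As a rough sketch, assuming an unweighted mean (an assumption, not something the README states) and using hypothetical helper names, the ranking implied by the table can be checked like this:

```python
# Scores copied from the README table rows
# "Knowledge Retrieval" and "Instruction Following".
SCORES = {
    "Assistant-v1": {"knowledge_retrieval": 0.651, "instruction_following": 0.733},
    "Assistant-v2": {"knowledge_retrieval": 0.668, "instruction_following": 0.749},
    "AssistantModel": {"knowledge_retrieval": 0.752, "instruction_following": 0.835},
}


def combined_score(scores):
    """Unweighted mean of the two benchmarks (assumed combination rule)."""
    return (scores["knowledge_retrieval"] + scores["instruction_following"]) / 2


def rank_models(table):
    """Return model names sorted from best to worst combined score."""
    return sorted(table, key=lambda name: combined_score(table[name]), reverse=True)


if __name__ == "__main__":
    for name in rank_models(SCORES):
        print(f"{name}: {combined_score(SCORES[name]):.4f}")
```

Any other combination rule (for example a weighted mean) would need its weights stated explicitly. With these particular numbers AssistantModel leads on both benchmarks, so it would rank first under any positive weighting.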
config.json ADDED
@@ -0,0 +1,4 @@
+ {
+   "model_type": "bert",
+   "architectures": ["BertModel"]
+ }
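The committed config.json sets only `model_type` and `architectures`. The following is a minimal, stdlib-only sketch of checking which common BERT hyperparameters the file leaves unspecified. The key list is an illustrative selection of `BertConfig` parameter names, and the helper name `missing_hyperparameters` is hypothetical:

```python
import json

# The config.json content from this commit, inlined for a self-contained example.
CONFIG_TEXT = '{"model_type": "bert", "architectures": ["BertModel"]}'

# Illustrative subset of BertConfig parameter names (not an exhaustive list).
COMMON_BERT_KEYS = (
    "vocab_size",
    "hidden_size",
    "num_hidden_layers",
    "num_attention_heads",
)


def missing_hyperparameters(config_text, keys=COMMON_BERT_KEYS):
    """Return the keys from `keys` that the JSON config does not define."""
    config = json.loads(config_text)
    return [key for key in keys if key not in config]


if __name__ == "__main__":
    print("Unspecified keys:", missing_hyperparameters(CONFIG_TEXT))
```

When such a file is loaded through transformers, keys that are absent are expected to fall back to the library's defaults for the declared model type, so the architecture described here is only as specific as those defaults.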
figures/fig1.png ADDED
figures/fig2.png ADDED
figures/fig3.png ADDED
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:965362299a238de576a92dfdd3e32aea7a2bacc94b2c41541c8c9258b923f587
+ size 23
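The three added lines are a Git LFS pointer, not the weights themselves. A small sketch of reading such a pointer with the standard library only is shown below. The function name `parse_lfs_pointer` and the returned field names are hypothetical, and the parser assumes the simple `key value` line layout seen in this diff:

```python
# Pointer text copied from the pytorch_model.bin diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:965362299a238de576a92dfdd3e32aea7a2bacc94b2c41541c8c9258b923f587
size 23
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of a Git LFS pointer into a dictionary."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER_TEXT)
    print(pointer["hash_algorithm"], pointer["size_bytes"])
```

The `size 23` field says the stored object is 23 bytes long, which is far too small to hold BERT weights, so the uploaded `pytorch_model.bin` appears to be a placeholder and not a usable checkpoint.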