rosherd commited on
Commit
4b13037
·
1 Parent(s): eabdb8e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +87 -275
README.md CHANGED
@@ -1,283 +1,95 @@
1
  ---
2
-
3
-
4
- language:
5
- - ru
6
- - en
7
  tags:
8
- - translation
9
  license: apache-2.0
10
  datasets:
11
- - wmt19
12
- metrics:
13
- - bleu
14
- - sacrebleu
15
  widget:
16
- - text: "Jens Peter Hansen kommer fra Danmark"
17
-
18
- ---
19
-
20
-
21
-
22
-
23
-
24
-
25
- # Model Card for Job Classification
26
-
27
- <!-- Provide a quick summary of what the model is/does. [Optional] -->
28
- model to predict job class(es) from any of
29
- Marketing &#39;account_management&#39;, &#39;accounting&#39;, &#39;administration&#39;, &#39;aerospace&#39;,
30
- &#39;agriculture&#39;, &#39;automotive&#39;, &#39;banking&#39;, &#39;buying_purchashing&#39;,
31
- &#39;charity_volunteer&#39;, &#39;cleaning&#39;, &#39;construction&#39;, &#39;consulting&#39;, &#39;cosmetology&#39;,
32
- &#39;council&#39;, &#39;creative_media_design&#39;, &#39;customer_services&#39;, &#39;education&#39;, &#39;energy&#39;,
33
- &#39;engineering&#39;, &#39;environmental&#39;, &#39;financial_services&#39;, &#39;graduate&#39;, &#39;healthcare&#39;,
34
- &#39;holiday_seasonal&#39;, &#39;hospitality&#39;, &#39;human_resources&#39;, &#39;human_services&#39;,
35
- &#39;insurance&#39;, &#39;internship&#39;, &#39;it_telecoms&#39;, &#39;legal&#39;, &#39;management&#39;,
36
- &#39;manufacturing&#39;, &#39;military_defence&#39;, &#39;multilingual&#39;, &#39;oil_and_gas&#39;,
37
- &#39;project_management&#39;, &#39;purchasing&#39;, &#39;real_estate&#39;, &#39;recruitment&#39;, &#39;retail&#39;,
38
- &#39;sales&#39;, &#39;science&#39;, &#39;security_safety&#39;, &#39;sports_fitness&#39;, &#39;strategy_consultancy&#39;,
39
- &#39;summer&#39;, &#39;training&#39;, &#39;transport_logistics&#39;, &#39;travel_tourism&#39;,
40
- &#39;veterinary_animal_care&#39;, &#39;warehouse&#39;, &#39;work_from_home&#39; ```
41
-
42
-
43
-
44
-
45
- # Table of Contents
46
-
47
- - [Model Card for Job Classification](#model-card-for--model_id-)
48
- - [Table of Contents](#table-of-contents)
49
- - [Table of Contents](#table-of-contents-1)
50
- - [Model Details](#model-details)
51
- - [Model Description](#model-description)
52
- - [Uses](#uses)
53
- - [Direct Use](#direct-use)
54
- - [Downstream Use [Optional]](#downstream-use-optional)
55
- - [Out-of-Scope Use](#out-of-scope-use)
56
- - [Bias, Risks, and Limitations](#bias-risks-and-limitations)
57
- - [Recommendations](#recommendations)
58
- - [Training Details](#training-details)
59
- - [Training Data](#training-data)
60
- - [Training Procedure](#training-procedure)
61
- - [Preprocessing](#preprocessing)
62
- - [Speeds, Sizes, Times](#speeds-sizes-times)
63
- - [Evaluation](#evaluation)
64
- - [Testing Data, Factors & Metrics](#testing-data-factors--metrics)
65
- - [Testing Data](#testing-data)
66
- - [Factors](#factors)
67
- - [Metrics](#metrics)
68
- - [Results](#results)
69
- - [Model Examination](#model-examination)
70
- - [Environmental Impact](#environmental-impact)
71
- - [Technical Specifications [optional]](#technical-specifications-optional)
72
- - [Model Architecture and Objective](#model-architecture-and-objective)
73
- - [Compute Infrastructure](#compute-infrastructure)
74
- - [Hardware](#hardware)
75
- - [Software](#software)
76
- - [Citation](#citation)
77
- - [Glossary [optional]](#glossary-optional)
78
- - [More Information [optional]](#more-information-optional)
79
- - [Model Card Authors [optional]](#model-card-authors-optional)
80
- - [Model Card Contact](#model-card-contact)
81
- - [How to Get Started with the Model](#how-to-get-started-with-the-model)
82
-
83
-
84
- # Model Details
85
-
86
- ## Model Description
87
-
88
- <!-- Provide a longer summary of what this model is/does. -->
89
- model to predict job class(es) from any of
90
- Marketing &#39;account_management&#39;, &#39;accounting&#39;, &#39;administration&#39;, &#39;aerospace&#39;,
91
- &#39;agriculture&#39;, &#39;automotive&#39;, &#39;banking&#39;, &#39;buying_purchashing&#39;,
92
- &#39;charity_volunteer&#39;, &#39;cleaning&#39;, &#39;construction&#39;, &#39;consulting&#39;, &#39;cosmetology&#39;,
93
- &#39;council&#39;, &#39;creative_media_design&#39;, &#39;customer_services&#39;, &#39;education&#39;, &#39;energy&#39;,
94
- &#39;engineering&#39;, &#39;environmental&#39;, &#39;financial_services&#39;, &#39;graduate&#39;, &#39;healthcare&#39;,
95
- &#39;holiday_seasonal&#39;, &#39;hospitality&#39;, &#39;human_resources&#39;, &#39;human_services&#39;,
96
- &#39;insurance&#39;, &#39;internship&#39;, &#39;it_telecoms&#39;, &#39;legal&#39;, &#39;management&#39;,
97
- &#39;manufacturing&#39;, &#39;military_defence&#39;, &#39;multilingual&#39;, &#39;oil_and_gas&#39;,
98
- &#39;project_management&#39;, &#39;purchasing&#39;, &#39;real_estate&#39;, &#39;recruitment&#39;, &#39;retail&#39;,
99
- &#39;sales&#39;, &#39;science&#39;, &#39;security_safety&#39;, &#39;sports_fitness&#39;, &#39;strategy_consultancy&#39;,
100
- &#39;summer&#39;, &#39;training&#39;, &#39;transport_logistics&#39;, &#39;travel_tourism&#39;,
101
- &#39;veterinary_animal_care&#39;, &#39;warehouse&#39;, &#39;work_from_home&#39; ```
102
-
103
- - **Developed by:** More information needed
104
- - **Shared by [Optional]:** More information needed
105
- - **Model type:** Language model
106
- - **Language(s) (NLP):** More information needed
107
- - **License:** More information needed
108
- - **Parent Model:** More information needed
109
- - **Resources for more information:** More information needed
110
-
111
-
112
-
113
- # Uses
114
-
115
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
116
-
117
- ## Direct Use
118
-
119
- <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
120
- <!-- If the user enters content, print that. If not, but they enter a task in the list, use that. If neither, say "more info needed." -->
121
-
122
-
123
-
124
-
125
- ## Downstream Use [Optional]
126
-
127
- <!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
128
- <!-- If the user enters content, print that. If not, but they enter a task in the list, use that. If neither, say "more info needed." -->
129
-
130
-
131
-
132
-
133
- ## Out-of-Scope Use
134
-
135
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
136
- <!-- If the user enters content, print that. If not, but they enter a task in the list, use that. If neither, say "more info needed." -->
137
-
138
-
139
-
140
-
141
- # Bias, Risks, and Limitations
142
-
143
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
144
-
145
- Significant research has explored bias and fairness issues with language models (see, e.g., [Sheng et al. (2021)](https://aclanthology.org/2021.acl-long.330.pdf) and [Bender et al. (2021)](https://dl.acm.org/doi/pdf/10.1145/3442188.3445922)). Predictions generated by the model may include disturbing and harmful stereotypes across protected classes; identity characteristics; and sensitive, social, and occupational groups.
146
-
147
-
148
- ## Recommendations
149
-
150
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
151
-
152
-
153
-
154
-
155
-
156
- # Training Details
157
-
158
- ## Training Data
159
-
160
- <!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
161
-
162
- More information on training data needed
163
-
164
-
165
- ## Training Procedure
166
-
167
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
168
-
169
- ### Preprocessing
170
-
171
- More information needed
172
-
173
- ### Speeds, Sizes, Times
174
-
175
- <!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->
176
-
177
- More information needed
178
-
179
- # Evaluation
180
-
181
- <!-- This section describes the evaluation protocols and provides the results. -->
182
-
183
- ## Testing Data, Factors & Metrics
184
-
185
- ### Testing Data
186
-
187
- <!-- This should link to a Data Card if possible. -->
188
-
189
- More information needed
190
-
191
-
192
- ### Factors
193
-
194
- <!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
195
-
196
- More information needed
197
-
198
- ### Metrics
199
-
200
- <!-- These are the evaluation metrics being used, ideally with a description of why. -->
201
-
202
- More information needed
203
-
204
- ## Results
205
-
206
- More information needed
207
-
208
- # Model Examination
209
-
210
- More information needed
211
-
212
- # Environmental Impact
213
-
214
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
215
-
216
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
217
-
218
- - **Hardware Type:** More information needed
219
- - **Hours used:** More information needed
220
- - **Cloud Provider:** More information needed
221
- - **Compute Region:** More information needed
222
- - **Carbon Emitted:** More information needed
223
-
224
- # Technical Specifications [optional]
225
-
226
- ## Model Architecture and Objective
227
-
228
- More information needed
229
-
230
- ## Compute Infrastructure
231
-
232
- More information needed
233
-
234
- ### Hardware
235
-
236
- More information needed
237
-
238
- ### Software
239
-
240
- More information needed
241
-
242
- # Citation
243
-
244
- <!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
245
-
246
- **BibTeX:**
247
-
248
- More information needed
249
-
250
- **APA:**
251
-
252
- More information needed
253
-
254
- # Glossary [optional]
255
-
256
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
257
-
258
- More information needed
259
-
260
- # More Information [optional]
261
-
262
- More information needed
263
-
264
- # Model Card Authors [optional]
265
-
266
- <!-- This section provides another layer of transparency and accountability. Whose views is this model card representing? How many voices were included in its construction? Etc. -->
267
-
268
- D, a, n, , R, o, s, h, e, r
269
-
270
- # Model Card Contact
271
-
272
- More information needed
273
-
274
- # How to Get Started with the Model
275
-
276
- Use the code below to get started with the model.
277
 
278
- <details>
279
- <summary> Click to expand </summary>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
280
 
281
- More information needed
282
 
283
- </details>
 
1
  ---
2
+ language: en
 
 
 
 
3
  tags:
4
+ - exbert
5
  license: apache-2.0
6
  datasets:
7
+ - bookcorpus
8
+ - wikipedia
 
 
9
  widget:
10
+ - text: "Paris is the <mask> of France."
11
+ example_title: "Capital"
12
+ - text: "The goal of life is"
13
+ example_title: "Philosophy"
14
+ ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
+ # Job Classification
17
+
18
+ This model will predict class(es) from any of
19
+ ```
20
+ ['Marketing',
21
+ 'account_management',
22
+ 'accounting',
23
+ 'administration',
24
+ 'aerospace',
25
+ 'agriculture',
26
+ 'automotive',
27
+ 'banking',
28
+ 'buying_purchashing',
29
+ 'charity_volunteer',
30
+ 'cleaning',
31
+ 'construction',
32
+ 'consulting',
33
+ 'cosmetology',
34
+ 'council',
35
+ 'creative_media_design',
36
+ 'customer_services',
37
+ 'education',
38
+ 'energy',
39
+ 'engineering',
40
+ 'environmental',
41
+ 'financial_services',
42
+ 'graduate',
43
+ 'healthcare',
44
+ 'holiday_seasonal',
45
+ 'hospitality',
46
+ 'human_resources',
47
+ 'human_services',
48
+ 'insurance',
49
+ 'internship',
50
+ 'it_telecoms',
51
+ 'legal',
52
+ 'management',
53
+ 'manufacturing',
54
+ 'military_defence',
55
+ 'multilingual',
56
+ 'oil_and_gas',
57
+ 'project_management',
58
+ 'purchasing',
59
+ 'real_estate',
60
+ 'recruitment',
61
+ 'retail',
62
+ 'sales',
63
+ 'science',
64
+ 'security_safety',
65
+ 'sports_fitness',
66
+ 'strategy_consultancy',
67
+ 'summer',
68
+ 'training',
69
+ 'transport_logistics',
70
+ 'travel_tourism',
71
+ 'veterinary_animal_care',
72
+ 'warehouse',
73
+ 'work_from_home']
74
+ ```
75
+
76
+ ## Model description
77
+
78
+ As above
79
+
80
+ ## Training data
81
+
82
+ pretrained on RL job data
83
+
84
+
85
+ ## Evaluation results
86
+
87
+ When fine-tuned on downstream tasks, this model achieves the following results:
88
+
89
+ Glue test results:
90
+
91
+ | Task | MNLI | QQP | QNLI | SST-2 | CoLA | STS-B | MRPC | RTE |
92
+ |:----:|:----:|:----:|:----:|:-----:|:----:|:-----:|:----:|:----:|
93
+ | | 82.2 | 88.5 | 89.2 | 91.3 | 51.3 | 85.8 | 87.5 | 59.9 |
94
 
 
95