Mitchins commited on
Commit
93f0411
·
verified ·
1 Parent(s): 324a55f

Upload folder using huggingface_hub

Browse files
README.md ADDED
@@ -0,0 +1,154 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ tags:
7
+ - emoji
8
+ - text-classification
9
+ - sentiment
10
+ - deberta
11
+ - deberta-v3
12
+ - emoji-prediction
13
+ datasets:
14
+ - custom
15
+ base_model: microsoft/deberta-v3-small
16
+ pipeline_tag: text-classification
17
+ metrics:
18
+ - accuracy
19
+ model-index:
20
+ - name: bertmoji-deberta-v3-small
21
+ results:
22
+ - task:
23
+ type: text-classification
24
+ name: Emoji Prediction
25
+ metrics:
26
+ - type: accuracy
27
+ value: 0.9019
28
+ name: Validation Accuracy
29
+ - type: accuracy
30
+ value: 0.9761
31
+ name: Top-3 Accuracy
32
+ ---
33
+
34
+ # BertMoji: Emoji Prediction with DeBERTa-v3
35
+
36
+ BertMoji predicts the most appropriate emoji for a given text message. Built on DeBERTa-v3-small, it classifies text into 250 emoji categories with 90.2% accuracy.
37
+
38
+ ## Model Description
39
+
40
+ - **Base Model:** [microsoft/deberta-v3-small](https://huggingface.co/microsoft/deberta-v3-small)
41
+ - **Task:** Multi-class emoji classification (250 classes)
42
+ - **Architecture:** DeBERTa-v3 encoder + classification head
43
+ - **Training:** Fine-tuned on ~23,500 synthetic text-emoji pairs. The model was refined over several fine-tuning sessions and evaluations.
44
+
45
+ ## Performance
46
+
47
+ | Metric | Value |
48
+ |--------|-------|
49
+ | Validation Accuracy | **90.2%** |
50
+ | Top-3 Accuracy | **97.6%** |
51
+ | Number of Classes | 250 |
52
+
53
+ ## Quick Start
54
+
55
+ ```python
56
+ import torch
57
+ import torch.nn as nn
58
+ from transformers import AutoTokenizer, DebertaV2Model
59
+ import json
60
+
61
+ class BertmojiClassifier(nn.Module):
62
+ def __init__(self, model_name, num_classes):
63
+ super().__init__()
64
+ self.encoder = DebertaV2Model.from_pretrained(model_name)
65
+ hidden_size = self.encoder.config.hidden_size
66
+ self.classifier = nn.Sequential(
67
+ nn.Dropout(0.1),
68
+ nn.Linear(hidden_size, hidden_size),
69
+ nn.GELU(),
70
+ nn.Dropout(0.1),
71
+ nn.Linear(hidden_size, num_classes)
72
+ )
73
+
74
+ def forward(self, input_ids, attention_mask):
75
+ outputs = self.encoder(input_ids=input_ids, attention_mask=attention_mask)
76
+ pooled = outputs.last_hidden_state[:, 0, :]
77
+ return self.classifier(pooled)
78
+
79
+ # Load model
80
+ model_path = "your-username/bertmoji-deberta-v3-small"
81
+ tokenizer = AutoTokenizer.from_pretrained(model_path)
82
+
83
+ with open(f"{model_path}/emoji_mappings.json") as f:
84
+ mappings = json.load(f)
85
+ id_to_emoji = {int(k): v for k, v in mappings['id_to_emoji'].items()}
86
+
87
+ model = BertmojiClassifier("microsoft/deberta-v3-small", len(id_to_emoji))
88
+ model.load_state_dict(torch.load(f"{model_path}/pytorch_model.bin", map_location="cpu"))
89
+ model.eval()
90
+
91
+ # Predict
92
+ def predict_emoji(text, top_k=3):
93
+ inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=96)
94
+ with torch.no_grad():
95
+ logits = model(inputs["input_ids"], inputs["attention_mask"])
96
+ probs = torch.softmax(logits, dim=-1)
97
+ top_probs, top_ids = probs.topk(top_k)
98
+ return [(id_to_emoji[idx.item()], prob.item()) for idx, prob in zip(top_ids[0], top_probs[0])]
99
+
100
+ # Example
101
+ print(predict_emoji("This pizza is absolutely incredible"))
102
+ # Output: [('pizza_emoji', 0.98), ...]
103
+ ```
104
+
105
+ ## Demo Examples
106
+
107
+ | Message | Top-1 | Top-2 | Top-3 |
108
+ |---------|-------|-------|-------|
109
+ | "I got the promotion! All those late nights finally paid off" | 😴 25% | 😃 12% | ✈️ 10% |
110
+ | "Done with finals! Time to sleep for three days straight" | 😴 51% | ☔ 17% | ✈️ 10% |
111
+ | "My little one turns 5 today! Where did the time go" | 🎂 41% | 🐶 6% | 🐕 6% |
112
+ | "New personal record on deadlifts this morning" | 🏋️ 72% | 🎊 8% | 💼 3% |
113
+ | "You're going to crush that interview! Believe in yourself" | 💪 55% | 💅 19% | ✨ 7% |
114
+ | "This pizza is absolutely incredible" | 🍕 98% | 🍔 1% | 🍽️ 0% |
115
+ | "Look at this adorable face! My puppy is the cutest" | 🐶 80% | 🐱 9% | 🐕 3% |
116
+ | "Cheers to the weekend! We earned this" | 🥂 92% | 🍾 2% | 🎂 1% |
117
+ | "Off to Tokyo! Can't wait to explore" | ✈️ 58% | ⛰️ 16% | 🏋️ 4% |
118
+ | "What a goal! My team is on fire tonight" | 💪 61% | ✨ 8% | 💅 5% |
119
+ | "So grateful for my amazing team. Couldn't do it without you all" | 💙 44% | 💕 14% | ✊ 7% |
120
+ | "Rainy day, hot coffee, good book. Perfect Sunday" | ☔ 65% | 🚗 25% | ❄️ 2% |
121
+
122
+ ## Training Details
123
+
124
+ | Parameter | Value |
125
+ |-----------|-------|
126
+ | Base Model | microsoft/deberta-v3-small |
127
+ | Hidden Size | 768 |
128
+ | Max Sequence Length | 96 |
129
+ | Batch Size | 32 |
130
+ | Learning Rate | 2e-6 |
131
+ | Optimizer | AdamW |
132
+ | Training Samples | ~23,500 |
133
+
134
+ ## Limitations
135
+
136
+ - Trained on synthetic English text; may not generalize to all languages or dialects
137
+ - Some emoji categories have limited training data
138
+ - Model reflects biases present in training data generation
139
+
140
+ ## License
141
+
142
+ MIT License
143
+
144
+ ## Citation
145
+
146
+ ```bibtex
147
+ @misc{bertmoji2024,
148
+ title={BertMoji: Emoji Prediction with DeBERTa-v3},
149
+ author={Mitchell Currie},
150
+ year={2024},
151
+ publisher={Hugging Face},
152
+ url={https://huggingface.co/your-username/bertmoji-deberta-v3-small}
153
+ }
154
+ ```
added_tokens.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ {
2
+ "[MASK]": 128000
3
+ }
config.json ADDED
@@ -0,0 +1,589 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "return_dict": true,
3
+ "output_hidden_states": false,
4
+ "torchscript": false,
5
+ "dtype": null,
6
+ "pruned_heads": {},
7
+ "tie_word_embeddings": true,
8
+ "chunk_size_feed_forward": 0,
9
+ "is_encoder_decoder": false,
10
+ "is_decoder": false,
11
+ "cross_attention_hidden_size": null,
12
+ "add_cross_attention": false,
13
+ "tie_encoder_decoder": false,
14
+ "architectures": [
15
+ "BertmojiClassifier"
16
+ ],
17
+ "finetuning_task": null,
18
+ "id2label": {
19
+ "0": "\u2328\ufe0f",
20
+ "1": "\u23f0",
21
+ "2": "\u2600\ufe0f",
22
+ "3": "\u2601\ufe0f",
23
+ "4": "\u2603\ufe0f",
24
+ "5": "\u2614",
25
+ "6": "\u2615",
26
+ "7": "\u261d\ufe0f",
27
+ "8": "\u2620\ufe0f",
28
+ "9": "\u2622\ufe0f",
29
+ "10": "\u2623\ufe0f",
30
+ "11": "\u2639\ufe0f",
31
+ "12": "\u26a1",
32
+ "13": "\u26bd",
33
+ "14": "\u26f0\ufe0f",
34
+ "15": "\u26f3",
35
+ "16": "\u2705",
36
+ "17": "\u2708\ufe0f",
37
+ "18": "\u270a",
38
+ "19": "\u270b",
39
+ "20": "\u270c\ufe0f",
40
+ "21": "\u270d\ufe0f",
41
+ "22": "\u2728",
42
+ "23": "\u2744\ufe0f",
43
+ "24": "\u274c",
44
+ "25": "\u2764\ufe0f",
45
+ "26": "\u2b1c",
46
+ "27": "\u2b50",
47
+ "28": "\ud83c\udf08",
48
+ "29": "\ud83c\udf0a",
49
+ "30": "\ud83c\udf0b",
50
+ "31": "\ud83c\udf0d",
51
+ "32": "\ud83c\udf19",
52
+ "33": "\ud83c\udf2e",
53
+ "34": "\ud83c\udf31",
54
+ "35": "\ud83c\udf38",
55
+ "36": "\ud83c\udf42",
56
+ "37": "\ud83c\udf53",
57
+ "38": "\ud83c\udf54",
58
+ "39": "\ud83c\udf55",
59
+ "40": "\ud83c\udf5d",
60
+ "41": "\ud83c\udf5f",
61
+ "42": "\ud83c\udf66",
62
+ "43": "\ud83c\udf69",
63
+ "44": "\ud83c\udf6a",
64
+ "45": "\ud83c\udf77",
65
+ "46": "\ud83c\udf7a",
66
+ "47": "\ud83c\udf7d\ufe0f",
67
+ "48": "\ud83c\udf7e",
68
+ "49": "\ud83c\udf81",
69
+ "50": "\ud83c\udf82",
70
+ "51": "\ud83c\udf89",
71
+ "52": "\ud83c\udf8a",
72
+ "53": "\ud83c\udf93",
73
+ "54": "\ud83c\udfac",
74
+ "55": "\ud83c\udfae",
75
+ "56": "\ud83c\udfaf",
76
+ "57": "\ud83c\udfb5",
77
+ "58": "\ud83c\udfbe",
78
+ "59": "\ud83c\udfc0",
79
+ "60": "\ud83c\udfc1",
80
+ "61": "\ud83c\udfc3",
81
+ "62": "\ud83c\udfc5",
82
+ "63": "\ud83c\udfc6",
83
+ "64": "\ud83c\udfca",
84
+ "65": "\ud83c\udfcb\ufe0f",
85
+ "66": "\ud83c\udfd6\ufe0f",
86
+ "67": "\ud83c\udfdd\ufe0f",
87
+ "68": "\ud83d\udc08",
88
+ "69": "\ud83d\udc15",
89
+ "70": "\ud83d\udc31",
90
+ "71": "\ud83d\udc36",
91
+ "72": "\ud83d\udc40",
92
+ "73": "\ud83d\udc44",
93
+ "74": "\ud83d\udc46",
94
+ "75": "\ud83d\udc47",
95
+ "76": "\ud83d\udc48",
96
+ "77": "\ud83d\udc49",
97
+ "78": "\ud83d\udc4a",
98
+ "79": "\ud83d\udc4b",
99
+ "80": "\ud83d\udc4c",
100
+ "81": "\ud83d\udc4d",
101
+ "82": "\ud83d\udc4e",
102
+ "83": "\ud83d\udc4f",
103
+ "84": "\ud83d\udc50",
104
+ "85": "\ud83d\udc51",
105
+ "86": "\ud83d\udc65",
106
+ "87": "\ud83d\udc6b",
107
+ "88": "\ud83d\udc6c",
108
+ "89": "\ud83d\udc6d",
109
+ "90": "\ud83d\udc7b",
110
+ "91": "\ud83d\udc7e",
111
+ "92": "\ud83d\udc7f",
112
+ "93": "\ud83d\udc80",
113
+ "94": "\ud83d\udc85",
114
+ "95": "\ud83d\udc8b",
115
+ "96": "\ud83d\udc91",
116
+ "97": "\ud83d\udc94",
117
+ "98": "\ud83d\udc95",
118
+ "99": "\ud83d\udc99",
119
+ "100": "\ud83d\udc9c",
120
+ "101": "\ud83d\udca1",
121
+ "102": "\ud83d\udca5",
122
+ "103": "\ud83d\udca7",
123
+ "104": "\ud83d\udca9",
124
+ "105": "\ud83d\udcaa",
125
+ "106": "\ud83d\udcad",
126
+ "107": "\ud83d\udcaf",
127
+ "108": "\ud83d\udcb0",
128
+ "109": "\ud83d\udcbb",
129
+ "110": "\ud83d\udcbc",
130
+ "111": "\ud83d\udcc5",
131
+ "112": "\ud83d\udcc8",
132
+ "113": "\ud83d\udcca",
133
+ "114": "\ud83d\udccb",
134
+ "115": "\ud83d\udccd",
135
+ "116": "\ud83d\udcd6",
136
+ "117": "\ud83d\udcdd",
137
+ "118": "\ud83d\udcde",
138
+ "119": "\ud83d\udce2",
139
+ "120": "\ud83d\udcf1",
140
+ "121": "\ud83d\udcf3",
141
+ "122": "\ud83d\udcf4",
142
+ "123": "\ud83d\udcf8",
143
+ "124": "\ud83d\udd12",
144
+ "125": "\ud83d\udd14",
145
+ "126": "\ud83d\udd17",
146
+ "127": "\ud83d\udd25",
147
+ "128": "\ud83d\udd79\ufe0f",
148
+ "129": "\ud83d\udd90",
149
+ "130": "\ud83d\udd95",
150
+ "131": "\ud83d\udd96",
151
+ "132": "\ud83d\udda4",
152
+ "133": "\ud83d\udda5\ufe0f",
153
+ "134": "\ud83d\uddd3\ufe0f",
154
+ "135": "\ud83d\ude00",
155
+ "136": "\ud83d\ude01",
156
+ "137": "\ud83d\ude02",
157
+ "138": "\ud83d\ude03",
158
+ "139": "\ud83d\ude04",
159
+ "140": "\ud83d\ude05",
160
+ "141": "\ud83d\ude06",
161
+ "142": "\ud83d\ude07",
162
+ "143": "\ud83d\ude08",
163
+ "144": "\ud83d\ude09",
164
+ "145": "\ud83d\ude0a",
165
+ "146": "\ud83d\ude0b",
166
+ "147": "\ud83d\ude0c",
167
+ "148": "\ud83d\ude0d",
168
+ "149": "\ud83d\ude0e",
169
+ "150": "\ud83d\ude0f",
170
+ "151": "\ud83d\ude10",
171
+ "152": "\ud83d\ude11",
172
+ "153": "\ud83d\ude12",
173
+ "154": "\ud83d\ude13",
174
+ "155": "\ud83d\ude14",
175
+ "156": "\ud83d\ude15",
176
+ "157": "\ud83d\ude16",
177
+ "158": "\ud83d\ude17",
178
+ "159": "\ud83d\ude18",
179
+ "160": "\ud83d\ude19",
180
+ "161": "\ud83d\ude1a",
181
+ "162": "\ud83d\ude1c",
182
+ "163": "\ud83d\ude1e",
183
+ "164": "\ud83d\ude1f",
184
+ "165": "\ud83d\ude20",
185
+ "166": "\ud83d\ude21",
186
+ "167": "\ud83d\ude23",
187
+ "168": "\ud83d\ude24",
188
+ "169": "\ud83d\ude28",
189
+ "170": "\ud83d\ude29",
190
+ "171": "\ud83d\ude2a",
191
+ "172": "\ud83d\ude2b",
192
+ "173": "\ud83d\ude2c",
193
+ "174": "\ud83d\ude2d",
194
+ "175": "\ud83d\ude2e",
195
+ "176": "\ud83d\ude30",
196
+ "177": "\ud83d\ude31",
197
+ "178": "\ud83d\ude32",
198
+ "179": "\ud83d\ude33",
199
+ "180": "\ud83d\ude34",
200
+ "181": "\ud83d\ude35",
201
+ "182": "\ud83d\ude36",
202
+ "183": "\ud83d\ude37",
203
+ "184": "\ud83d\ude38",
204
+ "185": "\ud83d\ude3a",
205
+ "186": "\ud83d\ude3b",
206
+ "187": "\ud83d\ude3c",
207
+ "188": "\ud83d\ude3d",
208
+ "189": "\ud83d\ude40",
209
+ "190": "\ud83d\ude41",
210
+ "191": "\ud83d\ude43",
211
+ "192": "\ud83d\ude44",
212
+ "193": "\ud83d\ude48",
213
+ "194": "\ud83d\ude4c",
214
+ "195": "\ud83d\ude4f",
215
+ "196": "\ud83d\ude80",
216
+ "197": "\ud83d\ude82",
217
+ "198": "\ud83d\ude8c",
218
+ "199": "\ud83d\ude95",
219
+ "200": "\ud83d\ude97",
220
+ "201": "\ud83d\udea9",
221
+ "202": "\ud83d\uded1",
222
+ "203": "\ud83d\uded2",
223
+ "204": "\ud83e\udd0f",
224
+ "205": "\ud83e\udd10",
225
+ "206": "\ud83e\udd11",
226
+ "207": "\ud83e\udd13",
227
+ "208": "\ud83e\udd14",
228
+ "209": "\ud83e\udd16",
229
+ "210": "\ud83e\udd17",
230
+ "211": "\ud83e\udd18",
231
+ "212": "\ud83e\udd19",
232
+ "213": "\ud83e\udd1a",
233
+ "214": "\ud83e\udd1b",
234
+ "215": "\ud83e\udd1c",
235
+ "216": "\ud83e\udd1d",
236
+ "217": "\ud83e\udd1e",
237
+ "218": "\ud83e\udd1f",
238
+ "219": "\ud83e\udd20",
239
+ "220": "\ud83e\udd21",
240
+ "221": "\ud83e\udd22",
241
+ "222": "\ud83e\udd24",
242
+ "223": "\ud83e\udd25",
243
+ "224": "\ud83e\udd27",
244
+ "225": "\ud83e\udd28",
245
+ "226": "\ud83e\udd29",
246
+ "227": "\ud83e\udd2a",
247
+ "228": "\ud83e\udd2b",
248
+ "229": "\ud83e\udd2c",
249
+ "230": "\ud83e\udd2d",
250
+ "231": "\ud83e\udd2e",
251
+ "232": "\ud83e\udd2f",
252
+ "233": "\ud83e\udd32",
253
+ "234": "\ud83e\udd33",
254
+ "235": "\ud83e\udd42",
255
+ "236": "\ud83e\udd57",
256
+ "237": "\ud83e\udd70",
257
+ "238": "\ud83e\udd71",
258
+ "239": "\ud83e\udd72",
259
+ "240": "\ud83e\udd73",
260
+ "241": "\ud83e\udd74",
261
+ "242": "\ud83e\udd75",
262
+ "243": "\ud83e\udd76",
263
+ "244": "\ud83e\udd78",
264
+ "245": "\ud83e\udd7a",
265
+ "246": "\ud83e\uddd0",
266
+ "247": "\ud83e\udde2",
267
+ "248": "\ud83e\uddf3",
268
+ "249": "\ud83e\udef6"
269
+ },
270
+ "label2id": {
271
+ "\u2328\ufe0f": 0,
272
+ "\u23f0": 1,
273
+ "\u2600\ufe0f": 2,
274
+ "\u2601\ufe0f": 3,
275
+ "\u2603\ufe0f": 4,
276
+ "\u2614": 5,
277
+ "\u2615": 6,
278
+ "\u261d\ufe0f": 7,
279
+ "\u2620\ufe0f": 8,
280
+ "\u2622\ufe0f": 9,
281
+ "\u2623\ufe0f": 10,
282
+ "\u2639\ufe0f": 11,
283
+ "\u26a1": 12,
284
+ "\u26bd": 13,
285
+ "\u26f0\ufe0f": 14,
286
+ "\u26f3": 15,
287
+ "\u2705": 16,
288
+ "\u2708\ufe0f": 17,
289
+ "\u270a": 18,
290
+ "\u270b": 19,
291
+ "\u270c\ufe0f": 20,
292
+ "\u270d\ufe0f": 21,
293
+ "\u2728": 22,
294
+ "\u2744\ufe0f": 23,
295
+ "\u274c": 24,
296
+ "\u2764\ufe0f": 25,
297
+ "\u2b1c": 26,
298
+ "\u2b50": 27,
299
+ "\ud83c\udf08": 28,
300
+ "\ud83c\udf0a": 29,
301
+ "\ud83c\udf0b": 30,
302
+ "\ud83c\udf0d": 31,
303
+ "\ud83c\udf19": 32,
304
+ "\ud83c\udf2e": 33,
305
+ "\ud83c\udf31": 34,
306
+ "\ud83c\udf38": 35,
307
+ "\ud83c\udf42": 36,
308
+ "\ud83c\udf53": 37,
309
+ "\ud83c\udf54": 38,
310
+ "\ud83c\udf55": 39,
311
+ "\ud83c\udf5d": 40,
312
+ "\ud83c\udf5f": 41,
313
+ "\ud83c\udf66": 42,
314
+ "\ud83c\udf69": 43,
315
+ "\ud83c\udf6a": 44,
316
+ "\ud83c\udf77": 45,
317
+ "\ud83c\udf7a": 46,
318
+ "\ud83c\udf7d\ufe0f": 47,
319
+ "\ud83c\udf7e": 48,
320
+ "\ud83c\udf81": 49,
321
+ "\ud83c\udf82": 50,
322
+ "\ud83c\udf89": 51,
323
+ "\ud83c\udf8a": 52,
324
+ "\ud83c\udf93": 53,
325
+ "\ud83c\udfac": 54,
326
+ "\ud83c\udfae": 55,
327
+ "\ud83c\udfaf": 56,
328
+ "\ud83c\udfb5": 57,
329
+ "\ud83c\udfbe": 58,
330
+ "\ud83c\udfc0": 59,
331
+ "\ud83c\udfc1": 60,
332
+ "\ud83c\udfc3": 61,
333
+ "\ud83c\udfc5": 62,
334
+ "\ud83c\udfc6": 63,
335
+ "\ud83c\udfca": 64,
336
+ "\ud83c\udfcb\ufe0f": 65,
337
+ "\ud83c\udfd6\ufe0f": 66,
338
+ "\ud83c\udfdd\ufe0f": 67,
339
+ "\ud83d\udc08": 68,
340
+ "\ud83d\udc15": 69,
341
+ "\ud83d\udc31": 70,
342
+ "\ud83d\udc36": 71,
343
+ "\ud83d\udc40": 72,
344
+ "\ud83d\udc44": 73,
345
+ "\ud83d\udc46": 74,
346
+ "\ud83d\udc47": 75,
347
+ "\ud83d\udc48": 76,
348
+ "\ud83d\udc49": 77,
349
+ "\ud83d\udc4a": 78,
350
+ "\ud83d\udc4b": 79,
351
+ "\ud83d\udc4c": 80,
352
+ "\ud83d\udc4d": 81,
353
+ "\ud83d\udc4e": 82,
354
+ "\ud83d\udc4f": 83,
355
+ "\ud83d\udc50": 84,
356
+ "\ud83d\udc51": 85,
357
+ "\ud83d\udc65": 86,
358
+ "\ud83d\udc6b": 87,
359
+ "\ud83d\udc6c": 88,
360
+ "\ud83d\udc6d": 89,
361
+ "\ud83d\udc7b": 90,
362
+ "\ud83d\udc7e": 91,
363
+ "\ud83d\udc7f": 92,
364
+ "\ud83d\udc80": 93,
365
+ "\ud83d\udc85": 94,
366
+ "\ud83d\udc8b": 95,
367
+ "\ud83d\udc91": 96,
368
+ "\ud83d\udc94": 97,
369
+ "\ud83d\udc95": 98,
370
+ "\ud83d\udc99": 99,
371
+ "\ud83d\udc9c": 100,
372
+ "\ud83d\udca1": 101,
373
+ "\ud83d\udca5": 102,
374
+ "\ud83d\udca7": 103,
375
+ "\ud83d\udca9": 104,
376
+ "\ud83d\udcaa": 105,
377
+ "\ud83d\udcad": 106,
378
+ "\ud83d\udcaf": 107,
379
+ "\ud83d\udcb0": 108,
380
+ "\ud83d\udcbb": 109,
381
+ "\ud83d\udcbc": 110,
382
+ "\ud83d\udcc5": 111,
383
+ "\ud83d\udcc8": 112,
384
+ "\ud83d\udcca": 113,
385
+ "\ud83d\udccb": 114,
386
+ "\ud83d\udccd": 115,
387
+ "\ud83d\udcd6": 116,
388
+ "\ud83d\udcdd": 117,
389
+ "\ud83d\udcde": 118,
390
+ "\ud83d\udce2": 119,
391
+ "\ud83d\udcf1": 120,
392
+ "\ud83d\udcf3": 121,
393
+ "\ud83d\udcf4": 122,
394
+ "\ud83d\udcf8": 123,
395
+ "\ud83d\udd12": 124,
396
+ "\ud83d\udd14": 125,
397
+ "\ud83d\udd17": 126,
398
+ "\ud83d\udd25": 127,
399
+ "\ud83d\udd79\ufe0f": 128,
400
+ "\ud83d\udd90": 129,
401
+ "\ud83d\udd95": 130,
402
+ "\ud83d\udd96": 131,
403
+ "\ud83d\udda4": 132,
404
+ "\ud83d\udda5\ufe0f": 133,
405
+ "\ud83d\uddd3\ufe0f": 134,
406
+ "\ud83d\ude00": 135,
407
+ "\ud83d\ude01": 136,
408
+ "\ud83d\ude02": 137,
409
+ "\ud83d\ude03": 138,
410
+ "\ud83d\ude04": 139,
411
+ "\ud83d\ude05": 140,
412
+ "\ud83d\ude06": 141,
413
+ "\ud83d\ude07": 142,
414
+ "\ud83d\ude08": 143,
415
+ "\ud83d\ude09": 144,
416
+ "\ud83d\ude0a": 145,
417
+ "\ud83d\ude0b": 146,
418
+ "\ud83d\ude0c": 147,
419
+ "\ud83d\ude0d": 148,
420
+ "\ud83d\ude0e": 149,
421
+ "\ud83d\ude0f": 150,
422
+ "\ud83d\ude10": 151,
423
+ "\ud83d\ude11": 152,
424
+ "\ud83d\ude12": 153,
425
+ "\ud83d\ude13": 154,
426
+ "\ud83d\ude14": 155,
427
+ "\ud83d\ude15": 156,
428
+ "\ud83d\ude16": 157,
429
+ "\ud83d\ude17": 158,
430
+ "\ud83d\ude18": 159,
431
+ "\ud83d\ude19": 160,
432
+ "\ud83d\ude1a": 161,
433
+ "\ud83d\ude1c": 162,
434
+ "\ud83d\ude1e": 163,
435
+ "\ud83d\ude1f": 164,
436
+ "\ud83d\ude20": 165,
437
+ "\ud83d\ude21": 166,
438
+ "\ud83d\ude23": 167,
439
+ "\ud83d\ude24": 168,
440
+ "\ud83d\ude28": 169,
441
+ "\ud83d\ude29": 170,
442
+ "\ud83d\ude2a": 171,
443
+ "\ud83d\ude2b": 172,
444
+ "\ud83d\ude2c": 173,
445
+ "\ud83d\ude2d": 174,
446
+ "\ud83d\ude2e": 175,
447
+ "\ud83d\ude30": 176,
448
+ "\ud83d\ude31": 177,
449
+ "\ud83d\ude32": 178,
450
+ "\ud83d\ude33": 179,
451
+ "\ud83d\ude34": 180,
452
+ "\ud83d\ude35": 181,
453
+ "\ud83d\ude36": 182,
454
+ "\ud83d\ude37": 183,
455
+ "\ud83d\ude38": 184,
456
+ "\ud83d\ude3a": 185,
457
+ "\ud83d\ude3b": 186,
458
+ "\ud83d\ude3c": 187,
459
+ "\ud83d\ude3d": 188,
460
+ "\ud83d\ude40": 189,
461
+ "\ud83d\ude41": 190,
462
+ "\ud83d\ude43": 191,
463
+ "\ud83d\ude44": 192,
464
+ "\ud83d\ude48": 193,
465
+ "\ud83d\ude4c": 194,
466
+ "\ud83d\ude4f": 195,
467
+ "\ud83d\ude80": 196,
468
+ "\ud83d\ude82": 197,
469
+ "\ud83d\ude8c": 198,
470
+ "\ud83d\ude95": 199,
471
+ "\ud83d\ude97": 200,
472
+ "\ud83d\udea9": 201,
473
+ "\ud83d\uded1": 202,
474
+ "\ud83d\uded2": 203,
475
+ "\ud83e\udd0f": 204,
476
+ "\ud83e\udd10": 205,
477
+ "\ud83e\udd11": 206,
478
+ "\ud83e\udd13": 207,
479
+ "\ud83e\udd14": 208,
480
+ "\ud83e\udd16": 209,
481
+ "\ud83e\udd17": 210,
482
+ "\ud83e\udd18": 211,
483
+ "\ud83e\udd19": 212,
484
+ "\ud83e\udd1a": 213,
485
+ "\ud83e\udd1b": 214,
486
+ "\ud83e\udd1c": 215,
487
+ "\ud83e\udd1d": 216,
488
+ "\ud83e\udd1e": 217,
489
+ "\ud83e\udd1f": 218,
490
+ "\ud83e\udd20": 219,
491
+ "\ud83e\udd21": 220,
492
+ "\ud83e\udd22": 221,
493
+ "\ud83e\udd24": 222,
494
+ "\ud83e\udd25": 223,
495
+ "\ud83e\udd27": 224,
496
+ "\ud83e\udd28": 225,
497
+ "\ud83e\udd29": 226,
498
+ "\ud83e\udd2a": 227,
499
+ "\ud83e\udd2b": 228,
500
+ "\ud83e\udd2c": 229,
501
+ "\ud83e\udd2d": 230,
502
+ "\ud83e\udd2e": 231,
503
+ "\ud83e\udd2f": 232,
504
+ "\ud83e\udd32": 233,
505
+ "\ud83e\udd33": 234,
506
+ "\ud83e\udd42": 235,
507
+ "\ud83e\udd57": 236,
508
+ "\ud83e\udd70": 237,
509
+ "\ud83e\udd71": 238,
510
+ "\ud83e\udd72": 239,
511
+ "\ud83e\udd73": 240,
512
+ "\ud83e\udd74": 241,
513
+ "\ud83e\udd75": 242,
514
+ "\ud83e\udd76": 243,
515
+ "\ud83e\udd78": 244,
516
+ "\ud83e\udd7a": 245,
517
+ "\ud83e\uddd0": 246,
518
+ "\ud83e\udde2": 247,
519
+ "\ud83e\uddf3": 248,
520
+ "\ud83e\udef6": 249
521
+ },
522
+ "task_specific_params": null,
523
+ "problem_type": null,
524
+ "tokenizer_class": null,
525
+ "prefix": null,
526
+ "bos_token_id": null,
527
+ "pad_token_id": 0,
528
+ "eos_token_id": null,
529
+ "sep_token_id": null,
530
+ "decoder_start_token_id": null,
531
+ "max_length": 20,
532
+ "min_length": 0,
533
+ "do_sample": false,
534
+ "early_stopping": false,
535
+ "num_beams": 1,
536
+ "temperature": 1.0,
537
+ "top_k": 50,
538
+ "top_p": 1.0,
539
+ "typical_p": 1.0,
540
+ "repetition_penalty": 1.0,
541
+ "length_penalty": 1.0,
542
+ "no_repeat_ngram_size": 0,
543
+ "encoder_no_repeat_ngram_size": 0,
544
+ "bad_words_ids": null,
545
+ "num_return_sequences": 1,
546
+ "output_scores": false,
547
+ "return_dict_in_generate": false,
548
+ "forced_bos_token_id": null,
549
+ "forced_eos_token_id": null,
550
+ "remove_invalid_values": false,
551
+ "exponential_decay_length_penalty": null,
552
+ "suppress_tokens": null,
553
+ "begin_suppress_tokens": null,
554
+ "num_beam_groups": 1,
555
+ "diversity_penalty": 0.0,
556
+ "_name_or_path": "",
557
+ "transformers_version": "4.57.1",
558
+ "model_type": "deberta-v2",
559
+ "position_buckets": 256,
560
+ "norm_rel_ebd": "layer_norm",
561
+ "share_att_key": true,
562
+ "tf_legacy_loss": false,
563
+ "use_bfloat16": false,
564
+ "hidden_size": 768,
565
+ "num_hidden_layers": 6,
566
+ "num_attention_heads": 12,
567
+ "intermediate_size": 3072,
568
+ "hidden_act": "gelu",
569
+ "hidden_dropout_prob": 0.1,
570
+ "attention_probs_dropout_prob": 0.1,
571
+ "max_position_embeddings": 512,
572
+ "type_vocab_size": 0,
573
+ "initializer_range": 0.02,
574
+ "relative_attention": true,
575
+ "max_relative_positions": -1,
576
+ "position_biased_input": false,
577
+ "pos_att_type": [
578
+ "p2c",
579
+ "c2p"
580
+ ],
581
+ "vocab_size": 128100,
582
+ "layer_norm_eps": 1e-07,
583
+ "pooler_hidden_size": 768,
584
+ "pooler_dropout": 0,
585
+ "pooler_hidden_act": "gelu",
586
+ "legacy": true,
587
+ "output_attentions": false,
588
+ "num_labels": 250
589
+ }
emoji_mappings.json ADDED
@@ -0,0 +1,506 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "emoji_to_id": {
3
+ "⌨️": 0,
4
+ "⏰": 1,
5
+ "☀️": 2,
6
+ "☁️": 3,
7
+ "☃️": 4,
8
+ "☔": 5,
9
+ "☕": 6,
10
+ "☝️": 7,
11
+ "☠️": 8,
12
+ "☢️": 9,
13
+ "☣️": 10,
14
+ "☹️": 11,
15
+ "⚡": 12,
16
+ "⚽": 13,
17
+ "⛰️": 14,
18
+ "⛳": 15,
19
+ "✅": 16,
20
+ "✈️": 17,
21
+ "✊": 18,
22
+ "✋": 19,
23
+ "✌️": 20,
24
+ "✍️": 21,
25
+ "✨": 22,
26
+ "❄️": 23,
27
+ "❌": 24,
28
+ "❤️": 25,
29
+ "⬜": 26,
30
+ "⭐": 27,
31
+ "🌈": 28,
32
+ "🌊": 29,
33
+ "🌋": 30,
34
+ "🌍": 31,
35
+ "🌙": 32,
36
+ "🌮": 33,
37
+ "🌱": 34,
38
+ "🌸": 35,
39
+ "🍂": 36,
40
+ "🍓": 37,
41
+ "🍔": 38,
42
+ "🍕": 39,
43
+ "🍝": 40,
44
+ "🍟": 41,
45
+ "🍦": 42,
46
+ "🍩": 43,
47
+ "🍪": 44,
48
+ "🍷": 45,
49
+ "🍺": 46,
50
+ "🍽️": 47,
51
+ "🍾": 48,
52
+ "🎁": 49,
53
+ "🎂": 50,
54
+ "🎉": 51,
55
+ "🎊": 52,
56
+ "🎓": 53,
57
+ "🎬": 54,
58
+ "🎮": 55,
59
+ "🎯": 56,
60
+ "🎵": 57,
61
+ "🎾": 58,
62
+ "🏀": 59,
63
+ "🏁": 60,
64
+ "🏃": 61,
65
+ "🏅": 62,
66
+ "🏆": 63,
67
+ "🏊": 64,
68
+ "🏋️": 65,
69
+ "🏖️": 66,
70
+ "🏝️": 67,
71
+ "🐈": 68,
72
+ "🐕": 69,
73
+ "🐱": 70,
74
+ "🐶": 71,
75
+ "👀": 72,
76
+ "👄": 73,
77
+ "👆": 74,
78
+ "👇": 75,
79
+ "👈": 76,
80
+ "👉": 77,
81
+ "👊": 78,
82
+ "👋": 79,
83
+ "👌": 80,
84
+ "👍": 81,
85
+ "👎": 82,
86
+ "👏": 83,
87
+ "👐": 84,
88
+ "👑": 85,
89
+ "👥": 86,
90
+ "👫": 87,
91
+ "👬": 88,
92
+ "👭": 89,
93
+ "👻": 90,
94
+ "👾": 91,
95
+ "👿": 92,
96
+ "💀": 93,
97
+ "💅": 94,
98
+ "💋": 95,
99
+ "💑": 96,
100
+ "💔": 97,
101
+ "💕": 98,
102
+ "💙": 99,
103
+ "💜": 100,
104
+ "💡": 101,
105
+ "💥": 102,
106
+ "💧": 103,
107
+ "💩": 104,
108
+ "💪": 105,
109
+ "💭": 106,
110
+ "💯": 107,
111
+ "💰": 108,
112
+ "💻": 109,
113
+ "💼": 110,
114
+ "📅": 111,
115
+ "📈": 112,
116
+ "📊": 113,
117
+ "📋": 114,
118
+ "📍": 115,
119
+ "📖": 116,
120
+ "📝": 117,
121
+ "📞": 118,
122
+ "📢": 119,
123
+ "📱": 120,
124
+ "📳": 121,
125
+ "📴": 122,
126
+ "📸": 123,
127
+ "🔒": 124,
128
+ "🔔": 125,
129
+ "🔗": 126,
130
+ "🔥": 127,
131
+ "🕹️": 128,
132
+ "🖐": 129,
133
+ "🖕": 130,
134
+ "🖖": 131,
135
+ "🖤": 132,
136
+ "🖥️": 133,
137
+ "🗓️": 134,
138
+ "😀": 135,
139
+ "😁": 136,
140
+ "😂": 137,
141
+ "😃": 138,
142
+ "😄": 139,
143
+ "😅": 140,
144
+ "😆": 141,
145
+ "😇": 142,
146
+ "😈": 143,
147
+ "😉": 144,
148
+ "😊": 145,
149
+ "😋": 146,
150
+ "😌": 147,
151
+ "😍": 148,
152
+ "😎": 149,
153
+ "😏": 150,
154
+ "😐": 151,
155
+ "😑": 152,
156
+ "😒": 153,
157
+ "😓": 154,
158
+ "😔": 155,
159
+ "😕": 156,
160
+ "😖": 157,
161
+ "😗": 158,
162
+ "😘": 159,
163
+ "😙": 160,
164
+ "😚": 161,
165
+ "😜": 162,
166
+ "😞": 163,
167
+ "😟": 164,
168
+ "😠": 165,
169
+ "😡": 166,
170
+ "😣": 167,
171
+ "😤": 168,
172
+ "😨": 169,
173
+ "😩": 170,
174
+ "😪": 171,
175
+ "😫": 172,
176
+ "😬": 173,
177
+ "😭": 174,
178
+ "😮": 175,
179
+ "😰": 176,
180
+ "😱": 177,
181
+ "😲": 178,
182
+ "😳": 179,
183
+ "😴": 180,
184
+ "😵": 181,
185
+ "😶": 182,
186
+ "😷": 183,
187
+ "😸": 184,
188
+ "😺": 185,
189
+ "😻": 186,
190
+ "😼": 187,
191
+ "😽": 188,
192
+ "🙀": 189,
193
+ "🙁": 190,
194
+ "🙃": 191,
195
+ "🙄": 192,
196
+ "🙈": 193,
197
+ "🙌": 194,
198
+ "🙏": 195,
199
+ "🚀": 196,
200
+ "🚂": 197,
201
+ "🚌": 198,
202
+ "🚕": 199,
203
+ "🚗": 200,
204
+ "🚩": 201,
205
+ "🛑": 202,
206
+ "🛒": 203,
207
+ "🤏": 204,
208
+ "🤐": 205,
209
+ "🤑": 206,
210
+ "🤓": 207,
211
+ "🤔": 208,
212
+ "🤖": 209,
213
+ "🤗": 210,
214
+ "🤘": 211,
215
+ "🤙": 212,
216
+ "🤚": 213,
217
+ "🤛": 214,
218
+ "🤜": 215,
219
+ "🤝": 216,
220
+ "🤞": 217,
221
+ "🤟": 218,
222
+ "🤠": 219,
223
+ "🤡": 220,
224
+ "🤢": 221,
225
+ "🤤": 222,
226
+ "🤥": 223,
227
+ "🤧": 224,
228
+ "🤨": 225,
229
+ "🤩": 226,
230
+ "🤪": 227,
231
+ "🤫": 228,
232
+ "🤬": 229,
233
+ "🤭": 230,
234
+ "🤮": 231,
235
+ "🤯": 232,
236
+ "🤲": 233,
237
+ "🤳": 234,
238
+ "🥂": 235,
239
+ "🥗": 236,
240
+ "🥰": 237,
241
+ "🥱": 238,
242
+ "🥲": 239,
243
+ "🥳": 240,
244
+ "🥴": 241,
245
+ "🥵": 242,
246
+ "🥶": 243,
247
+ "🥸": 244,
248
+ "🥺": 245,
249
+ "🧐": 246,
250
+ "🧢": 247,
251
+ "🧳": 248,
252
+ "🫶": 249
253
+ },
254
+ "id_to_emoji": {
255
+ "0": "⌨️",
256
+ "1": "⏰",
257
+ "2": "☀️",
258
+ "3": "☁️",
259
+ "4": "☃️",
260
+ "5": "☔",
261
+ "6": "☕",
262
+ "7": "☝️",
263
+ "8": "☠️",
264
+ "9": "☢️",
265
+ "10": "☣️",
266
+ "11": "☹️",
267
+ "12": "⚡",
268
+ "13": "⚽",
269
+ "14": "⛰️",
270
+ "15": "⛳",
271
+ "16": "✅",
272
+ "17": "✈️",
273
+ "18": "✊",
274
+ "19": "✋",
275
+ "20": "✌️",
276
+ "21": "✍️",
277
+ "22": "✨",
278
+ "23": "❄️",
279
+ "24": "❌",
280
+ "25": "❤️",
281
+ "26": "⬜",
282
+ "27": "⭐",
283
+ "28": "🌈",
284
+ "29": "🌊",
285
+ "30": "🌋",
286
+ "31": "🌍",
287
+ "32": "🌙",
288
+ "33": "🌮",
289
+ "34": "🌱",
290
+ "35": "🌸",
291
+ "36": "🍂",
292
+ "37": "🍓",
293
+ "38": "🍔",
294
+ "39": "🍕",
295
+ "40": "🍝",
296
+ "41": "🍟",
297
+ "42": "🍦",
298
+ "43": "🍩",
299
+ "44": "🍪",
300
+ "45": "🍷",
301
+ "46": "🍺",
302
+ "47": "🍽️",
303
+ "48": "🍾",
304
+ "49": "🎁",
305
+ "50": "🎂",
306
+ "51": "🎉",
307
+ "52": "🎊",
308
+ "53": "🎓",
309
+ "54": "🎬",
310
+ "55": "🎮",
311
+ "56": "🎯",
312
+ "57": "🎵",
313
+ "58": "🎾",
314
+ "59": "🏀",
315
+ "60": "🏁",
316
+ "61": "🏃",
317
+ "62": "🏅",
318
+ "63": "🏆",
319
+ "64": "🏊",
320
+ "65": "🏋️",
321
+ "66": "🏖️",
322
+ "67": "🏝️",
323
+ "68": "🐈",
324
+ "69": "🐕",
325
+ "70": "🐱",
326
+ "71": "🐶",
327
+ "72": "👀",
328
+ "73": "👄",
329
+ "74": "👆",
330
+ "75": "👇",
331
+ "76": "👈",
332
+ "77": "👉",
333
+ "78": "👊",
334
+ "79": "👋",
335
+ "80": "👌",
336
+ "81": "👍",
337
+ "82": "👎",
338
+ "83": "👏",
339
+ "84": "👐",
340
+ "85": "👑",
341
+ "86": "👥",
342
+ "87": "👫",
343
+ "88": "👬",
344
+ "89": "👭",
345
+ "90": "👻",
346
+ "91": "👾",
347
+ "92": "👿",
348
+ "93": "💀",
349
+ "94": "💅",
350
+ "95": "💋",
351
+ "96": "💑",
352
+ "97": "💔",
353
+ "98": "💕",
354
+ "99": "💙",
355
+ "100": "💜",
356
+ "101": "💡",
357
+ "102": "💥",
358
+ "103": "💧",
359
+ "104": "💩",
360
+ "105": "💪",
361
+ "106": "💭",
362
+ "107": "💯",
363
+ "108": "💰",
364
+ "109": "💻",
365
+ "110": "💼",
366
+ "111": "📅",
367
+ "112": "📈",
368
+ "113": "📊",
369
+ "114": "📋",
370
+ "115": "📍",
371
+ "116": "📖",
372
+ "117": "📝",
373
+ "118": "📞",
374
+ "119": "📢",
375
+ "120": "📱",
376
+ "121": "📳",
377
+ "122": "📴",
378
+ "123": "📸",
379
+ "124": "🔒",
380
+ "125": "🔔",
381
+ "126": "🔗",
382
+ "127": "🔥",
383
+ "128": "🕹️",
384
+ "129": "🖐",
385
+ "130": "🖕",
386
+ "131": "🖖",
387
+ "132": "🖤",
388
+ "133": "🖥️",
389
+ "134": "🗓️",
390
+ "135": "😀",
391
+ "136": "😁",
392
+ "137": "😂",
393
+ "138": "😃",
394
+ "139": "😄",
395
+ "140": "😅",
396
+ "141": "😆",
397
+ "142": "😇",
398
+ "143": "😈",
399
+ "144": "😉",
400
+ "145": "😊",
401
+ "146": "😋",
402
+ "147": "😌",
403
+ "148": "😍",
404
+ "149": "😎",
405
+ "150": "😏",
406
+ "151": "😐",
407
+ "152": "😑",
408
+ "153": "😒",
409
+ "154": "😓",
410
+ "155": "😔",
411
+ "156": "😕",
412
+ "157": "😖",
413
+ "158": "😗",
414
+ "159": "😘",
415
+ "160": "😙",
416
+ "161": "😚",
417
+ "162": "😜",
418
+ "163": "😞",
419
+ "164": "😟",
420
+ "165": "😠",
421
+ "166": "😡",
422
+ "167": "😣",
423
+ "168": "😤",
424
+ "169": "😨",
425
+ "170": "😩",
426
+ "171": "😪",
427
+ "172": "😫",
428
+ "173": "😬",
429
+ "174": "😭",
430
+ "175": "😮",
431
+ "176": "😰",
432
+ "177": "😱",
433
+ "178": "😲",
434
+ "179": "😳",
435
+ "180": "😴",
436
+ "181": "😵",
437
+ "182": "😶",
438
+ "183": "😷",
439
+ "184": "😸",
440
+ "185": "😺",
441
+ "186": "😻",
442
+ "187": "😼",
443
+ "188": "😽",
444
+ "189": "🙀",
445
+ "190": "🙁",
446
+ "191": "🙃",
447
+ "192": "🙄",
448
+ "193": "🙈",
449
+ "194": "🙌",
450
+ "195": "🙏",
451
+ "196": "🚀",
452
+ "197": "🚂",
453
+ "198": "🚌",
454
+ "199": "🚕",
455
+ "200": "🚗",
456
+ "201": "🚩",
457
+ "202": "🛑",
458
+ "203": "🛒",
459
+ "204": "🤏",
460
+ "205": "🤐",
461
+ "206": "🤑",
462
+ "207": "🤓",
463
+ "208": "🤔",
464
+ "209": "🤖",
465
+ "210": "🤗",
466
+ "211": "🤘",
467
+ "212": "🤙",
468
+ "213": "🤚",
469
+ "214": "🤛",
470
+ "215": "🤜",
471
+ "216": "🤝",
472
+ "217": "🤞",
473
+ "218": "🤟",
474
+ "219": "🤠",
475
+ "220": "🤡",
476
+ "221": "🤢",
477
+ "222": "🤤",
478
+ "223": "🤥",
479
+ "224": "🤧",
480
+ "225": "🤨",
481
+ "226": "🤩",
482
+ "227": "🤪",
483
+ "228": "🤫",
484
+ "229": "🤬",
485
+ "230": "🤭",
486
+ "231": "🤮",
487
+ "232": "🤯",
488
+ "233": "🤲",
489
+ "234": "🤳",
490
+ "235": "🥂",
491
+ "236": "🥗",
492
+ "237": "🥰",
493
+ "238": "🥱",
494
+ "239": "🥲",
495
+ "240": "🥳",
496
+ "241": "🥴",
497
+ "242": "🥵",
498
+ "243": "🥶",
499
+ "244": "🥸",
500
+ "245": "🥺",
501
+ "246": "🧐",
502
+ "247": "🧢",
503
+ "248": "🧳",
504
+ "249": "🫶"
505
+ }
506
+ }
labels.txt ADDED
@@ -0,0 +1,250 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ⌨️
2
+
3
+ ☀️
4
+ ☁️
5
+ ☃️
6
+
7
+
8
+ ☝️
9
+ ☠️
10
+ ☢️
11
+ ☣️
12
+ ☹️
13
+
14
+
15
+ ⛰️
16
+
17
+
18
+ ✈️
19
+
20
+
21
+ ✌️
22
+ ✍️
23
+
24
+ ❄️
25
+
26
+ ❤️
27
+
28
+
29
+ 🌈
30
+ 🌊
31
+ 🌋
32
+ 🌍
33
+ 🌙
34
+ 🌮
35
+ 🌱
36
+ 🌸
37
+ 🍂
38
+ 🍓
39
+ 🍔
40
+ 🍕
41
+ 🍝
42
+ 🍟
43
+ 🍦
44
+ 🍩
45
+ 🍪
46
+ 🍷
47
+ 🍺
48
+ 🍽️
49
+ 🍾
50
+ 🎁
51
+ 🎂
52
+ 🎉
53
+ 🎊
54
+ 🎓
55
+ 🎬
56
+ 🎮
57
+ 🎯
58
+ 🎵
59
+ 🎾
60
+ 🏀
61
+ 🏁
62
+ 🏃
63
+ 🏅
64
+ 🏆
65
+ 🏊
66
+ 🏋️
67
+ 🏖️
68
+ 🏝️
69
+ 🐈
70
+ 🐕
71
+ 🐱
72
+ 🐶
73
+ 👀
74
+ 👄
75
+ 👆
76
+ 👇
77
+ 👈
78
+ 👉
79
+ 👊
80
+ 👋
81
+ 👌
82
+ 👍
83
+ 👎
84
+ 👏
85
+ 👐
86
+ 👑
87
+ 👥
88
+ 👫
89
+ 👬
90
+ 👭
91
+ 👻
92
+ 👾
93
+ 👿
94
+ 💀
95
+ 💅
96
+ 💋
97
+ 💑
98
+ 💔
99
+ 💕
100
+ 💙
101
+ 💜
102
+ 💡
103
+ 💥
104
+ 💧
105
+ 💩
106
+ 💪
107
+ 💭
108
+ 💯
109
+ 💰
110
+ 💻
111
+ 💼
112
+ 📅
113
+ 📈
114
+ 📊
115
+ 📋
116
+ 📍
117
+ 📖
118
+ 📝
119
+ 📞
120
+ 📢
121
+ 📱
122
+ 📳
123
+ 📴
124
+ 📸
125
+ 🔒
126
+ 🔔
127
+ 🔗
128
+ 🔥
129
+ 🕹️
130
+ 🖐
131
+ 🖕
132
+ 🖖
133
+ 🖤
134
+ 🖥️
135
+ 🗓️
136
+ 😀
137
+ 😁
138
+ 😂
139
+ 😃
140
+ 😄
141
+ 😅
142
+ 😆
143
+ 😇
144
+ 😈
145
+ 😉
146
+ 😊
147
+ 😋
148
+ 😌
149
+ 😍
150
+ 😎
151
+ 😏
152
+ 😐
153
+ 😑
154
+ 😒
155
+ 😓
156
+ 😔
157
+ 😕
158
+ 😖
159
+ 😗
160
+ 😘
161
+ 😙
162
+ 😚
163
+ 😜
164
+ 😞
165
+ 😟
166
+ 😠
167
+ 😡
168
+ 😣
169
+ 😤
170
+ 😨
171
+ 😩
172
+ 😪
173
+ 😫
174
+ 😬
175
+ 😭
176
+ 😮
177
+ 😰
178
+ 😱
179
+ 😲
180
+ 😳
181
+ 😴
182
+ 😵
183
+ 😶
184
+ 😷
185
+ 😸
186
+ 😺
187
+ 😻
188
+ 😼
189
+ 😽
190
+ 🙀
191
+ 🙁
192
+ 🙃
193
+ 🙄
194
+ 🙈
195
+ 🙌
196
+ 🙏
197
+ 🚀
198
+ 🚂
199
+ 🚌
200
+ 🚕
201
+ 🚗
202
+ 🚩
203
+ 🛑
204
+ 🛒
205
+ 🤏
206
+ 🤐
207
+ 🤑
208
+ 🤓
209
+ 🤔
210
+ 🤖
211
+ 🤗
212
+ 🤘
213
+ 🤙
214
+ 🤚
215
+ 🤛
216
+ 🤜
217
+ 🤝
218
+ 🤞
219
+ 🤟
220
+ 🤠
221
+ 🤡
222
+ 🤢
223
+ 🤤
224
+ 🤥
225
+ 🤧
226
+ 🤨
227
+ 🤩
228
+ 🤪
229
+ 🤫
230
+ 🤬
231
+ 🤭
232
+ 🤮
233
+ 🤯
234
+ 🤲
235
+ 🤳
236
+ 🥂
237
+ 🥗
238
+ 🥰
239
+ 🥱
240
+ 🥲
241
+ 🥳
242
+ 🥴
243
+ 🥵
244
+ 🥶
245
+ 🥸
246
+ 🥺
247
+ 🧐
248
+ 🧢
249
+ 🧳
250
+ 🫶
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:03f68b05ecf38866f36f399fd64b6475d66e8fb07fc6ffd3783f18c2f0192252
3
+ size 568361384
pytorch_model.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6c35bfb4d4885c4edff9cee02d3b3dcce68075b7cd494cfa32c3f54035927d42
3
+ size 568392395
special_tokens_map.json ADDED
@@ -0,0 +1,15 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": "[CLS]",
3
+ "cls_token": "[CLS]",
4
+ "eos_token": "[SEP]",
5
+ "mask_token": "[MASK]",
6
+ "pad_token": "[PAD]",
7
+ "sep_token": "[SEP]",
8
+ "unk_token": {
9
+ "content": "[UNK]",
10
+ "lstrip": false,
11
+ "normalized": true,
12
+ "rstrip": false,
13
+ "single_word": false
14
+ }
15
+ }
spm.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c679fbf93643d19aab7ee10c0b99e460bdbc02fedf34b92b05af343b4af586fd
3
+ size 2464616
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,59 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "[CLS]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "[SEP]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "[UNK]",
29
+ "lstrip": false,
30
+ "normalized": true,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "128000": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "[CLS]",
45
+ "clean_up_tokenization_spaces": false,
46
+ "cls_token": "[CLS]",
47
+ "do_lower_case": false,
48
+ "eos_token": "[SEP]",
49
+ "extra_special_tokens": {},
50
+ "mask_token": "[MASK]",
51
+ "model_max_length": 1000000000000000019884624838656,
52
+ "pad_token": "[PAD]",
53
+ "sep_token": "[SEP]",
54
+ "sp_model_kwargs": {},
55
+ "split_by_punct": false,
56
+ "tokenizer_class": "DebertaV2Tokenizer",
57
+ "unk_token": "[UNK]",
58
+ "vocab_type": "spm"
59
+ }