baiges committed on
Commit
7e1d01f
·
verified ·
1 Parent(s): deebd0e

Update fine-tuned Sentence Transformer model

1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
+ {
+   "word_embedding_dimension": 768,
+   "pooling_mode_cls_token": false,
+   "pooling_mode_mean_tokens": true,
+   "pooling_mode_max_tokens": false,
+   "pooling_mode_mean_sqrt_len_tokens": false,
+   "pooling_mode_weightedmean_tokens": false,
+   "pooling_mode_lasttoken": false,
+   "include_prompt": true
+ }
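
The flags in this config select how per-token embeddings are collapsed into a single sentence vector; here only `pooling_mode_mean_tokens` is enabled. The following is a minimal sketch in plain Python that lists the enabled modes. The JSON is inlined rather than read from `1_Pooling/config.json`, and `active_pooling_modes` is a hypothetical helper written for this illustration, not part of the sentence-transformers API.

```python
import json

# Contents of 1_Pooling/config.json from this commit, inlined for the example.
POOLING_CONFIG = """
{
  "word_embedding_dimension": 768,
  "pooling_mode_cls_token": false,
  "pooling_mode_mean_tokens": true,
  "pooling_mode_max_tokens": false,
  "pooling_mode_mean_sqrt_len_tokens": false,
  "pooling_mode_weightedmean_tokens": false,
  "pooling_mode_lasttoken": false,
  "include_prompt": true
}
"""


def active_pooling_modes(config: dict) -> list[str]:
    """Return the names of all enabled pooling_mode_* flags (hypothetical helper)."""
    prefix = "pooling_mode_"
    return [
        key[len(prefix):]
        for key, value in config.items()
        if key.startswith(prefix) and value is True
    ]


config = json.loads(POOLING_CONFIG)
modes = active_pooling_modes(config)
print(modes)  # ['mean_tokens']
```

A check like this can be handy when comparing checkpoints, since a different pooling mode changes the embeddings even if the transformer weights are identical.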
README.md ADDED
@@ -0,0 +1,766 @@
+ ---
+ tags:
+ - sentence-transformers
+ - sentence-similarity
+ - feature-extraction
+ - generated_from_trainer
+ - dataset_size:121408
+ - loss:MultipleNegativesRankingLoss
+ base_model: sentence-transformers/all-mpnet-base-v2
+ widget:
11
+ - source_sentence: "NAME: Baked Beans\n\nCATEGORY: Beans\n\nKEYWORDS: < 60 Mins, Easy,\
12
+ \ Inexpensive\n\nTOOLS: pan\n\nINGREDIENTS: ground beef, bell pepper, onion, brown\
13
+ \ sugar, lemon juice, ketchup, barbecue sauce\n\nINSTRUCTIONS: \nCook you bell\
14
+ \ pepper and onion in with your ground beef. Drain grease.\nIn a casserole mix\
15
+ \ all other ingredients.\nPut in over for 20 minutes."
16
+ sentences:
17
+ - 'NAME: Hearty White Bean Bake
18
+
19
+
20
+ CATEGORY: Beans
21
+
22
+
23
+ KEYWORDS: < 60 Mins, Easy, Inexpensive, One-Pot
24
+
25
+
26
+ TOOLS: pan
27
+
28
+
29
+ INGREDIENTS: ground turkey, poblano pepper, shallot, maple syrup, apple cider
30
+ vinegar, chili sauce, smoked paprika, cannellini beans
31
+
32
+
33
+ INSTRUCTIONS:
34
+
35
+ Cook the poblano pepper and shallot with the ground turkey until the turkey is
36
+ browned. Drain any excess grease.
37
+
38
+ In the same pan, combine the cooked turkey mixture with maple syrup, apple cider
39
+ vinegar, chili sauce, smoked paprika, and cannellini beans.
40
+
41
+ Mix well and bake in the oven for 25 minutes, or until heated through and bubbly.'
42
+ - 'NAME: Spicy Southwest Seasoning Mix
43
+
44
+ CATEGORY: < 15 Mins
45
+
46
+ KEYWORDS: No Cook, Easy, Spice Mix
47
+
48
+ TOOLS: Small jar
49
+
50
+ INGREDIENTS: onion powder, chili powder, smoked paprika, seasoning salt, cayenne
51
+ pepper, black pepper
52
+
53
+ INSTRUCTIONS: Add all ingredients into a small jar. Secure the lid tightly. Shake
54
+ well until thoroughly combined.'
55
+ - 'NAME: Rosé Sangria
56
+
57
+
58
+ CATEGORY: Beverages
59
+
60
+
61
+ KEYWORDS: Citrus, Berries, < 4 Hours, Easy, Refreshing
62
+
63
+
64
+ TOOLS: large punch bowl, wooden spoon
65
+
66
+
67
+ INGREDIENTS: limes, raspberries, sugar, rosé wine, brandy, strawberry slices
68
+
69
+
70
+ INSTRUCTIONS:
71
+
72
+ Place lime slices and raspberries in a large punch bowl.
73
+
74
+ Pour sugar over slices and berries and with a wooden spoon lightly mash together
75
+ until sugar dissolves and the fruit begins to break down.
76
+
77
+ Stir in rosé wine and brandy.
78
+
79
+ Add strawberry slices.
80
+
81
+ Refrigerate at least 2 hours or up to 10.
82
+
83
+ Add ice cubes and sparkling water just before serving.'
84
+ - source_sentence: "NAME: Pink Banana Bread\n\nCATEGORY: Quick Breads\n\nKEYWORDS:\
85
+ \ Breads, Pineapple, Tropical Fruits, Fruit, Healthy, < 4 Hours, Easy\n\nTOOLS:\
86
+ \ grease bread pan, oven\n\nINGREDIENTS: yogurt, sugar, crushed pineapple, eggs,\
87
+ \ flour, baking soda, baking powder, salt, bananas, pecans\n\nINSTRUCTIONS: \n\
88
+ Mix yogurt, sugar, crushed pineapple, eggs, flour, baking soda, baking powder,\
89
+ \ salt, grenadine, bananas, and pecans together.\nGrease bread pan.\nBake at 375\
90
+ \ degrees Fahrenheit for 60 minutes or until done."
91
+ sentences:
92
+ - 'NAME: Lemon-Herb Baked Cod
93
+
94
+
95
+ CATEGORY: Tilapia
96
+
97
+
98
+ KEYWORDS: Healthy, High Protein, Quick, Baked, Lemon, < 45 Mins
99
+
100
+
101
+ TOOLS: spoon, baking dish, oven, zester
102
+
103
+
104
+ INGREDIENTS: olive oil, lemon juice, lemon zest, garlic cloves, dried oregano,
105
+ dried thyme, cod fillets, salt, black pepper
106
+
107
+
108
+ INSTRUCTIONS: Preheat oven to 400°F (200°C).
109
+
110
+ In a small bowl, combine olive oil, lemon juice, lemon zest, minced garlic, oregano,
111
+ thyme, salt, and pepper.
112
+
113
+ Place cod fillets in a baking dish.
114
+
115
+ Spoon the lemon-herb mixture over the cod, ensuring each fillet is well coated.
116
+
117
+ Bake for 12-15 minutes, or until the cod is opaque and flakes easily with a fork.
118
+
119
+ Check for doneness by flaking with a fork.
120
+
121
+ Serve immediately.'
122
+ - 'NAME: Fluffy Maple Buttercream Frosting
123
+
124
+ CATEGORY: Dessert
125
+
126
+ KEYWORDS: Low Protein, Kid Friendly, Sweet, Mixer, < 15 Mins, Beginner Cook, Small
127
+ Appliance, Easy, Maple
128
+
129
+ TOOLS: mixer, bowl
130
+
131
+ INGREDIENTS: vegetable shortening, brown butter, maple extract, confectioners''
132
+ sugar, milk
133
+
134
+ INSTRUCTIONS: In a bowl, use a mixer to beat the vegetable shortening with the
135
+ browned butter and maple extract until light and creamy. Gradually add in the
136
+ confectioners'' sugar, beating on low speed until combined, then increase speed
137
+ and beat until fluffy. If needed, add milk, one tablespoon at a time, to reach
138
+ the desired consistency. Add a pinch of cinnamon if using.'
139
+ - 'NAME: Mango Coconut Bread
140
+
141
+
142
+ CATEGORY: Quick Breads
143
+
144
+
145
+ KEYWORDS: Breads, Mango, Tropical Fruits, Fruit, Healthy, < 4 Hours, Easy, Gluten-Free
146
+
147
+
148
+ TOOLS: grease bread pan, oven, mixing bowl
149
+
150
+
151
+ INGREDIENTS: Greek yogurt, coconut sugar, shredded coconut, eggs, almond flour,
152
+ baking soda, baking powder, salt, mangos, macadamia nuts, lime zest
153
+
154
+
155
+ INSTRUCTIONS:
156
+
157
+ Preheat oven to 375 degrees Fahrenheit.
158
+
159
+ In a large mixing bowl, combine Greek yogurt, coconut sugar, shredded coconut,
160
+ eggs, almond flour, baking soda, baking powder, salt, and lime zest. Mix well.
161
+
162
+ Fold in diced mangos and chopped macadamia nuts.
163
+
164
+ Grease bread pan.
165
+
166
+ Pour batter into the prepared bread pan.
167
+
168
+ Bake for 55-65 minutes, or until a toothpick inserted into the center comes out
169
+ clean.
170
+
171
+ Let cool in the pan for 10 minutes before transferring to a wire rack to cool
172
+ completely.'
173
+ - source_sentence: "NAME: Layered Zucchini &amp; Yellow Squash Casserole\n\nCATEGORY:\
174
+ \ Vegetable\n\nKEYWORDS: Low Protein, Low Cholesterol, Summer, < 60 Mins, Oven\n\
175
+ \nTOOLS: oven, baking pan\n\nINGREDIENTS: zucchini, onion, green bell pepper,\
176
+ \ fresh mushrooms, tomatoes, butter, parmesan cheese\n\nINSTRUCTIONS: \nLightly\
177
+ \ grease an 8 inch square baking pan (or spray with Pam).\nLayer the vegetables\
178
+ \ in the order listed, sprinkling each layer with salt and pepper as desired.\n\
179
+ Dot the top with butter, and sprinkle with Parmesan cheese.\nBake at 350F for\
180
+ \ 35 minutes or until crisp-tender."
181
+ sentences:
182
+ - 'NAME: Rustic Spelt Bread
183
+
184
+
185
+ CATEGORY: Yeast Breads
186
+
187
+
188
+ KEYWORDS: Breads, Grains, Swiss, European, Low Cholesterol, Healthy, Small Appliance,
189
+ < 4 Hours, Easy, Spelt
190
+
191
+
192
+ TOOLS: bowl, sharp knife, oven, electric mixer, baking sheet
193
+
194
+
195
+ INGREDIENTS: spelt flour, whole wheat flour, salt, dry yeast, warm water, milk,
196
+ olive oil
197
+
198
+
199
+ INSTRUCTIONS:
200
+
201
+ Combine the spelt flour, whole wheat flour, and salt in a large bowl. Add the
202
+ dry yeast.
203
+
204
+ Pour in the warm water, milk, and olive oil. Mix using an electric mixer with
205
+ a dough hook until the dough forms a fairly firm ball and cleans the sides of
206
+ the bowl.
207
+
208
+ Let the dough rise in a warm place until it has doubled in size.
209
+
210
+ Preheat the oven to 400°F.
211
+
212
+ Shape the dough into a round loaf on a prepared baking sheet. Cover with a damp
213
+ cloth, and allow to rest for about 20 minutes for a second rise.
214
+
215
+ Use a sharp knife to score the top of the loaf with a simple cross pattern. Bake
216
+ for 35 to 40 minutes, or until the crust is golden brown and the loaf sounds hollow
217
+ when tapped.'
218
+ - 'NAME: Layered Eggplant & Bell Pepper Casserole
219
+
220
+
221
+ CATEGORY: Vegetable
222
+
223
+
224
+ KEYWORDS: Low Protein, Low Cholesterol, Summer, < 60 Mins, Oven, Vegetarian
225
+
226
+
227
+ TOOLS: oven, baking pan
228
+
229
+
230
+ INGREDIENTS: eggplant, red onion, yellow bell pepper, fresh cremini mushrooms,
231
+ diced tomatoes, olive oil, mozzarella cheese
232
+
233
+
234
+ INSTRUCTIONS: Lightly grease an 8 inch square baking pan (or spray with cooking
235
+ spray). Layer the vegetables in the order listed, sprinkling each layer with salt,
236
+ pepper, and a pinch of dried oregano as desired. Drizzle the top with olive oil,
237
+ and sprinkle with mozzarella cheese. Bake at 350F for 40 minutes or until the
238
+ vegetables are tender and the cheese is melted and lightly browned.'
239
+ - 'NAME: Turkey Spinach Orzo Skillet
240
+
241
+
242
+ CATEGORY: One Dish Meal
243
+
244
+
245
+ KEYWORDS: Turkey, Poultry, Meat, Low Cholesterol, Healthy, < 45 Mins, Stove Top,
246
+ Quick
247
+
248
+
249
+ TOOLS: large skillet
250
+
251
+
252
+ INGREDIENTS: olive oil, butter, water, ground turkey, frozen spinach, sun-dried
253
+ tomatoes, dried oregano, feta cheese, orzo pasta
254
+
255
+
256
+ INSTRUCTIONS: In large skillet, saute orzo pasta in olive oil until lightly toasted
257
+ over medium heat. Stir in water and oregano; bring to a boil over high heat. Cover;
258
+ reduce heat to low. Simmer 8 minutes. Stir in spinach, ground turkey, and sun-dried
259
+ tomatoes. Cover, simmer 5 to 7 minutes or until most of liquid is absorbed and
260
+ turkey is cooked through. Crumble feta cheese over the top.'
261
+ - source_sentence: "NAME: Easy Basalmic Vinaigrette\n\nCATEGORY: Salad Dressings\n\
262
+ \nKEYWORDS: < 15 Mins, Easy\n\nTOOLS: \n\nINGREDIENTS: extra virgin olive oil,\
263
+ \ Dijon mustard, dried basil, salt, fresh ground pepper\n\nINSTRUCTIONS: \nPlace\
264
+ \ all ingredients in a 20 ounce reusable water bottle.\nShake vigorously until\
265
+ \ combined."
266
+ sentences:
267
+ - 'NAME: Savory Turkey Loaf
268
+
269
+
270
+ CATEGORY: One Dish Meal
271
+
272
+
273
+ KEYWORDS: Meat, Weeknight, < 4 Hours, Inexpensive, Easy
274
+
275
+
276
+ TOOLS: bread pan, oven, large bowl, turkey bowl
277
+
278
+
279
+ INGREDIENTS: ground turkey, onions, bell pepper, garlic powder, salt, egg, bread
280
+ crumbs, Worcestershire sauce
281
+
282
+
283
+ INSTRUCTIONS:
284
+
285
+ Preheat oven to 375 degrees F.
286
+
287
+ Finely dice the onions and bell pepper. In a large bowl, combine ground turkey,
288
+ diced onions, diced bell pepper, salt, garlic powder, and a dash of Worcestershire
289
+ sauce. Mix thoroughly with your hands until well combined. Add bread crumbs to
290
+ the mixture and combine again using your hands. Incorporate the egg, mixing until
291
+ evenly distributed.
292
+
293
+ Press the mixture firmly into a bread pan.
294
+
295
+ Bake for 50-60 minutes, or until the internal temperature reaches 165 degrees
296
+ F. Let stand for 10 minutes before slicing and serving.'
297
+ - 'NAME: Smoked Salmon Spread
298
+
299
+
300
+ CATEGORY: Spreads
301
+
302
+
303
+ KEYWORDS: Salmon, < 4 Hours, Easy, Smoked, Appetizer
304
+
305
+
306
+ TOOLS:
307
+
308
+
309
+ INGREDIENTS: cream cheese, mayonnaise, dill
310
+
311
+
312
+ INSTRUCTIONS:
313
+
314
+ Combine the cream cheese and mayonnaise.
315
+
316
+ Mix well, and chill for 2 hours.
317
+
318
+ Garnish with fresh dill before serving with crackers, bagel chips, or vegetables.'
319
+ - 'NAME: Simple Lemon Herb Vinaigrette
320
+
321
+ CATEGORY: Salad Dressings
322
+
323
+ KEYWORDS: < 15 Mins, Easy, Fresh
324
+
325
+ TOOLS: 20 ounce reusable water bottle
326
+
327
+ INGREDIENTS: extra virgin olive oil, honey Dijon mustard, dried oregano, salt,
328
+ fresh ground pepper, lemon juice
329
+
330
+ INSTRUCTIONS: Place all ingredients in a 20 ounce reusable water bottle. Shake
331
+ vigorously until combined. Let stand for 5 minutes before serving to allow flavors
332
+ to meld.'
333
+ - source_sentence: "NAME: Spinach with Raisins and Pine Nuts\n\nCATEGORY: Fruit\n\n\
334
+ KEYWORDS: Vegetable, Nuts, Low Cholesterol, Healthy, < 30 Mins, Stove Top\n\n\
335
+ TOOLS: grill, pot\n\nINGREDIENTS: fresh spinach, pine nuts, salt, raisins, olive\
336
+ \ oil, lemon juice\n\nINSTRUCTIONS: \nClean the spinach thoroughly.\nGrill the\
337
+ \ pine nuts until golden brown, watching carefully so as not to burn.\nBring a\
338
+ \ pot of salted water to the boil and toss in raisins and spinach.\nDrain as soon\
339
+ \ as spinach goes limp.\ntoss in olive oil and lemon juice, and scatter with the\
340
+ \ grilled pine nuts."
341
+ sentences:
342
+ - 'NAME: Dried Apricots with Pistachios and Almonds
343
+
344
+
345
+ CATEGORY: Fruit
346
+
347
+
348
+ KEYWORDS: Dried Fruit, Nuts, Low Cholesterol, Healthy, < 30 Mins, Stove Top, Vegan
349
+
350
+
351
+ TOOLS: grill, pot
352
+
353
+
354
+ INGREDIENTS: dried apricots, pistachios, salt, slivered almonds, olive oil, orange
355
+ juice
356
+
357
+
358
+ INSTRUCTIONS:
359
+
360
+ Soak the dried apricots in warm water for 10 minutes to soften them.
361
+
362
+ Grill the pistachios until lightly toasted, being careful not to burn them.
363
+
364
+ Bring a pot of salted water to the boil and add the softened apricots.
365
+
366
+ Drain immediately after the apricots plump up slightly.
367
+
368
+ Toss with olive oil and orange juice, then sprinkle with the grilled pistachios
369
+ and slivered almonds.'
370
+ - 'NAME: Smoky Chipotle Turkey Meatloaf
371
+
372
+
373
+ CATEGORY: Meat
374
+
375
+
376
+ KEYWORDS: < 60 Mins, Spicy, Oven, Comfort Food
377
+
378
+
379
+ TOOLS: frying pan, meat thermometer, oven, loaf pan
380
+
381
+
382
+ INGREDIENTS: bacon, yellow onion, green bell pepper, chipotle powder, garlic powder,
383
+ dried oregano, salt, ground mustard, smoked paprika, chili powder, tomato paste,
384
+ chicken broth, eggs, ground turkey
385
+
386
+
387
+ INSTRUCTIONS:
388
+
389
+ Preheat oven to 425 degrees.
390
+
391
+ Cook bacon in frying pan, remove, drain, and chop.
392
+
393
+ Leave drippings in pan and saute (but do not brown) onion and green pepper.
394
+
395
+ Add chipotle powder, garlic powder, oregano, salt, mustard, smoked paprika, and
396
+ chili powder.
397
+
398
+ Cook for 8 minutes.
399
+
400
+ Remove pan from heat and add tomato paste and chicken broth.
401
+
402
+ Mix bread crumbs with eggs and add to ground turkey.
403
+
404
+ Add spice mixture and bacon to turkey mixture and mix gently.
405
+
406
+ Place mixture in two or three 8 x 4 inch individual loaf pans.
407
+
408
+ Cook until done, about 35 to 45 minutes, or until internal temperature reaches
409
+ 165 degrees on a meat thermometer.
410
+
411
+ Let rest for 10 minutes before slicing.'
412
+ - 'NAME: Buttermilk Corn Fritters
413
+
414
+
415
+ CATEGORY: Breads
416
+
417
+
418
+ KEYWORDS: Healthy, Spicy, < 60 Mins, Deep Fried, Corn
419
+
420
+
421
+ TOOLS: pan, mixing bowl, slotted spoon
422
+
423
+
424
+ INGREDIENTS: yellow cornmeal, gluten-free flour blend, baking powder, brown sugar,
425
+ salt, eggs, buttermilk, scallions, cheddar cheese
426
+
427
+
428
+ INSTRUCTIONS: In a mixing bowl, combine cornmeal, flour, baking powder, brown
429
+ sugar, and salt; mix well. Add eggs, buttermilk, chopped scallions, and shredded
430
+ cheddar cheese; stir until just combined. Heat 1-inch of oil to 365°F in a pan.
431
+ Carefully drop spoonfuls of batter into the hot oil, cooking in batches to avoid
432
+ overcrowding. Fry fritters for 2-3 minutes, flipping halfway through, until golden
433
+ brown and cooked through. Remove fritters with a slotted spoon and place on paper
434
+ towel-lined plates to drain excess oil. Serve immediately.'
+ pipeline_tag: sentence-similarity
+ library_name: sentence-transformers
+ ---
+
+ # SentenceTransformer based on sentence-transformers/all-mpnet-base-v2
+
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
+
+ ## Model Details
+
+ ### Model Description
+ - **Model Type:** Sentence Transformer
+ - **Base model:** [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2) <!-- at revision 12e86a3c702fc3c50205a8db88f0ec7c0b6b94a0 -->
+ - **Maximum Sequence Length:** 384 tokens
+ - **Output Dimensionality:** 768 dimensions
+ - **Similarity Function:** Cosine Similarity
+ <!-- - **Training Dataset:** Unknown -->
+ <!-- - **Language:** Unknown -->
+ <!-- - **License:** Unknown -->
+
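
The model card lists cosine similarity as the similarity function. As a rough illustration of what that measures, here is a minimal pure-Python sketch; the three-dimensional toy vectors stand in for the 768-dimensional embeddings, and `cosine_similarity` is a name chosen for this example rather than a library call.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Toy vectors (assumption: real embeddings have 768 dimensions).
query = [1.0, 2.0, 0.0]
similar = [2.0, 4.0, 0.0]    # same direction as the query, different length
unrelated = [0.0, 0.0, 3.0]  # orthogonal to the query

sim_similar = cosine_similarity(query, similar)
sim_unrelated = cosine_similarity(query, unrelated)
print(round(sim_similar, 6), sim_unrelated)
```

Because only the direction matters, scaling a vector does not change its score, which is why the architecture can normalize embeddings without affecting rankings.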
+ ### Model Sources
+
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
+
+ ### Full Model Architecture
+
+ ```
+ SentenceTransformer(
+   (0): Transformer({'max_seq_length': 384, 'do_lower_case': False}) with Transformer model: MPNetModel
+   (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
+   (2): Normalize()
+ )
+ ```
+
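
Stages (1) and (2) of the architecture above amount to averaging the token vectors and then scaling the result to unit length. The sketch below reproduces that arithmetic in plain Python on two-dimensional toy vectors; it assumes at least one unmasked token, and `mean_pool` and `l2_normalize` are illustrative names rather than the library's own functions.

```python
import math


def mean_pool(token_embeddings, attention_mask):
    """Average token vectors, skipping padding positions (mask == 0)."""
    dim = len(token_embeddings[0])
    summed = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vector):
                summed[i] += value
    return [s / count for s in summed]


def l2_normalize(vector):
    """Scale a vector so that its Euclidean length is 1."""
    norm = math.sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector]


# Two real tokens followed by one padding token that must be ignored.
tokens = [[3.0, 0.0], [3.0, 8.0], [100.0, 100.0]]
mask = [1, 1, 0]

pooled = mean_pool(tokens, mask)   # [3.0, 4.0]
embedding = l2_normalize(pooled)   # approximately [0.6, 0.8]
print(pooled, embedding)
```

After normalization, the dot product of two embeddings equals their cosine similarity.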
+ ## Usage
+
+ ### Direct Usage (Sentence Transformers)
+
+ First install the Sentence Transformers library:
+
+ ```bash
+ pip install -U sentence-transformers
+ ```
+
+ Then you can load this model and run inference.
+ ```python
+ from sentence_transformers import SentenceTransformer
+
+ # Download from the 🤗 Hub
+ model = SentenceTransformer("GPTasty/TastyRecipesEmbedder")
+ # Run inference
+ sentences = [
+     'NAME: Spinach with Raisins and Pine Nuts\n\nCATEGORY: Fruit\n\nKEYWORDS: Vegetable, Nuts, Low Cholesterol, Healthy, < 30 Mins, Stove Top\n\nTOOLS: grill, pot\n\nINGREDIENTS: fresh spinach, pine nuts, salt, raisins, olive oil, lemon juice\n\nINSTRUCTIONS: \nClean the spinach thoroughly.\nGrill the pine nuts until golden brown, watching carefully so as not to burn.\nBring a pot of salted water to the boil and toss in raisins and spinach.\nDrain as soon as spinach goes limp.\ntoss in olive oil and lemon juice, and scatter with the grilled pine nuts.',
+     'NAME: Dried Apricots with Pistachios and Almonds\n\nCATEGORY: Fruit\n\nKEYWORDS: Dried Fruit, Nuts, Low Cholesterol, Healthy, < 30 Mins, Stove Top, Vegan\n\nTOOLS: grill, pot\n\nINGREDIENTS: dried apricots, pistachios, salt, slivered almonds, olive oil, orange juice\n\nINSTRUCTIONS:\nSoak the dried apricots in warm water for 10 minutes to soften them.\nGrill the pistachios until lightly toasted, being careful not to burn them.\nBring a pot of salted water to the boil and add the softened apricots.\nDrain immediately after the apricots plump up slightly.\nToss with olive oil and orange juice, then sprinkle with the grilled pistachios and slivered almonds.',
+     'NAME: Smoky Chipotle Turkey Meatloaf\n\nCATEGORY: Meat\n\nKEYWORDS: < 60 Mins, Spicy, Oven, Comfort Food\n\nTOOLS: frying pan, meat thermometer, oven, loaf pan\n\nINGREDIENTS: bacon, yellow onion, green bell pepper, chipotle powder, garlic powder, dried oregano, salt, ground mustard, smoked paprika, chili powder, tomato paste, chicken broth, eggs, ground turkey\n\nINSTRUCTIONS:\nPreheat oven to 425 degrees.\nCook bacon in frying pan, remove, drain, and chop.\nLeave drippings in pan and saute (but do not brown) onion and green pepper.\nAdd chipotle powder, garlic powder, oregano, salt, mustard, smoked paprika, and chili powder.\nCook for 8 minutes.\nRemove pan from heat and add tomato paste and chicken broth.\nMix bread crumbs with eggs and add to ground turkey.\nAdd spice mixture and bacon to turkey mixture and mix gently.\nPlace mixture in two or three 8 x 4 inch individual loaf pans.\nCook until done, about 35 to 45 minutes, or until internal temperature reaches 165 degrees on a meat thermometer.\nLet rest for 10 minutes before slicing.',
+ ]
+ embeddings = model.encode(sentences)
+ print(embeddings.shape)
+ # (3, 768)
+
+ # Get the similarity scores for the embeddings
+ similarities = model.similarity(embeddings, embeddings)
+ print(similarities.shape)
+ # torch.Size([3, 3])
+ ```
+
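
The recipes passed to `model.encode` above share a fixed text layout: labelled sections separated by blank lines, with one instruction step per line. The preprocessing used to build the training data is not shown in this commit, so the helper below is only a sketch of how a structured recipe could be rendered into a similar layout. `format_recipe` and its parameters are hypothetical names introduced for this example.

```python
def format_recipe(name, category, keywords, tools, ingredients, instructions):
    """Render a recipe into the NAME/CATEGORY/... layout seen in the examples.

    Hypothetical helper: the real preprocessing code is not part of this commit.
    """
    sections = [
        f"NAME: {name}",
        f"CATEGORY: {category}",
        f"KEYWORDS: {', '.join(keywords)}",
        f"TOOLS: {', '.join(tools)}",
        f"INGREDIENTS: {', '.join(ingredients)}",
        "INSTRUCTIONS:\n" + "\n".join(instructions),
    ]
    # Sections are separated by a blank line, steps by a single newline.
    return "\n\n".join(sections)


text = format_recipe(
    name="Homemade Honey Mustard",
    category="Sauces",
    keywords=["Low Protein", "< 15 Mins", "Easy"],
    tools=[],
    ingredients=["Dijon mustard", "sour cream", "honey", "Worcestershire sauce"],
    instructions=["Mix well, enjoy."],
)
print(text)
```

Keeping query recipes in the same layout as the training pairs is a reasonable default, since the model was fine-tuned on text of this shape.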
+ <!--
+ ### Direct Usage (Transformers)
+
+ <details><summary>Click to see the direct usage in Transformers</summary>
+
+ </details>
+ -->
+
+ <!--
+ ### Downstream Usage (Sentence Transformers)
+
+ You can finetune this model on your own dataset.
+
+ <details><summary>Click to expand</summary>
+
+ </details>
+ -->
+
+ <!--
+ ### Out-of-Scope Use
+
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
+ -->
+
+ <!--
+ ## Bias, Risks and Limitations
+
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
+ -->
+
+ <!--
+ ### Recommendations
+
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
+ -->
+
+ ## Training Details
+
+ ### Training Dataset
+
+ #### Unnamed Dataset
+
+ * Size: 121,408 training samples
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
+ * Approximate statistics based on the first 1000 samples:
+   | | sentence_0 | sentence_1 |
+   |:--------|:------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
+   | type | string | string |
+   | details | <ul><li>min: 45 tokens</li><li>mean: 185.8 tokens</li><li>max: 384 tokens</li></ul> | <ul><li>min: 59 tokens</li><li>mean: 222.58 tokens</li><li>max: 384 tokens</li></ul> |
+ * Samples:
+   | sentence_0 | sentence_1 |
+   |:-----------|:-----------|
+   | <code>NAME: Homemade Honey Mustard<br><br>CATEGORY: Sauces<br><br>KEYWORDS: Low Protein, < 15 Mins, Easy<br><br>TOOLS: <br><br>INGREDIENTS: Dijon mustard, sour cream, honey, Worcestershire sauce<br><br>INSTRUCTIONS: <br>Mix well, enjoy.</code> | <code>NAME: Creamy Maple Mustard Sauce<br>CATEGORY: Sauces<br>KEYWORDS: Low Protein, < 15 Mins, Easy, Gluten-Free<br>TOOLS:<br>INGREDIENTS: Whole grain mustard, Greek yogurt, maple syrup, apple cider vinegar<br>INSTRUCTIONS: Combine all ingredients in a bowl and mix until well combined. Refrigerate for at least 10 minutes before serving to allow flavors to meld. Enjoy with pretzels or veggies.</code> |
+   | <code>NAME: Baby Greens With Hazelnut Parmesan Crisps<br><br>CATEGORY: Greens<br><br>KEYWORDS: Vegetable, High In..., < 30 Mins<br><br>TOOLS: parchment paper, mixer, whisk, oven, baking sheet<br><br>INGREDIENTS: parmesan cheese, hazelnuts, lemon juice, olive oil, maple syrup, lettuce, prosciutto<br><br>INSTRUCTIONS: <br>Preheat oven to 350°F Line a baking sheet with parchment paper.<br>Combine Parmesan and hazelnuts. Drop 12 spoonfuls of Parmesan mixture onto baking sheet 3 inches apart.<br>Bake crisps for 8 to 10 minutes, or until golden. Cool on baking sheet.<br>Whisk together lemon juice, oil and maple syrup. Season with salt and pepper.<br>Toss lettuce with vinaigrette and pile on individual plates.<br>Coil each slice of prosciutto into a rose shape and set a rose in center of each mound of greens. Garnish each serving with two Parmesan crisps.</code> | <code>NAME: Spinach Salad with Almond Manchego Crisps<br><br>CATEGORY: Greens<br><br>KEYWORDS: Vegetable, High In..., < 30 Mins, Gluten-Free<br><br>TOOLS: parchment paper, mixer, whisk, oven, baking sheet<br><br>INGREDIENTS: manchego cheese, almonds, lime juice, avocado oil, honey, spinach, serrano ham<br><br>INSTRUCTIONS:<br>Preheat oven to 375°F. Line a baking sheet with parchment paper.<br>Combine Manchego cheese and chopped almonds. Drop 12 spoonfuls of the Manchego mixture onto the baking sheet, spacing them 3 inches apart.<br>Bake crisps for 6 to 8 minutes, or until golden brown. Let cool on the baking sheet.<br>Whisk together lime juice, avocado oil, and honey. Season with salt and a pinch of red pepper flakes.<br>Toss spinach with the vinaigrette and arrange on individual plates.<br>Roll each slice of serrano ham into a flower shape and place one in the center of each spinach mound. Garnish each serving with two Manchego crisps.</code> |
+   | <code>NAME: Classic Delicious New York Cheesecake<br><br>CATEGORY: Cheesecake<br><br>KEYWORDS: Dessert, Weeknight, For Large Groups, < 4 Hours<br><br>TOOLS: pan, mixing bowl, warm oven, mixer, refrigerator<br><br>INGREDIENTS: graham cracker crumbs, cream cheese, eggs, sour cream, butter, sugar, vanilla<br><br>INSTRUCTIONS: <br>Preheat oven to 450 degrees.<br>To make the crust, mix graham crackers crumbs, butter, and 2 tablespoons of sugar in bowl.<br>Press mixture in bottom and sides of 9 inch springform pan.<br>In mixing bowl, beat cream cheese and remaining sugar for 2 minutes.<br>Add eggs and vanilla to mixture and mix until well blended.<br>Then stir or fold in sour cream.<br>Pour mixture in crust filled pan and bake for 10 minutes.<br>Then reduce to 200 degrees to bake for 45 minutes.<br>From here the cheese cake just needs to be chilled, but I recommend doing the following step if you have a few extra hours- Leave in warm oven, once you turn it off but leave door slightly open.<br>Let sit and cool for 2 hours and remove from oven.<br>Remove sides ...</code> | <code>NAME: Lemon Ricotta Cheesecake Delight<br><br>CATEGORY: Cheesecake<br><br>KEYWORDS: Dessert, Weeknight, For Large Groups, < 4 Hours, Citrus<br><br>TOOLS: pan, mixing bowl, warm oven, mixer, refrigerator, zester<br><br>INGREDIENTS: gluten-free graham cracker crumbs, ricotta cheese, eggs, Greek yogurt, butter, sugar, vanilla extract, lemon zest, lemon juice<br><br>INSTRUCTIONS:<br>Preheat oven to 450 degrees Fahrenheit.<br>To make the crust, mix gluten-free graham cracker crumbs, melted butter, and 2 tablespoons of sugar in bowl.<br>Press mixture firmly in bottom and partially up the sides of a 9 inch springform pan.<br>In a large mixing bowl, beat ricotta cheese and remaining sugar for 3 minutes until light and fluffy.<br>Add eggs, vanilla extract, lemon zest, and lemon juice to mixture; mix until just combined. Avoid overmixing.<br>Gently fold in Greek yogurt.<br>Pour mixture into the prepared crust-lined pan and bake for 12 minutes.<br>Reduce oven temperature to 225 degrees Fahrenheit and continue baking for 40 minutes, or until the edge...</code> |
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
+   ```json
+   {
+       "scale": 20.0,
+       "similarity_fct": "cos_sim"
+   }
+   ```
+
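
To make the loss parameters above concrete: with in-batch negatives, each `sentence_0` is scored against every `sentence_1` in the batch using cosine similarity multiplied by `scale`, and cross-entropy pushes the matching pair to score highest. The pure-Python sketch below illustrates that idea on toy two-dimensional vectors. It is not the library implementation, and the function names are chosen for this example.

```python
import math


def cos_sim(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def multiple_negatives_ranking_loss(anchors, positives, scale=20.0):
    """Mean cross-entropy where positives[i] is the correct match for anchors[i]
    and every other positive in the batch acts as a negative."""
    losses = []
    for i, anchor in enumerate(anchors):
        logits = [scale * cos_sim(anchor, positive) for positive in positives]
        # Log-sum-exp with the maximum subtracted for numerical stability.
        max_logit = max(logits)
        log_sum_exp = max_logit + math.log(
            sum(math.exp(logit - max_logit) for logit in logits)
        )
        losses.append(log_sum_exp - logits[i])
    return sum(losses) / len(losses)


# Toy batch of two pairs (assumption: real batches here hold 64 pairs).
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[1.0, 0.1], [0.1, 1.0]]  # each positive points toward its anchor
swapped = [[0.1, 1.0], [1.0, 0.1]]  # positives paired with the wrong anchor

loss_aligned = multiple_negatives_ranking_loss(anchors, aligned)
loss_swapped = multiple_negatives_ranking_loss(anchors, swapped)
print(loss_aligned, loss_swapped)
```

The `scale` of 20.0 sharpens the softmax: cosine scores lie in [-1, 1], so without scaling the loss could not get close to zero even for well-separated pairs.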
+ ### Training Hyperparameters
+ #### Non-Default Hyperparameters
+
+ - `per_device_train_batch_size`: 64
+ - `per_device_eval_batch_size`: 64
+ - `fp16`: True
+ - `multi_dataset_batch_sampler`: round_robin
+
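
These settings, together with the dataset size of 121,408 and the `num_train_epochs`, `learning_rate`, `lr_scheduler_type` and `warmup_steps` values from the full hyperparameter list, determine the length of the run. The sketch below works through that arithmetic. It assumes training on a single device (the commit does not state the device count) and uses the reported `gradient_accumulation_steps` of 1.

```python
import math

dataset_size = 121_408            # from the model card
per_device_train_batch_size = 64  # non-default hyperparameter
num_train_epochs = 3              # from the full hyperparameter list
learning_rate = 5e-05             # from the full hyperparameter list
num_devices = 1                   # assumption: not stated in the commit

# dataloader_drop_last is False, so a partial final batch would still count.
steps_per_epoch = math.ceil(
    dataset_size / (per_device_train_batch_size * num_devices)
)
total_steps = steps_per_epoch * num_train_epochs


def linear_lr(step):
    """Linear decay from learning_rate to 0 with no warmup (warmup_steps = 0)."""
    return learning_rate * max(0.0, (total_steps - step) / total_steps)


print(steps_per_epoch)           # 1897
print(total_steps)               # 5691
print(linear_lr(0))              # 5e-05
print(linear_lr(total_steps))    # 0.0
```

With more devices the effective batch size grows and the step counts shrink proportionally, which also changes how many in-batch negatives each example sees.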
+ #### All Hyperparameters
+ <details><summary>Click to expand</summary>
+
+ - `overwrite_output_dir`: False
+ - `do_predict`: False
+ - `eval_strategy`: no
+ - `prediction_loss_only`: True
+ - `per_device_train_batch_size`: 64
+ - `per_device_eval_batch_size`: 64
+ - `per_gpu_train_batch_size`: None
+ - `per_gpu_eval_batch_size`: None
+ - `gradient_accumulation_steps`: 1
+ - `eval_accumulation_steps`: None
+ - `torch_empty_cache_steps`: None
+ - `learning_rate`: 5e-05
+ - `weight_decay`: 0.0
+ - `adam_beta1`: 0.9
+ - `adam_beta2`: 0.999
+ - `adam_epsilon`: 1e-08
+ - `max_grad_norm`: 1
+ - `num_train_epochs`: 3
+ - `max_steps`: -1
+ - `lr_scheduler_type`: linear
+ - `lr_scheduler_kwargs`: {}
+ - `warmup_ratio`: 0.0
+ - `warmup_steps`: 0
+ - `log_level`: passive
+ - `log_level_replica`: warning
+ - `log_on_each_node`: True
+ - `logging_nan_inf_filter`: True
+ - `save_safetensors`: True
+ - `save_on_each_node`: False
+ - `save_only_model`: False
+ - `restore_callback_states_from_checkpoint`: False
+ - `no_cuda`: False
+ - `use_cpu`: False
+ - `use_mps_device`: False
+ - `seed`: 42
+ - `data_seed`: None
+ - `jit_mode_eval`: False
+ - `use_ipex`: False
+ - `bf16`: False
+ - `fp16`: True
+ - `fp16_opt_level`: O1
+ - `half_precision_backend`: auto
+ - `bf16_full_eval`: False
+ - `fp16_full_eval`: False
+ - `tf32`: None
+ - `local_rank`: 0
+ - `ddp_backend`: None
+ - `tpu_num_cores`: None
+ - `tpu_metrics_debug`: False
+ - `debug`: []
+ - `dataloader_drop_last`: False
+ - `dataloader_num_workers`: 0
+ - `dataloader_prefetch_factor`: None
+ - `past_index`: -1
+ - `disable_tqdm`: False
+ - `remove_unused_columns`: True
+ - `label_names`: None
+ - `load_best_model_at_end`: False
+ - `ignore_data_skip`: False
+ - `fsdp`: []
+ - `fsdp_min_num_params`: 0
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
+ - `tp_size`: 0
+ - `fsdp_transformer_layer_cls_to_wrap`: None
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
+ - `deepspeed`: None
+ - `label_smoothing_factor`: 0.0
+ - `optim`: adamw_torch
+ - `optim_args`: None
+ - `adafactor`: False
+ - `group_by_length`: False
+ - `length_column_name`: length
+ - `ddp_find_unused_parameters`: None
+ - `ddp_bucket_cap_mb`: None
+ - `ddp_broadcast_buffers`: False
+ - `dataloader_pin_memory`: True
+ - `dataloader_persistent_workers`: False
+ - `skip_memory_metrics`: True
+ - `use_legacy_prediction_loop`: False
+ - `push_to_hub`: False
+ - `resume_from_checkpoint`: None
+ - `hub_model_id`: None
+ - `hub_strategy`: every_save
+ - `hub_private_repo`: None
+ - `hub_always_push`: False
+ - `gradient_checkpointing`: False
+ - `gradient_checkpointing_kwargs`: None
+ - `include_inputs_for_metrics`: False
+ - `include_for_metrics`: []
+ - `eval_do_concat_batches`: True
+ - `fp16_backend`: auto
+ - `push_to_hub_model_id`: None
+ - `push_to_hub_organization`: None
+ - `mp_parameters`: 
+ - `auto_find_batch_size`: False
+ - `full_determinism`: False
+ - `torchdynamo`: None
+ - `ray_scope`: last
+ - `ddp_timeout`: 1800
+ - `torch_compile`: False
+ - `torch_compile_backend`: None
+ - `torch_compile_mode`: None
+ - `dispatch_batches`: None
+ - `split_batches`: None
+ - `include_tokens_per_second`: False
+ - `include_num_input_tokens_seen`: False
+ - `neftune_noise_alpha`: None
+ - `optim_target_modules`: None
+ - `batch_eval_metrics`: False
+ - `eval_on_start`: False
+ - `use_liger_kernel`: False
+ - `eval_use_gather_object`: False
+ - `average_tokens_across_devices`: False
+ - `prompts`: None
+ - `batch_sampler`: batch_sampler
+ - `multi_dataset_batch_sampler`: round_robin
+
+ </details>
+
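The combination `lr_scheduler_type: linear`, `warmup_steps: 0` and `learning_rate: 5e-05` listed above describes a learning rate that starts at its peak and decays linearly to zero over the run. A minimal sketch of that schedule follows; the function name and the total step count of 6000 are hypothetical values chosen for illustration, since the exact number of optimizer steps is not stated in the hyperparameter list.

```python
def linear_lr(step, total_steps, base_lr=5e-5, warmup_steps=0):
    """Learning rate at a given optimizer step under linear warmup and decay.

    With warmup_steps=0 (as in this run) the warmup branch is never taken
    and the rate simply falls from base_lr to 0 over total_steps.
    """
    if step < warmup_steps:
        return base_lr * (step / max(1, warmup_steps))
    remaining = max(0, total_steps - step)
    return base_lr * (remaining / max(1, total_steps - warmup_steps))


TOTAL_STEPS = 6000  # hypothetical run length, not taken from the model card

for step in (0, 1500, 3000, 4500, 6000):
    print(step, linear_lr(step, TOTAL_STEPS))
```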
+ ### Training Logs
+ | Epoch  | Step | Training Loss |
+ |:------:|:----:|:-------------:|
+ | 0.2636 | 500  | 0.0583        |
+ | 0.5271 | 1000 | 0.0017        |
+ | 0.7907 | 1500 | 0.001         |
+ | 1.0543 | 2000 | 0.0008        |
+ | 1.3179 | 2500 | 0.0005        |
+ | 1.5814 | 3000 | 0.0006        |
+ | 1.8450 | 3500 | 0.0004        |
+ | 2.1086 | 4000 | 0.0005        |
+ | 2.3722 | 4500 | 0.0003        |
+ | 2.6357 | 5000 | 0.0003        |
+ | 2.8993 | 5500 | 0.0003        |
+
+
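The epoch fractions in the training log make it possible to estimate the size of the run. Dividing each logged step by its epoch value gives the number of optimizer steps per epoch, and multiplying by the batch size gives an upper bound on the number of training pairs. The sketch below assumes a single device and `gradient_accumulation_steps: 1`, so that one step consumes one batch of 64; the variable names are illustrative, and the logged epoch values are rounded to four decimals, so the result is an estimate.

```python
from statistics import median

# (epoch, step) pairs copied from the Training Logs table.
LOGGED = [
    (0.2636, 500), (0.5271, 1000), (0.7907, 1500), (1.0543, 2000),
    (1.3179, 2500), (1.5814, 3000), (1.8450, 3500), (2.1086, 4000),
    (2.3722, 4500), (2.6357, 5000), (2.8993, 5500),
]
PER_DEVICE_BATCH = 64   # per_device_train_batch_size
NUM_EPOCHS = 3          # num_train_epochs

# Each row gives an independent estimate; the median damps rounding noise.
steps_per_epoch = round(median(step / epoch for epoch, step in LOGGED))

# Upper bound: the final batch of an epoch may be smaller than 64
# because dataloader_drop_last is False.
approx_samples = steps_per_epoch * PER_DEVICE_BATCH
total_steps = steps_per_epoch * NUM_EPOCHS

print(steps_per_epoch, approx_samples, total_steps)
```

Under those assumptions the run works out to roughly 1,897 steps per epoch, at most about 121,400 training pairs, and about 5,691 optimizer steps in total.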
712
+ ### Framework Versions
713
+ - Python: 3.11.9
714
+ - Sentence Transformers: 4.0.1
715
+ - Transformers: 4.50.2
716
+ - PyTorch: 2.4.0
717
+ - Accelerate: 1.5.2
718
+ - Datasets: 3.5.0
719
+ - Tokenizers: 0.21.1
720
+
721
+ ## Citation
722
+
723
+ ### BibTeX
724
+
725
+ #### Sentence Transformers
726
+ ```bibtex
727
+ @inproceedings{reimers-2019-sentence-bert,
728
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
729
+ author = "Reimers, Nils and Gurevych, Iryna",
730
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
731
+ month = "11",
732
+ year = "2019",
733
+ publisher = "Association for Computational Linguistics",
734
+ url = "https://arxiv.org/abs/1908.10084",
735
+ }
736
+ ```
737
+
738
+ #### MultipleNegativesRankingLoss
739
+ ```bibtex
740
+ @misc{henderson2017efficient,
741
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
742
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
743
+ year={2017},
744
+ eprint={1705.00652},
745
+ archivePrefix={arXiv},
746
+ primaryClass={cs.CL}
747
+ }
748
+ ```
749
+
750
+ <!--
751
+ ## Glossary
752
+
753
+ *Clearly define terms in order to be accessible across audiences.*
754
+ -->
755
+
756
+ <!--
757
+ ## Model Card Authors
758
+
759
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
760
+ -->
761
+
762
+ <!--
763
+ ## Model Card Contact
764
+
765
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
766
+ -->
config.json ADDED
@@ -0,0 +1,23 @@
+ {
+   "architectures": [
+     "MPNetModel"
+   ],
+   "attention_probs_dropout_prob": 0.1,
+   "bos_token_id": 0,
+   "eos_token_id": 2,
+   "hidden_act": "gelu",
+   "hidden_dropout_prob": 0.1,
+   "hidden_size": 768,
+   "initializer_range": 0.02,
+   "intermediate_size": 3072,
+   "layer_norm_eps": 1e-05,
+   "max_position_embeddings": 514,
+   "model_type": "mpnet",
+   "num_attention_heads": 12,
+   "num_hidden_layers": 12,
+   "pad_token_id": 1,
+   "relative_attention_num_buckets": 32,
+   "torch_dtype": "float32",
+   "transformers_version": "4.50.2",
+   "vocab_size": 30527
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
+ {
+   "__version__": {
+     "sentence_transformers": "4.0.1",
+     "transformers": "4.50.2",
+     "pytorch": "2.4.0"
+   },
+   "prompts": {},
+   "default_prompt_name": null,
+   "similarity_fn_name": "cosine"
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:75a7f70762e8050ac32ba7d23b4157dfb176997b438aac7d64a92f7a316489c8
+ size 437967672
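The three lines added for `model.safetensors` are a Git LFS pointer, not the weights themselves: the pointer records the spec version, the SHA-256 object id, and the size in bytes of the real file. A small sketch of reading such a pointer follows, together with a back-of-the-envelope parameter count. The helper name is hypothetical, and the estimate assumes every tensor is stored as 4-byte float32 (consistent with `"torch_dtype": "float32"` in `config.json`) and ignores the safetensors header, so it is an upper bound.

```python
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:75a7f70762e8050ac32ba7d23b4157dfb176997b438aac7d64a92f7a316489c8
size 437967672
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is a byte count
    return fields


pointer = parse_lfs_pointer(POINTER_TEXT)

# Upper bound on parameter count, assuming 4 bytes per float32 weight.
max_float32_params = pointer["size"] // 4
print(pointer["oid"], max_float32_params)
```

The result, a little under 110 million parameters, is in the range expected for a 12-layer, 768-dimensional MPNet encoder.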
modules.json ADDED
@@ -0,0 +1,20 @@
+ [
+   {
+     "idx": 0,
+     "name": "0",
+     "path": "",
+     "type": "sentence_transformers.models.Transformer"
+   },
+   {
+     "idx": 1,
+     "name": "1",
+     "path": "1_Pooling",
+     "type": "sentence_transformers.models.Pooling"
+   },
+   {
+     "idx": 2,
+     "name": "2",
+     "path": "2_Normalize",
+     "type": "sentence_transformers.models.Normalize"
+   }
+ ]
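`modules.json` chains three stages: a Transformer that produces one vector per token, a Pooling layer that collapses them into a single vector, and a Normalize layer that rescales that vector to unit length. Since `1_Pooling/config.json` in this commit enables `pooling_mode_mean_tokens`, the pooling step is a masked mean over token vectors. The sketch below illustrates the last two stages in plain Python; the function names and the toy 3-dimensional token vectors are assumptions for the example (the real model uses 768 dimensions), and it is not the library's code.

```python
import math


def mean_pool(token_embeddings, attention_mask):
    """Average token vectors, skipping positions where the mask is 0 (padding)."""
    dim = len(token_embeddings[0])
    kept = [vec for vec, m in zip(token_embeddings, attention_mask) if m == 1]
    return [sum(vec[d] for vec in kept) / len(kept) for d in range(dim)]


def l2_normalize(vec):
    """Rescale a vector to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


# Toy output of the Transformer stage: two real tokens and one padding token.
tokens = [[1.0, 2.0, 2.0], [3.0, 0.0, 2.0], [9.0, 9.0, 9.0]]
mask = [1, 1, 0]

pooled = mean_pool(tokens, mask)       # padding vector is ignored
embedding = l2_normalize(pooled)       # unit-length sentence embedding
print(pooled, embedding)
```

Because the final embeddings have unit length, the cosine similarity named in `config_sentence_transformers.json` reduces to a plain dot product between two embeddings.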
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
+ {
+   "max_seq_length": 384,
+   "do_lower_case": false
+ }
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
+ {
+   "bos_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "cls_token": {
+     "content": "<s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "eos_token": {
+     "content": "</s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "mask_token": {
+     "content": "<mask>",
+     "lstrip": true,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "pad_token": {
+     "content": "<pad>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "sep_token": {
+     "content": "</s>",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   },
+   "unk_token": {
+     "content": "[UNK]",
+     "lstrip": false,
+     "normalized": false,
+     "rstrip": false,
+     "single_word": false
+   }
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,73 @@
+ {
+   "added_tokens_decoder": {
+     "0": {
+       "content": "<s>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "1": {
+       "content": "<pad>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "2": {
+       "content": "</s>",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "3": {
+       "content": "<unk>",
+       "lstrip": false,
+       "normalized": true,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "104": {
+       "content": "[UNK]",
+       "lstrip": false,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     },
+     "30526": {
+       "content": "<mask>",
+       "lstrip": true,
+       "normalized": false,
+       "rstrip": false,
+       "single_word": false,
+       "special": true
+     }
+   },
+   "bos_token": "<s>",
+   "clean_up_tokenization_spaces": false,
+   "cls_token": "<s>",
+   "do_lower_case": true,
+   "eos_token": "</s>",
+   "extra_special_tokens": {},
+   "mask_token": "<mask>",
+   "max_length": 128,
+   "model_max_length": 384,
+   "pad_to_multiple_of": null,
+   "pad_token": "<pad>",
+   "pad_token_type_id": 0,
+   "padding_side": "right",
+   "sep_token": "</s>",
+   "stride": 0,
+   "strip_accents": null,
+   "tokenize_chinese_chars": true,
+   "tokenizer_class": "MPNetTokenizer",
+   "truncation_side": "right",
+   "truncation_strategy": "longest_first",
+   "unk_token": "[UNK]"
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff