Nicolas-Spettel commited on
Commit
32b138f
·
verified ·
1 Parent(s): ea1796b

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 384,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,500 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - dense
7
+ - generated_from_trainer
8
+ - dataset_size:268
9
+ - loss:MultipleNegativesRankingLoss
10
+ base_model: sentence-transformers/all-MiniLM-L6-v2
11
+ widget:
12
+ - source_sentence: Birdwatching for Beginners with Barbara Hannah Grufferman
13
+ sentences:
14
+ - Bird that breeds in the Arctic and sub-Arctic and migrates to the Antarctic
15
+ - '[birds chirping] [Discover Bird-Watching
16
+
17
+ with Barbara Hannah Grufferman] [♪ music and birds chirping ♪] We are in the park
18
+ and I''m meeting up
19
+
20
+ with Birder Bob, who''s an expert on birding. >> It''s so nice to meet you.
21
+
22
+ >> Yes. So, listen. I came prepared.
23
+
24
+ I have my backpack. >> I even have a little notepad in there.
25
+
26
+ >> All right. >> But what am I missing?
27
+
28
+ >> Ah! Binoculars. >> May I place these over your head?
29
+
30
+ >> Please do! Thank you. [laughing]
31
+
32
+ Let''s go! Vamos! So birding is becoming
33
+
34
+ the fastest-growing outdoor activity— ["Birding Bob" DeCandido, Ornithologist]
35
+
36
+ >> Yes.
37
+
38
+ >> —in the country. >> Why do you think that is? Why?
39
+
40
+ >> Well, you can watch birds from inside looking outside
41
+
42
+ at a bird feeder in your backyard, but you can also go to a local park. [♪ music
43
+ ♪]
44
+
45
+ Here we are in this giant woodland in the middle of the city. >> And it''s beautiful.
46
+
47
+ >> Yeah. And we''re getting the clean air. We''re looking up.
48
+
49
+ We can see birds up there. >> We can hear the cardinals singing.
50
+
51
+ >> Here''s one. They''re migrating north along a flyway here. >> What''s a flyway?
52
+
53
+ >> Oh my goodness! A flyway is like an aerial path for birds, and oftentimes it''s
54
+ tied to a coastline
55
+
56
+ or a mountain chain. So there are some very common birds around
57
+
58
+ that are easy to recognize. Here he is.
59
+
60
+ Here''s your red-bellied woodpecker right here. I''m going to use my binoculars.
61
+
62
+ [laughing] Ah! It seems to me that with birding you could just—depending upon
63
+ weather— put on a sweater, a jacket, whatever and get out there and walk and look
64
+
65
+ and you''ll be birding. >> Yes.
66
+
67
+ >> Is it more complicated than that? Do I need more equipment? If you want to
68
+ take it to the next level,
69
+
70
+ a pair of inexpensive binoculars and a book so you have a reference to go with.
71
+ It''s like a guidebook to birds. Yes, because this is your classroom, you know?
72
+ >> Right.
73
+
74
+ >> And if you can teach yourself, all the best way in the world to learn. [♪ music
75
+ ♪] I''m going to do some special sounds. This is called pishing, which is
76
+
77
+ [demonstrating pishing] There comes somebody on the left. Now, it seems counterintuitive
78
+
79
+ that you make sounds and birds come to the sound. >> Yes.
80
+
81
+ >> But birds come in because they operate as a team. [demonstrating pishing]
82
+
83
+ What a wonderful thing! Yes, yeah.
84
+
85
+ [pishing] >> Look, here comes something.
86
+
87
+ >> You never know what you''re going to find
88
+
89
+ as you turn a corner. And all you need
90
+
91
+ is your eyes and ears and curiosity. Should I go closer? [birds chirping] Oh!
92
+
93
+ [bird chirping] Hello, little cutie! [birds chirping] They like my chia energy
94
+ bars.
95
+
96
+ [laughing] [♪ music and birds chirping ♪] [♪ music and birds chirping ♪] This
97
+ is such a great way to get outside,
98
+
99
+ move your body, and be with nature. Bird-watching is a great way
100
+
101
+ to see the local area and then take it national. I loved my birding experience
102
+ today. It’s—a new world
103
+
104
+ has been opened up for me really. So I think as of today
105
+
106
+ I can call myself an official birder. [AARP, Real Possibilities]'
107
+ - Teal is a dark cyan color. Its name comes from that of a bird, the Eurasian teal
108
+ which has a similarly colored stripe on its head. The word is often used colloquially
109
+ to refer to shades of cyan in general.
110
+ - source_sentence: Corn bunting
111
+ sentences:
112
+ - The corn bunting is a passerine bird in the bunting family Emberizidae, a group
113
+ now separated by most modern authors from the finches, Fringillidae. This is a
114
+ large bunting with heavily streaked buff-brown plumage. The sexes are similar
115
+ but the male is slightly larger than the female. Its range extends from Western
116
+ Europe and North Africa across to northwestern China.
117
+ - The alpine swift is a species of swift found in Africa, southern Europe, and Asia.
118
+ They breed in mountains from southern Europe to the Himalayas. Like common swifts,
119
+ they are migratory; the southern European population winters further south in
120
+ southern Africa. They have very short legs which are used for clinging to vertical
121
+ surfaces. Like most swifts, they never settle voluntarily on the ground, spending
122
+ most of their lives in the air living on the insects they catch in their beaks.
123
+ - The little tern is a seabird of the family Laridae. It was first described by
124
+ the German naturalist Peter Simon Pallas in 1764 and given the binomial name Sterna
125
+ albifrons. It was moved to the genus Sternula when the genus Sterna was restricted
126
+ to the larger typical terns. The genus name Sternula is a diminutive of Sterna,
127
+ 'tern', while the specific name albifrons is from Latin albus, 'white', and frons,
128
+ 'forehead'.
129
+ - source_sentence: Lesser spotted woodpecker
130
+ sentences:
131
+ - The Mediterranean gull is a small gull. The scientific name is from Ancient Greek.
132
+ The genus Ichthyaetus is from ikhthus, "fish", and aetos, "eagle", and the specific
133
+ melanocephalus is from melas, "black", and -kephalos "-headed".
134
+ - The spotted flycatcher is a small passerine bird in the Old World flycatcher family.
135
+ It breeds in most of Europe and in the Palearctic to Siberia, and is migratory,
136
+ wintering in Africa and south western Asia. It is declining in parts of its range.
137
+ - The lesser spotted woodpecker is a member of the woodpecker family Picidae. It
138
+ was formerly assigned to the genus Dendrocopos. Some taxonomic authorities continue
139
+ to list the species there.
140
+ - source_sentence: Barnacle goose
141
+ sentences:
142
+ - The short-toed treecreeper is a small passerine bird found in woodlands through
143
+ much of the warmer regions of Europe and into north Africa. It has a generally
144
+ more southerly distribution than the other European treecreeper species, the common
145
+ treecreeper, with which it is easily confused where they both occur. The short-toed
146
+ treecreeper tends to prefer deciduous trees and lower altitudes than its relative
147
+ in these overlap areas. Although mainly sedentary, vagrants have occurred outside
148
+ the breeding range.
149
+ - The barnacle goose is a species of goose that belongs to the genus Branta of black
150
+ geese, which contains species with extensive black in the plumage, distinguishing
151
+ them from the grey Anser species. Despite its superficial similarity to the brant
152
+ goose, genetic analysis has shown its closest relative is the cackling goose.
153
+ - The grey plover or black-bellied plover is a large plover breeding in Arctic regions.
154
+ It is a long-distance migrant, with a nearly worldwide coastal distribution when
155
+ not breeding.
156
+ - source_sentence: White stork
157
+ sentences:
158
+ - The long-tailed duck is a medium-sized sea duck that breeds in the tundra and
159
+ taiga regions of the arctic and winters along the northern coastlines of the Atlantic
160
+ and Pacific Oceans. It is the only member of the genus Clangula.
161
+ - The white stork is a large bird in the stork family, Ciconiidae. Its plumage is
162
+ mainly white, with black on the bird's wings. Adults have long red legs and long
163
+ pointed red beaks, and measure on average 100–115 cm (39–45 in) from beak tip
164
+ to end of tail, with a 155–215 cm (61–85 in) wingspan. The two subspecies, which
165
+ differ slightly in size, breed in Europe north to Finland, northwestern Africa,
166
+ Palearctic east to southern Kazakhstan and southern Africa. The white stork is
167
+ a long-distance migrant, wintering in Africa from tropical Sub-Saharan Africa
168
+ to as far south as South Africa, or on the Indian subcontinent. When migrating
169
+ between Europe and Africa, it avoids crossing the Mediterranean Sea and detours
170
+ via the Levant in the east or the Strait of Gibraltar in the west, because the
171
+ air thermals on which it depends for soaring do not form over water.
172
+ - 'The shovelers are four species of dabbling ducks in the genus Spatula with long,
173
+ broad spatula-shaped beaks:Red shoveler Spatula platalea
174
+
175
+ Cape shoveler Spatula smithii
176
+
177
+ Australasian shoveler Spatula rhynchotis
178
+
179
+ Northern shoveler Spatula clypeata'
180
+ pipeline_tag: sentence-similarity
181
+ library_name: sentence-transformers
182
+ ---
183
+
184
+ # SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
185
+
186
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
187
+
188
+ ## Model Details
189
+
190
+ ### Model Description
191
+ - **Model Type:** Sentence Transformer
192
+ - **Base model:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) <!-- at revision c9745ed1d9f207416be6d2e6f8de32d1f16199bf -->
193
+ - **Maximum Sequence Length:** 256 tokens
194
+ - **Output Dimensionality:** 384 dimensions
195
+ - **Similarity Function:** Cosine Similarity
196
+ <!-- - **Training Dataset:** Unknown -->
197
+ <!-- - **Language:** Unknown -->
198
+ <!-- - **License:** Unknown -->
199
+
200
+ ### Model Sources
201
+
202
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
203
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
204
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
205
+
206
+ ### Full Model Architecture
207
+
208
+ ```
209
+ SentenceTransformer(
210
+ (0): Transformer({'max_seq_length': 256, 'do_lower_case': False, 'architecture': 'BertModel'})
211
+ (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
212
+ (2): Normalize()
213
+ )
214
+ ```
215
+
216
+ ## Usage
217
+
218
+ ### Direct Usage (Sentence Transformers)
219
+
220
+ First install the Sentence Transformers library:
221
+
222
+ ```bash
223
+ pip install -U sentence-transformers
224
+ ```
225
+
226
+ Then you can load this model and run inference.
227
+ ```python
228
+ from sentence_transformers import SentenceTransformer
229
+
230
+ # Download from the 🤗 Hub
231
+ model = SentenceTransformer("Nicolas-Spettel/bird-qa-model")
232
+ # Run inference
233
+ sentences = [
234
+ 'White stork',
235
+ "The white stork is a large bird in the stork family, Ciconiidae. Its plumage is mainly white, with black on the bird's wings. Adults have long red legs and long pointed red beaks, and measure on average 100–115\xa0cm (39–45\xa0in) from beak tip to end of tail, with a 155–215\xa0cm (61–85\xa0in) wingspan. The two subspecies, which differ slightly in size, breed in Europe north to Finland, northwestern Africa, Palearctic east to southern Kazakhstan and southern Africa. The white stork is a long-distance migrant, wintering in Africa from tropical Sub-Saharan Africa to as far south as South Africa, or on the Indian subcontinent. When migrating between Europe and Africa, it avoids crossing the Mediterranean Sea and detours via the Levant in the east or the Strait of Gibraltar in the west, because the air thermals on which it depends for soaring do not form over water.",
236
+ 'The long-tailed duck is a medium-sized sea duck that breeds in the tundra and taiga regions of the arctic and winters along the northern coastlines of the Atlantic and Pacific Oceans. It is the only member of the genus Clangula.',
237
+ ]
238
+ embeddings = model.encode(sentences)
239
+ print(embeddings.shape)
240
+ # [3, 384]
241
+
242
+ # Get the similarity scores for the embeddings
243
+ similarities = model.similarity(embeddings, embeddings)
244
+ print(similarities)
245
+ # tensor([[1.0000, 0.7092, 0.0837],
246
+ # [0.7092, 1.0000, 0.1957],
247
+ # [0.0837, 0.1957, 1.0000]])
248
+ ```
249
+
250
+ <!--
251
+ ### Direct Usage (Transformers)
252
+
253
+ <details><summary>Click to see the direct usage in Transformers</summary>
254
+
255
+ </details>
256
+ -->
257
+
258
+ <!--
259
+ ### Downstream Usage (Sentence Transformers)
260
+
261
+ You can finetune this model on your own dataset.
262
+
263
+ <details><summary>Click to expand</summary>
264
+
265
+ </details>
266
+ -->
267
+
268
+ <!--
269
+ ### Out-of-Scope Use
270
+
271
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
272
+ -->
273
+
274
+ <!--
275
+ ## Bias, Risks and Limitations
276
+
277
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
278
+ -->
279
+
280
+ <!--
281
+ ### Recommendations
282
+
283
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
284
+ -->
285
+
286
+ ## Training Details
287
+
288
+ ### Training Dataset
289
+
290
+ #### Unnamed Dataset
291
+
292
+ * Size: 268 training samples
293
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
294
+ * Approximate statistics based on the first 268 samples:
295
+ | | sentence_0 | sentence_1 |
296
+ |:--------|:---------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
297
+ | type | string | string |
298
+ | details | <ul><li>min: 3 tokens</li><li>mean: 5.63 tokens</li><li>max: 16 tokens</li></ul> | <ul><li>min: 15 tokens</li><li>mean: 94.23 tokens</li><li>max: 256 tokens</li></ul> |
299
+ * Samples:
300
+ | sentence_0 | sentence_1 |
301
+ |:--------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
302
+ | <code>Corn bunting</code> | <code>The corn bunting is a passerine bird in the bunting family Emberizidae, a group now separated by most modern authors from the finches, Fringillidae. This is a large bunting with heavily streaked buff-brown plumage. The sexes are similar but the male is slightly larger than the female. Its range extends from Western Europe and North Africa across to northwestern China.</code> |
303
+ | <code>Water pipit</code> | <code>The water pipit is a small passerine bird which breeds in the mountains of Southern Europe and the Palearctic eastwards to China. It is a short-distance migrant; many birds move to lower altitudes or wet open lowlands in winter.</code> |
304
+ | <code>Marsh tit</code> | <code>The marsh tit is a Eurasian passerine bird in the tit family Paridae and genus Poecile, closely related to the willow tit, Père David's and Songar tits. It is a small bird, around 12 cm (4.7 in) long and weighing 12 g (0.42 oz), with a black crown and nape, pale cheeks, brown back and greyish-brown wings and tail. Between 8 and 11 subspecies are recognised. Its close resemblance to the willow tit can cause identification problems, especially in the United Kingdom where the local subspecies of the two are very similar: they were not recognised as separate species until 1897.</code> |
305
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
306
+ ```json
307
+ {
308
+ "scale": 20.0,
309
+ "similarity_fct": "cos_sim",
310
+ "gather_across_devices": false
311
+ }
312
+ ```
313
+
314
+ ### Training Hyperparameters
315
+ #### Non-Default Hyperparameters
316
+
317
+ - `per_device_train_batch_size`: 32
318
+ - `per_device_eval_batch_size`: 32
319
+ - `num_train_epochs`: 2
320
+ - `multi_dataset_batch_sampler`: round_robin
321
+
322
+ #### All Hyperparameters
323
+ <details><summary>Click to expand</summary>
324
+
325
+ - `overwrite_output_dir`: False
326
+ - `do_predict`: False
327
+ - `eval_strategy`: no
328
+ - `prediction_loss_only`: True
329
+ - `per_device_train_batch_size`: 32
330
+ - `per_device_eval_batch_size`: 32
331
+ - `per_gpu_train_batch_size`: None
332
+ - `per_gpu_eval_batch_size`: None
333
+ - `gradient_accumulation_steps`: 1
334
+ - `eval_accumulation_steps`: None
335
+ - `torch_empty_cache_steps`: None
336
+ - `learning_rate`: 5e-05
337
+ - `weight_decay`: 0.0
338
+ - `adam_beta1`: 0.9
339
+ - `adam_beta2`: 0.999
340
+ - `adam_epsilon`: 1e-08
341
+ - `max_grad_norm`: 1
342
+ - `num_train_epochs`: 2
343
+ - `max_steps`: -1
344
+ - `lr_scheduler_type`: linear
345
+ - `lr_scheduler_kwargs`: {}
346
+ - `warmup_ratio`: 0.0
347
+ - `warmup_steps`: 0
348
+ - `log_level`: passive
349
+ - `log_level_replica`: warning
350
+ - `log_on_each_node`: True
351
+ - `logging_nan_inf_filter`: True
352
+ - `save_safetensors`: True
353
+ - `save_on_each_node`: False
354
+ - `save_only_model`: False
355
+ - `restore_callback_states_from_checkpoint`: False
356
+ - `no_cuda`: False
357
+ - `use_cpu`: False
358
+ - `use_mps_device`: False
359
+ - `seed`: 42
360
+ - `data_seed`: None
361
+ - `jit_mode_eval`: False
362
+ - `use_ipex`: False
363
+ - `bf16`: False
364
+ - `fp16`: False
365
+ - `fp16_opt_level`: O1
366
+ - `half_precision_backend`: auto
367
+ - `bf16_full_eval`: False
368
+ - `fp16_full_eval`: False
369
+ - `tf32`: None
370
+ - `local_rank`: 0
371
+ - `ddp_backend`: None
372
+ - `tpu_num_cores`: None
373
+ - `tpu_metrics_debug`: False
374
+ - `debug`: []
375
+ - `dataloader_drop_last`: False
376
+ - `dataloader_num_workers`: 0
377
+ - `dataloader_prefetch_factor`: None
378
+ - `past_index`: -1
379
+ - `disable_tqdm`: False
380
+ - `remove_unused_columns`: True
381
+ - `label_names`: None
382
+ - `load_best_model_at_end`: False
383
+ - `ignore_data_skip`: False
384
+ - `fsdp`: []
385
+ - `fsdp_min_num_params`: 0
386
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
387
+ - `fsdp_transformer_layer_cls_to_wrap`: None
388
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
389
+ - `parallelism_config`: None
390
+ - `deepspeed`: None
391
+ - `label_smoothing_factor`: 0.0
392
+ - `optim`: adamw_torch_fused
393
+ - `optim_args`: None
394
+ - `adafactor`: False
395
+ - `group_by_length`: False
396
+ - `length_column_name`: length
397
+ - `ddp_find_unused_parameters`: None
398
+ - `ddp_bucket_cap_mb`: None
399
+ - `ddp_broadcast_buffers`: False
400
+ - `dataloader_pin_memory`: True
401
+ - `dataloader_persistent_workers`: False
402
+ - `skip_memory_metrics`: True
403
+ - `use_legacy_prediction_loop`: False
404
+ - `push_to_hub`: False
405
+ - `resume_from_checkpoint`: None
406
+ - `hub_model_id`: None
407
+ - `hub_strategy`: every_save
408
+ - `hub_private_repo`: None
409
+ - `hub_always_push`: False
410
+ - `hub_revision`: None
411
+ - `gradient_checkpointing`: False
412
+ - `gradient_checkpointing_kwargs`: None
413
+ - `include_inputs_for_metrics`: False
414
+ - `include_for_metrics`: []
415
+ - `eval_do_concat_batches`: True
416
+ - `fp16_backend`: auto
417
+ - `push_to_hub_model_id`: None
418
+ - `push_to_hub_organization`: None
419
+ - `mp_parameters`:
420
+ - `auto_find_batch_size`: False
421
+ - `full_determinism`: False
422
+ - `torchdynamo`: None
423
+ - `ray_scope`: last
424
+ - `ddp_timeout`: 1800
425
+ - `torch_compile`: False
426
+ - `torch_compile_backend`: None
427
+ - `torch_compile_mode`: None
428
+ - `include_tokens_per_second`: False
429
+ - `include_num_input_tokens_seen`: False
430
+ - `neftune_noise_alpha`: None
431
+ - `optim_target_modules`: None
432
+ - `batch_eval_metrics`: False
433
+ - `eval_on_start`: False
434
+ - `use_liger_kernel`: False
435
+ - `liger_kernel_config`: None
436
+ - `eval_use_gather_object`: False
437
+ - `average_tokens_across_devices`: False
438
+ - `prompts`: None
439
+ - `batch_sampler`: batch_sampler
440
+ - `multi_dataset_batch_sampler`: round_robin
441
+ - `router_mapping`: {}
442
+ - `learning_rate_mapping`: {}
443
+
444
+ </details>
445
+
446
+ ### Framework Versions
447
+ - Python: 3.13.7
448
+ - Sentence Transformers: 5.1.0
449
+ - Transformers: 4.56.1
450
+ - PyTorch: 2.8.0+cpu
451
+ - Accelerate: 1.10.1
452
+ - Datasets: 4.0.0
453
+ - Tokenizers: 0.22.0
454
+
455
+ ## Citation
456
+
457
+ ### BibTeX
458
+
459
+ #### Sentence Transformers
460
+ ```bibtex
461
+ @inproceedings{reimers-2019-sentence-bert,
462
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
463
+ author = "Reimers, Nils and Gurevych, Iryna",
464
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
465
+ month = "11",
466
+ year = "2019",
467
+ publisher = "Association for Computational Linguistics",
468
+ url = "https://arxiv.org/abs/1908.10084",
469
+ }
470
+ ```
471
+
472
+ #### MultipleNegativesRankingLoss
473
+ ```bibtex
474
+ @misc{henderson2017efficient,
475
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
476
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
477
+ year={2017},
478
+ eprint={1705.00652},
479
+ archivePrefix={arXiv},
480
+ primaryClass={cs.CL}
481
+ }
482
+ ```
483
+
484
+ <!--
485
+ ## Glossary
486
+
487
+ *Clearly define terms in order to be accessible across audiences.*
488
+ -->
489
+
490
+ <!--
491
+ ## Model Card Authors
492
+
493
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
494
+ -->
495
+
496
+ <!--
497
+ ## Model Card Contact
498
+
499
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
500
+ -->
config.json ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "BertModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "classifier_dropout": null,
7
+ "dtype": "float32",
8
+ "gradient_checkpointing": false,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 384,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 1536,
14
+ "layer_norm_eps": 1e-12,
15
+ "max_position_embeddings": 512,
16
+ "model_type": "bert",
17
+ "num_attention_heads": 12,
18
+ "num_hidden_layers": 6,
19
+ "pad_token_id": 0,
20
+ "position_embedding_type": "absolute",
21
+ "transformers_version": "4.56.1",
22
+ "type_vocab_size": 2,
23
+ "use_cache": true,
24
+ "vocab_size": 30522
25
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "5.1.0",
4
+ "transformers": "4.56.1",
5
+ "pytorch": "2.8.0+cpu"
6
+ },
7
+ "model_type": "SentenceTransformer",
8
+ "prompts": {
9
+ "query": "",
10
+ "document": ""
11
+ },
12
+ "default_prompt_name": null,
13
+ "similarity_fn_name": "cosine"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d5e25bbe0ea9982f67e9a42d15d484fb584c72c885b50484a54679f3b869a77
3
+ size 90864192
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 256,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": false,
45
+ "cls_token": "[CLS]",
46
+ "do_basic_tokenize": true,
47
+ "do_lower_case": true,
48
+ "extra_special_tokens": {},
49
+ "mask_token": "[MASK]",
50
+ "max_length": 128,
51
+ "model_max_length": 256,
52
+ "never_split": null,
53
+ "pad_to_multiple_of": null,
54
+ "pad_token": "[PAD]",
55
+ "pad_token_type_id": 0,
56
+ "padding_side": "right",
57
+ "sep_token": "[SEP]",
58
+ "stride": 0,
59
+ "strip_accents": null,
60
+ "tokenize_chinese_chars": true,
61
+ "tokenizer_class": "BertTokenizer",
62
+ "truncation_side": "right",
63
+ "truncation_strategy": "longest_first",
64
+ "unk_token": "[UNK]"
65
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff