MossaabDev commited on
Commit
264c407
·
verified ·
1 Parent(s): 6f8aff3

Upload 10 files

Browse files
README.md CHANGED
@@ -1,3 +1,365 @@
1
  ---
2
- license: apache-2.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - dense
7
+ - generated_from_trainer
8
+ - dataset_size:193
9
+ - loss:CosineSimilarityLoss
10
+ base_model: sentence-transformers/all-MiniLM-L6-v2
11
+ widget:
12
+ - source_sentence: I saw someone killing a cat in the street, I felt helpless and
13
+ sad
14
+ sentences:
15
+ - There is no god ?worthy of worship? except You. Glory be to You! I have certainly
16
+ done wrong.
17
+ - 'who say, when struck by a disaster, Surely to Allah we belong and to Him we
18
+ will ?all? return. '
19
+ - And never think that Allah is unaware of what the wrongdoers do. He only delays
20
+ them for a Day when eyes will stare [in horror]
21
+ - source_sentence: I am really sad, I hate my life and I wanna suicide
22
+ sentences:
23
+ - And never think that Allah is unaware of what the wrongdoers do. He only delays
24
+ them for a Day when eyes will stare [in horror]
25
+ - And when the ignorant address them, they say words of peace
26
+ - And seek help through patience and prayer. Indeed, it is a burden except for the
27
+ humble
28
+ - source_sentence: 'my cousin just died '
29
+ sentences:
30
+ - 'who say, when struck by a disaster, Surely to Allah we belong and to Him we
31
+ will ?all? return. '
32
+ - Again, no! Never obey him ?O Prophet?! Rather, ?continue to? prostrate and draw
33
+ near ?to Allah?.
34
+ - Do not do a favour expecting more ?in return?.
35
+ - source_sentence: tell me about peace
36
+ sentences:
37
+ - O mankind, eat from whatever is on earth [that is] lawful and good and do not
38
+ follow the footsteps of Satan. Indeed, he is to you a clear enemy
39
+ - And when the ignorant address them, they say words of peace
40
+ - And if you divorce them before consummating the marriage but after deciding on
41
+ a dowry, pay half of the dowry, unless the wife graciously waives it or the husband
42
+ graciously pays in full. Graciousness is closer to righteousness. And do not forget
43
+ kindness among yourselves. Surely Allah is All-Seeing of what you do.
44
+ - source_sentence: I lost my friend, he died and I miss him
45
+ sentences:
46
+ - Not equal are the good deed and the bad deed. Repel [evil] by that [deed] which
47
+ is better; and thereupon the one whom between you and him is enmity [will become]
48
+ as though he was a devoted friend
49
+ - Every soul will taste death. And you will only receive your full reward on the
50
+ Day of Judgment. Whoever is spared from the Fire and is admitted into Paradise
51
+ will ?indeed? triumph, whereas the life of this world is no more than the delusion
52
+ of enjoyment.
53
+ - Every soul will taste death, then to Us you will ?all? be returned.
54
+ pipeline_tag: sentence-similarity
55
+ library_name: sentence-transformers
56
  ---
57
+
58
+ # SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
59
+
60
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
61
+
62
+ ## Model Details
63
+
64
+ ### Model Description
65
+ - **Model Type:** Sentence Transformer
66
+ - **Base model:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) <!-- at revision c9745ed1d9f207416be6d2e6f8de32d1f16199bf -->
67
+ - **Maximum Sequence Length:** 256 tokens
68
+ - **Output Dimensionality:** 384 dimensions
69
+ - **Similarity Function:** Cosine Similarity
70
+ <!-- - **Training Dataset:** Unknown -->
71
+ <!-- - **Language:** Unknown -->
72
+ <!-- - **License:** Unknown -->
73
+
74
+ ### Model Sources
75
+
76
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
77
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
78
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
79
+
80
+ ### Full Model Architecture
81
+
82
+ ```
83
+ SentenceTransformer(
84
+ (0): Transformer({'max_seq_length': 256, 'do_lower_case': False, 'architecture': 'BertModel'})
85
+ (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
86
+ (2): Normalize()
87
+ )
88
+ ```
89
+
90
+ ## Usage
91
+
92
+ ### Direct Usage (Sentence Transformers)
93
+
94
+ First install the Sentence Transformers library:
95
+
96
+ ```bash
97
+ pip install -U sentence-transformers
98
+ ```
99
+
100
+ Then you can load this model and run inference.
101
+ ```python
102
+ from sentence_transformers import SentenceTransformer
103
+
104
+ # Download from the 🤗 Hub
105
+ model = SentenceTransformer("sentence_transformers_model_id")
106
+ # Run inference
107
+ sentences = [
108
+ 'I lost my friend, he died and I miss him',
109
+ 'Every soul will taste death. And you will only receive your full reward on the Day of Judgment. Whoever is spared from the Fire and is admitted into Paradise will ?indeed? triumph, whereas the life of this world is no more than the delusion of enjoyment.',
110
+ 'Every soul will taste death, then to Us you will ?all? be returned.',
111
+ ]
112
+ embeddings = model.encode(sentences)
113
+ print(embeddings.shape)
114
+ # [3, 384]
115
+
116
+ # Get the similarity scores for the embeddings
117
+ similarities = model.similarity(embeddings, embeddings)
118
+ print(similarities)
119
+ # tensor([[1.0000, 0.9817, 0.9870],
120
+ # [0.9817, 1.0000, 0.9923],
121
+ # [0.9870, 0.9923, 1.0000]])
122
+ ```
123
+
124
+ <!--
125
+ ### Direct Usage (Transformers)
126
+
127
+ <details><summary>Click to see the direct usage in Transformers</summary>
128
+
129
+ </details>
130
+ -->
131
+
132
+ <!--
133
+ ### Downstream Usage (Sentence Transformers)
134
+
135
+ You can finetune this model on your own dataset.
136
+
137
+ <details><summary>Click to expand</summary>
138
+
139
+ </details>
140
+ -->
141
+
142
+ <!--
143
+ ### Out-of-Scope Use
144
+
145
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
146
+ -->
147
+
148
+ <!--
149
+ ## Bias, Risks and Limitations
150
+
151
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
152
+ -->
153
+
154
+ <!--
155
+ ### Recommendations
156
+
157
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
158
+ -->
159
+
160
+ ## Training Details
161
+
162
+ ### Training Dataset
163
+
164
+ #### Unnamed Dataset
165
+
166
+ * Size: 193 training samples
167
+ * Columns: <code>sentence_0</code>, <code>sentence_1</code>, and <code>label</code>
168
+ * Approximate statistics based on the first 193 samples:
169
+ | | sentence_0 | sentence_1 | label |
170
+ |:--------|:----------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:--------------------------------------------------------------|
171
+ | type | string | string | float |
172
+ | details | <ul><li>min: 5 tokens</li><li>mean: 11.27 tokens</li><li>max: 30 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 35.92 tokens</li><li>max: 121 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.9</li><li>max: 1.0</li></ul> |
173
+ * Samples:
174
+ | sentence_0 | sentence_1 | label |
175
+ |:-------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------|
176
+ | <code>I am afraid that my son is not in the right way</code> | <code>And those who say: Our Lord! Grant us comfort in our spouses and our offspring, and make us leaders of the righteous</code> | <code>1.0</code> |
177
+ | <code>my cat just died</code> | <code>And We will surely test you with something of fear and hunger and a loss of wealth and lives and fruits, but give good tidings to the patient</code> | <code>1.0</code> |
178
+ | <code>I do not have childre</code> | <code>And those who say: Our Lord! Grant us comfort in our spouses and our offspring, and make us leaders of the righteous</code> | <code>1.0</code> |
179
+ * Loss: [<code>CosineSimilarityLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosinesimilarityloss) with these parameters:
180
+ ```json
181
+ {
182
+ "loss_fct": "torch.nn.modules.loss.MSELoss"
183
+ }
184
+ ```
185
+
186
+ ### Training Hyperparameters
187
+ #### Non-Default Hyperparameters
188
+
189
+ - `num_train_epochs`: 20
190
+ - `multi_dataset_batch_sampler`: round_robin
191
+
192
+ #### All Hyperparameters
193
+ <details><summary>Click to expand</summary>
194
+
195
+ - `overwrite_output_dir`: False
196
+ - `do_predict`: False
197
+ - `eval_strategy`: no
198
+ - `prediction_loss_only`: True
199
+ - `per_device_train_batch_size`: 8
200
+ - `per_device_eval_batch_size`: 8
201
+ - `per_gpu_train_batch_size`: None
202
+ - `per_gpu_eval_batch_size`: None
203
+ - `gradient_accumulation_steps`: 1
204
+ - `eval_accumulation_steps`: None
205
+ - `torch_empty_cache_steps`: None
206
+ - `learning_rate`: 5e-05
207
+ - `weight_decay`: 0.0
208
+ - `adam_beta1`: 0.9
209
+ - `adam_beta2`: 0.999
210
+ - `adam_epsilon`: 1e-08
211
+ - `max_grad_norm`: 1
212
+ - `num_train_epochs`: 20
213
+ - `max_steps`: -1
214
+ - `lr_scheduler_type`: linear
215
+ - `lr_scheduler_kwargs`: {}
216
+ - `warmup_ratio`: 0.0
217
+ - `warmup_steps`: 0
218
+ - `log_level`: passive
219
+ - `log_level_replica`: warning
220
+ - `log_on_each_node`: True
221
+ - `logging_nan_inf_filter`: True
222
+ - `save_safetensors`: True
223
+ - `save_on_each_node`: False
224
+ - `save_only_model`: False
225
+ - `restore_callback_states_from_checkpoint`: False
226
+ - `no_cuda`: False
227
+ - `use_cpu`: False
228
+ - `use_mps_device`: False
229
+ - `seed`: 42
230
+ - `data_seed`: None
231
+ - `jit_mode_eval`: False
232
+ - `bf16`: False
233
+ - `fp16`: False
234
+ - `fp16_opt_level`: O1
235
+ - `half_precision_backend`: auto
236
+ - `bf16_full_eval`: False
237
+ - `fp16_full_eval`: False
238
+ - `tf32`: None
239
+ - `local_rank`: 0
240
+ - `ddp_backend`: None
241
+ - `tpu_num_cores`: None
242
+ - `tpu_metrics_debug`: False
243
+ - `debug`: []
244
+ - `dataloader_drop_last`: False
245
+ - `dataloader_num_workers`: 0
246
+ - `dataloader_prefetch_factor`: None
247
+ - `past_index`: -1
248
+ - `disable_tqdm`: False
249
+ - `remove_unused_columns`: True
250
+ - `label_names`: None
251
+ - `load_best_model_at_end`: False
252
+ - `ignore_data_skip`: False
253
+ - `fsdp`: []
254
+ - `fsdp_min_num_params`: 0
255
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
256
+ - `fsdp_transformer_layer_cls_to_wrap`: None
257
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
258
+ - `parallelism_config`: None
259
+ - `deepspeed`: None
260
+ - `label_smoothing_factor`: 0.0
261
+ - `optim`: adamw_torch
262
+ - `optim_args`: None
263
+ - `adafactor`: False
264
+ - `group_by_length`: False
265
+ - `length_column_name`: length
266
+ - `project`: huggingface
267
+ - `trackio_space_id`: trackio
268
+ - `ddp_find_unused_parameters`: None
269
+ - `ddp_bucket_cap_mb`: None
270
+ - `ddp_broadcast_buffers`: False
271
+ - `dataloader_pin_memory`: True
272
+ - `dataloader_persistent_workers`: False
273
+ - `skip_memory_metrics`: True
274
+ - `use_legacy_prediction_loop`: False
275
+ - `push_to_hub`: False
276
+ - `resume_from_checkpoint`: None
277
+ - `hub_model_id`: None
278
+ - `hub_strategy`: every_save
279
+ - `hub_private_repo`: None
280
+ - `hub_always_push`: False
281
+ - `hub_revision`: None
282
+ - `gradient_checkpointing`: False
283
+ - `gradient_checkpointing_kwargs`: None
284
+ - `include_inputs_for_metrics`: False
285
+ - `include_for_metrics`: []
286
+ - `eval_do_concat_batches`: True
287
+ - `fp16_backend`: auto
288
+ - `push_to_hub_model_id`: None
289
+ - `push_to_hub_organization`: None
290
+ - `mp_parameters`:
291
+ - `auto_find_batch_size`: False
292
+ - `full_determinism`: False
293
+ - `torchdynamo`: None
294
+ - `ray_scope`: last
295
+ - `ddp_timeout`: 1800
296
+ - `torch_compile`: False
297
+ - `torch_compile_backend`: None
298
+ - `torch_compile_mode`: None
299
+ - `include_tokens_per_second`: False
300
+ - `include_num_input_tokens_seen`: no
301
+ - `neftune_noise_alpha`: None
302
+ - `optim_target_modules`: None
303
+ - `batch_eval_metrics`: False
304
+ - `eval_on_start`: False
305
+ - `use_liger_kernel`: False
306
+ - `liger_kernel_config`: None
307
+ - `eval_use_gather_object`: False
308
+ - `average_tokens_across_devices`: True
309
+ - `prompts`: None
310
+ - `batch_sampler`: batch_sampler
311
+ - `multi_dataset_batch_sampler`: round_robin
312
+ - `router_mapping`: {}
313
+ - `learning_rate_mapping`: {}
314
+
315
+ </details>
316
+
317
+ ### Training Logs
318
+ | Epoch | Step | Training Loss |
319
+ |:-----:|:----:|:-------------:|
320
+ | 20.0 | 500 | 0.0455 |
321
+
322
+
323
+ ### Framework Versions
324
+ - Python: 3.12.7
325
+ - Sentence Transformers: 5.1.1
326
+ - Transformers: 4.57.1
327
+ - PyTorch: 2.5.1
328
+ - Accelerate: 1.11.0
329
+ - Datasets: 4.3.0
330
+ - Tokenizers: 0.22.1
331
+
332
+ ## Citation
333
+
334
+ ### BibTeX
335
+
336
+ #### Sentence Transformers
337
+ ```bibtex
338
+ @inproceedings{reimers-2019-sentence-bert,
339
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
340
+ author = "Reimers, Nils and Gurevych, Iryna",
341
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
342
+ month = "11",
343
+ year = "2019",
344
+ publisher = "Association for Computational Linguistics",
345
+ url = "https://arxiv.org/abs/1908.10084",
346
+ }
347
+ ```
348
+
349
+ <!--
350
+ ## Glossary
351
+
352
+ *Clearly define terms in order to be accessible across audiences.*
353
+ -->
354
+
355
+ <!--
356
+ ## Model Card Authors
357
+
358
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
359
+ -->
360
+
361
+ <!--
362
+ ## Model Card Contact
363
+
364
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
365
+ -->
config.json ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "BertModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "classifier_dropout": null,
7
+ "dtype": "float32",
8
+ "gradient_checkpointing": false,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 384,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 1536,
14
+ "layer_norm_eps": 1e-12,
15
+ "max_position_embeddings": 512,
16
+ "model_type": "bert",
17
+ "num_attention_heads": 12,
18
+ "num_hidden_layers": 6,
19
+ "pad_token_id": 0,
20
+ "position_embedding_type": "absolute",
21
+ "transformers_version": "4.57.1",
22
+ "type_vocab_size": 2,
23
+ "use_cache": true,
24
+ "vocab_size": 30522
25
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "5.1.1",
4
+ "transformers": "4.57.1",
5
+ "pytorch": "2.5.1"
6
+ },
7
+ "model_type": "SentenceTransformer",
8
+ "prompts": {
9
+ "query": "",
10
+ "document": ""
11
+ },
12
+ "default_prompt_name": null,
13
+ "similarity_fn_name": "cosine"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:54909b627c8aa3dc3e717e8a772b491f32716f6e655aa81ea45192fc35bd5717
3
+ size 90864192
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 256,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": false,
45
+ "cls_token": "[CLS]",
46
+ "do_basic_tokenize": true,
47
+ "do_lower_case": true,
48
+ "extra_special_tokens": {},
49
+ "mask_token": "[MASK]",
50
+ "max_length": 128,
51
+ "model_max_length": 256,
52
+ "never_split": null,
53
+ "pad_to_multiple_of": null,
54
+ "pad_token": "[PAD]",
55
+ "pad_token_type_id": 0,
56
+ "padding_side": "right",
57
+ "sep_token": "[SEP]",
58
+ "stride": 0,
59
+ "strip_accents": null,
60
+ "tokenize_chinese_chars": true,
61
+ "tokenizer_class": "BertTokenizer",
62
+ "truncation_side": "right",
63
+ "truncation_strategy": "longest_first",
64
+ "unk_token": "[UNK]"
65
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff