frankwong2001 commited on
Commit
abac766
·
verified ·
1 Parent(s): 805dfa4

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,640 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - dense
7
+ - generated_from_trainer
8
+ - dataset_size:4524
9
+ - loss:MultipleNegativesRankingLoss
10
+ base_model: nomic-ai/modernbert-embed-base
11
+ widget:
12
+ - source_sentence: The Head of Engineering is at the forefront of new technology,
13
+ charting the port technology development and integration roadmaps. He/She works
14
+ with internal and external parties to invest and develop technology and infrastructure
15
+ solutions that meet the ports business objectives, while managing budgetary constraints.
16
+ He directs the use of new technology and equipment in the ports to drive greater
17
+ productivity and service excellence, while ensuring the high reliability of existing
18
+ port equipment through cost effective maintenance programmes. He is a core member
19
+ of the management team, contributes to the overall organisation strategy, inspires
20
+ a culture of process improvement to enhance workflow and efficiency, while mentoring
21
+ others in their work.
22
+ sentences:
23
+ - The Business Development Manager is responsible for enhancing the organization's
24
+ market presence and driving financial growth. He/She identifies and engages new
25
+ clients through networking, cold calling, advertising, and other strategies to
26
+ generate interest. He builds strong customer relationships, recognizes business
27
+ opportunities, negotiates and finalizes deals, and maintains a comprehensive understanding
28
+ of current market trends. He designs persuasive strategies and presentations to
29
+ win over potential clients. He may oversee the efforts of team members involved
30
+ in business development. Working in a fast-paced, dynamic environment, he frequently
31
+ travels to client locations and participates in networking events. He is proficient
32
+ with client relationship management and sales tools, as well as knowledgeable
33
+ about the organization's products and services, along with industry trends and
34
+ challenges. The Business Development Manager is self-driven and adept at establishing
35
+ clear and meaningful objectives. He demonstrates high resilience when facing obstacles
36
+ and appreciates the consultative selling approach, effectively leveraging marketing's
37
+ role in attracting, qualifying, and nurturing potential customers. He is articulate
38
+ and inventive in using his product and customer insights to secure deals.
39
+ - The Head of Engineering leads the advancement of new technologies and defines
40
+ the development and integration strategies for port technology. He/She collaborates
41
+ with both internal and external stakeholders to invest in and create technological
42
+ and infrastructural solutions that align with the business goals of the ports,
43
+ all while adhering to budgetary limits. He directs the implementation of innovative
44
+ technologies and equipment in the ports to boost productivity and service quality,
45
+ while also ensuring the dependability of current port equipment through economical
46
+ maintenance programs. As a vital member of the management team, he contributes
47
+ to the overarching strategy of the organization, fosters a culture of continuous
48
+ improvement to optimize workflow and efficiency, and mentors colleagues in their
49
+ professional development.
50
+ - The Chef de Cuisine is responsible for designing exquisite menus and overseeing
51
+ the kitchen staff to ensure high-quality meal preparation. He/She collaborates
52
+ with suppliers to source the freshest ingredients while managing kitchen inventory
53
+ and costs. The Chef de Cuisine also innovates culinary techniques and presents
54
+ dishes that enhance the dining experience, while ensuring the kitchen operates
55
+ smoothly during service. As a leader in the culinary team, he inspires creativity
56
+ and maintains standards of excellence in food presentation and flavor.
57
+ - source_sentence: The HSE Manager oversees all activities in the Health, Safety and
58
+ Environment (HSE) department and is responsible for providing technical expertise
59
+ on HSE issues to relevant stakeholders. He/She leads the development of the Workplace
60
+ Safety and Health (WSH) and Environmental Management System (EMS) frameworks,
61
+ and evaluates the organisations WSH and EMS systems to ensure compliance with
62
+ pertinent government regulations and organisational health, safety and environmental
63
+ guidelines. He reviews WSH and environmental accident and incident findings and
64
+ trends to recommend improvements. Furthermore, he coordinates the development
65
+ and maintenance of the organisations Major Hazard Installation (MHI) Safety Case.
66
+ The HSE Manager is a senior member of the organisations crisis management team
67
+ and manages the development of the organisations emergency response and crisis
68
+ management plans. He is responsible for managing the organisations Safe System
69
+ of Work (SSoW) framework to ensure that work activities are carried out safely.
70
+ In addition, he coaches and mentors HSE department personnel and drives departmental
71
+ performance to achieve the organisations HSE goals. The HSE Manager actively promotes
72
+ a safe workplace culture across the organisation. As a department manager, he
73
+ is required to have good leadership, interpersonal and resource management skills.
74
+ sentences:
75
+ - The Commodities Trader is responsible for daily trading operations, which involve
76
+ executing trades according to established plans and monitoring both portfolio
77
+ positions and market trends. He/She identifies potential opportunities on local
78
+ and regional levels that can improve portfolio performance. The role requires
79
+ maintaining and strengthening relationships with trading partners while possessing
80
+ a solid understanding of trading operations. With strong analytical and logical
81
+ skills, he develops insights into the commodity market that aids in optimizing
82
+ the portfolio and enhancing trading efficiency. He is resourceful, collaborative,
83
+ and possesses excellent negotiation abilities.
84
+ - The HSE Manager is responsible for overseeing all functions within the Health,
85
+ Safety and Environment (HSE) department and providing technical guidance on HSE
86
+ matters to relevant stakeholders. He/She leads the creation of the Workplace Safety
87
+ and Health (WSH) and Environmental Management System (EMS) frameworks and assesses
88
+ the organisation's WSH and EMS systems to ensure alignment with applicable government
89
+ regulations and organisational health, safety, and environmental standards. He
90
+ reviews findings and trends related to WSH and environmental incidents to suggest
91
+ improvements. Additionally, he coordinates the development and upkeep of the organisation's
92
+ Major Hazard Installation (MHI) Safety Case. As a key member of the organisation's
93
+ crisis management team, the HSE Manager manages the formulation of emergency response
94
+ and crisis management plans. He is also tasked with overseeing the organisation's
95
+ Safe System of Work (SSoW) framework to guarantee that work activities are conducted
96
+ safely. Moreover, he mentors and coaches personnel within the HSE department and
97
+ drives performance to meet the organisation's HSE objectives. The HSE Manager
98
+ is dedicated to fostering a culture of safety throughout the workplace. As a department
99
+ manager, he is expected to possess strong leadership, interpersonal, and resource
100
+ management skills.
101
+ - The HSE Coordinator manages various tasks within the Health, Safety, and Emergency
102
+ (HSE) division and provides operational support on emergency management issues
103
+ to different departments. He/She supervises the implementation of the Workplace
104
+ Safety and Health (WSH) and Environmental Compliance Framework (ECF) and reviews
105
+ the organisation's WSH and ECF strategies to ensure alignment with industry standards
106
+ and internal safety protocols. He analyzes workplace safety and emergency findings
107
+ to propose strategies. Furthermore, he oversees the revision and development of
108
+ the organisation's Major Hazard Awareness (MHA) Safety Protocol. The HSE Coordinator
109
+ is a member of the organisation's operations team and manages the execution of
110
+ the organisation's operational response and safety protocols. He is tasked with
111
+ handling the organisation's Safety Management System (SMS) framework to ensure
112
+ that all operational activities are executed efficiently. Additionally, he provides
113
+ training and guidance to staff within the HSE division and enhances departmental
114
+ productivity to achieve the organisation's operational goals. The HSE Coordinator
115
+ promotes an efficient work environment across the organisation. As a team leader,
116
+ he is required to have effective communication, team-building, and project management
117
+ skills.
118
+ - source_sentence: The Town Gas Plant Maintenance Senior Technical Officer plans the
119
+ schedules for the preventive, predictive and corrective maintenance of town gas
120
+ production plants and ancillaries to ensure that town gas is stored and produced
121
+ efficiently in the plant. He/She monitors works done by contractors to ensure
122
+ projects meet the, organisational requirements. He prepares the technical specifications
123
+ for tenders and supports in tender evaluations of large projects. He builds staff
124
+ capabilities through on-the-job training, He issues work orders for Permits-to-Work,
125
+ and supervises works according to Safe System of Work (SSoW) practices. In times
126
+ of emergency, he implements emergency response plans and relevant safety procedures,
127
+ and supervises the Emergency Response Team on site incident management. He works
128
+ in the gas plant facility containing equipment such as pumps, tanks and valves,
129
+ where there is high focus on safety. He has good interpersonal skills to be able
130
+ to supervise junior team members and contractors, and coordinate with the production
131
+ team. He is meticulous and systematic in performing maintenance procedures. He
132
+ is agile and calm in responding effectively to faults and outages.
133
+ sentences:
134
+ - The Town Gas Plant Maintenance Junior Technical Officer manages the schedules
135
+ for routine, scheduled, and unscheduled maintenance of town gas distribution facilities
136
+ and associated components to ensure that town gas is utilized and consumed effectively
137
+ in the distribution network. He/She reviews tasks executed by subcontractors to
138
+ confirm that initiatives align with the project guidelines. He drafts operational
139
+ outlines for proposals and assists in project assessments of minor installations.
140
+ He develops team skills through classroom training, issues notifications for Maintenance
141
+ Work Orders, and directs tasks in line with Safe Work Practices (SWP). In non-critical
142
+ situations, he applies standard procedures and basic safety protocols while assisting
143
+ the Response Team in site management. He works in the gas distribution area, which
144
+ features apparatus such as compressors, valves, and regulators, where there is
145
+ a notable emphasis on compliance. He has average communication skills to help
146
+ oversee novice employees and subcontractors, and liaises with the operations team.
147
+ He is casual and informal in executing maintenance tasks and is slow to react
148
+ to issues and interruptions.
149
+ - The Site Director/Head is tasked with guiding the manufacturing facility towards
150
+ its strategic goals by setting and communicating key performance indicators (KPIs),
151
+ promoting a collaborative culture among departments, and managing financial planning
152
+ and budgeting processes. He/She seeks out and identifies investment opportunities
153
+ to enhance manufacturing operations and improve facilities. Additionally, he mentors
154
+ and cultivates talent for future leadership roles while overseeing learning and
155
+ development, succession planning, and talent management initiatives. He ensures
156
+ compliance with Health, Safety and Environment (HSE) policies, international regulations,
157
+ and Current Good Manufacturing Practices (CGMPs) across the manufacturing site.
158
+ He is responsible for developing business continuity plans and leading responses
159
+ to significant incidents or events. The Site Director/Head holds overall accountability
160
+ for the manufacturing site's performance and is an inspiring, people-focused leader
161
+ dedicated to motivating large teams towards excellence. He possesses a strategic,
162
+ forward-thinking approach and a global perspective when making plans and decisions
163
+ for the organization.
164
+ - The Town Gas Plant Maintenance Senior Technical Officer is responsible for planning
165
+ the schedules for preventive, predictive, and corrective maintenance of town gas
166
+ production facilities and related equipment to ensure efficient storage and production.
167
+ He/She oversees the work performed by contractors to guarantee that all projects
168
+ comply with organizational standards. He prepares technical specifications for
169
+ tenders and assists in evaluating large project proposals. He enhances staff capabilities
170
+ through on-the-job training, issues work orders for Permits-to-Work, and supervises
171
+ operations in accordance with Safe System of Work (SSoW) practices. During emergencies,
172
+ he executes emergency response plans and relevant safety protocols while leading
173
+ the Emergency Response Team in on-site incident management. He operates in the
174
+ gas plant environment, which includes equipment like pumps, tanks, and valves,
175
+ with a strong emphasis on safety. He possesses excellent interpersonal skills
176
+ to effectively supervise junior team members and contractors, as well as coordinate
177
+ with the production team. He demonstrates meticulousness and systematic approaches
178
+ in maintenance tasks and remains agile and composed when addressing faults and
179
+ outages.
180
+ - source_sentence: The Waste and Recyclables Collection Executive assists with the
181
+ management of waste and recyclables collection operations. This includes overseeing
182
+ the management of organisational resources, collection routes, work procedures
183
+ and schedules, incidents and reports to the management. He/She is also required
184
+ to plan collection routes, compile and analyse data, recommend suitable operational
185
+ plans and/or equipment to improve work processes and service quality of the organisation.
186
+ He works in a waste management facility and performs site visits when necessary.
187
+ He is expected to communicate with his stakeholders and clients as part of his
188
+ role in performing operational duties. He is organised, responsive, approachable,
189
+ able to multi-task and capable of interacting with stakeholders.
190
+ sentences:
191
+ - The Waste and Recyclables Collection Executive is responsible for managing waste
192
+ and recyclables collection operations. This includes overseeing the management
193
+ of organizational resources, collection routes, work procedures, schedules, and
194
+ reporting incidents to management. He/She is also tasked with planning collection
195
+ routes, compiling and analyzing data, and recommending appropriate operational
196
+ plans and equipment to enhance work processes and service quality. He works in
197
+ a waste management facility and conducts site visits as needed. He is expected
198
+ to engage with stakeholders and clients while performing operational duties. He
199
+ is organized, responsive, approachable, capable of multi-tasking, and adept at
200
+ interacting with stakeholders.
201
+ - The Waste and Recyclables Management Coordinator handles the supervision of waste
202
+ management operations. This involves managing organizational logistics, delivery
203
+ routes, workflow protocols, schedules, and documenting incidents for review. He/She
204
+ is also responsible for strategizing delivery routes, gathering and interpreting
205
+ information, and suggesting effective logistical plans and tools to optimize workflow
206
+ and service standards. He operates in a waste processing center and performs inspections
207
+ when required. He is expected to liaise with his team and customers as part of
208
+ his operational responsibilities. He is structured, reactive, friendly, skilled
209
+ at multitasking, and proficient in communicating with clients.
210
+ - The Pastry Chef is responsible for inspecting the prepared pastries to ensure
211
+ that quality standards are upheld before the products are served. He/She innovates
212
+ new recipes to refresh menus and decorates pastries with various icings and toppings.
213
+ He is expected to oversee the daily operations of the pastry and baking kitchen
214
+ while planning continuous improvement initiatives within the team. He also suggests
215
+ enhancements to improve customer service performance. Well-groomed and resourceful,
216
+ he has excellent problem-solving abilities and maintains composure in high-pressure
217
+ situations. He should exhibit strong attention to detail, creativity, and leadership
218
+ qualities. He may be employed in specialist pastry shops or patisseries, as well
219
+ as restaurants and hotels. He should possess comprehensive knowledge of sanitation
220
+ principles, baking techniques, and nutrition principles, and is adept at collaborating
221
+ with multi-cultural teams.
222
+ - source_sentence: The Operations Risk and Control Manager is responsible for managing
223
+ risk and control activities for the organisation and ensuring compliance with
224
+ any applicable guidelines, laws and regulations. He/She will monitor high risk
225
+ operational and emerging risk incidents with the aim of strengthening the organisation's
226
+ control environment and improving control processes. He conducts investigations
227
+ to identify risk incidents and determine corrective actions, and develops incident
228
+ response and crisis management protocols to deal with potential emergencies. The
229
+ Operations Risk and Control Manager possesses analytical capabilities and a keen
230
+ eye for pinpointing sources of risks or potential crises. He is a quick thinker
231
+ who is able to make decisions under tight timelines so as to address and resolve
232
+ risk incidents as they arise and adapt to the changing regulatory environment.
233
+ sentences:
234
+ - The Operations Risk and Control Manager is tasked with overseeing risk and control
235
+ measures within the organization, ensuring adherence to relevant guidelines, laws,
236
+ and regulations. He/She will assess high-risk operational incidents and emerging
237
+ threats to enhance the control framework and refine control processes. He conducts
238
+ thorough investigations to pinpoint risk occurrences and formulate corrective
239
+ measures, while also developing incident response and crisis management strategies
240
+ for potential emergencies. The Operations Risk and Control Manager has strong
241
+ analytical skills and is adept at identifying sources of risk or potential crises.
242
+ He is a decisive thinker who can make timely decisions to address and resolve
243
+ risk incidents as they emerge, adapting to the evolving regulatory landscape.
244
+ - The Operations Compliance Manager is responsible for overseeing compliance and
245
+ audit processes for the organization while ensuring alignment with various industry
246
+ standards and practices. He/She will evaluate low-risk operational activities
247
+ and existing compliance issues to enhance the compliance framework and streamline
248
+ audit processes. He conducts reviews to assess compliance violations and suggests
249
+ improvements, while also creating compliance training and awareness programs for
250
+ all employees. The Operations Compliance Manager possesses strong organizational
251
+ skills and is effective in identifying areas of improvement or compliance gaps.
252
+ He is a strategic planner who can implement changes to enhance compliance measures
253
+ over time, adapting to the shifting market trends.
254
+ - The Arts Educators are responsible for designing, implementing, and evaluating
255
+ learning experiences while utilizing effective assessment techniques to ensure
256
+ that learners meet established standards. Their teaching is enriched by their
257
+ own artistic practice in their selected art form. With a solid grasp of effective
258
+ teaching methodologies and learning strategies, they skillfully adjust these approaches
259
+ to cater to specific contexts, student needs, and educational goals. They guide
260
+ learners in realizing their full potential in their craft and deepening their
261
+ understanding and appreciation of artistic endeavors. Arts Educators foster creativity
262
+ and equip students with the necessary tools to explore their ideas and imagination.
263
+ They deliver arts education programs across various settings, including schools,
264
+ universities, community centers, welfare organizations, and co-curricular activities,
265
+ serving a diverse range of students. They are committed to enhancing arts education
266
+ through the development and refinement of pedagogies, programs, and curricula.
267
+ Additionally, they actively engage with arts and arts education organizations
268
+ while mentoring emerging artists. They engage in self-reflection and adopt a critical
269
+ approach to their teaching and artistic practice, often developing a distinctive
270
+ teaching style that reflects their individuality.
271
+ datasets:
272
+ - frankwong2001/ssf-train-valid-full-synthetic-batch10
273
+ pipeline_tag: sentence-similarity
274
+ library_name: sentence-transformers
275
+ ---
276
+
277
+ # SentenceTransformer based on nomic-ai/modernbert-embed-base
278
+
279
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [nomic-ai/modernbert-embed-base](https://huggingface.co/nomic-ai/modernbert-embed-base) on the [ssf-train-valid-full-synthetic-batch10](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-batch10) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
280
+
281
+ ## Model Details
282
+
283
+ ### Model Description
284
+ - **Model Type:** Sentence Transformer
285
+ - **Base model:** [nomic-ai/modernbert-embed-base](https://huggingface.co/nomic-ai/modernbert-embed-base) <!-- at revision d556a88e332558790b210f7bdbe87da2fa94a8d8 -->
286
+ - **Maximum Sequence Length:** 8192 tokens
287
+ - **Output Dimensionality:** 768 dimensions
288
+ - **Similarity Function:** Cosine Similarity
289
+ - **Training Dataset:**
290
+ - [ssf-train-valid-full-synthetic-batch10](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-batch10)
291
+ <!-- - **Language:** Unknown -->
292
+ <!-- - **License:** Unknown -->
293
+
294
+ ### Model Sources
295
+
296
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
297
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
298
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
299
+
300
+ ### Full Model Architecture
301
+
302
+ ```
303
+ SentenceTransformer(
304
+ (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
305
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
306
+ (2): Normalize()
307
+ )
308
+ ```
309
+
310
+ ## Usage
311
+
312
+ ### Direct Usage (Sentence Transformers)
313
+
314
+ First install the Sentence Transformers library:
315
+
316
+ ```bash
317
+ pip install -U sentence-transformers
318
+ ```
319
+
320
+ Then you can load this model and run inference.
321
+ ```python
322
+ from sentence_transformers import SentenceTransformer
323
+
324
+ # Download from the 🤗 Hub
325
+ model = SentenceTransformer("frankwong2001/1_modernbert-embed-base")
326
+ # Run inference
327
+ sentences = [
328
+ "The Operations Risk and Control Manager is responsible for managing risk and control activities for the organisation and ensuring compliance with any applicable guidelines, laws and regulations. He/She will monitor high risk operational and emerging risk incidents with the aim of strengthening the organisation's control environment and improving control processes. He conducts investigations to identify risk incidents and determine corrective actions, and develops incident response and crisis management protocols to deal with potential emergencies. The Operations Risk and Control Manager possesses analytical capabilities and a keen eye for pinpointing sources of risks or potential crises. He is a quick thinker who is able to make decisions under tight timelines so as to address and resolve risk incidents as they arise and adapt to the changing regulatory environment.",
329
+ 'The Operations Risk and Control Manager is tasked with overseeing risk and control measures within the organization, ensuring adherence to relevant guidelines, laws, and regulations. He/She will assess high-risk operational incidents and emerging threats to enhance the control framework and refine control processes. He conducts thorough investigations to pinpoint risk occurrences and formulate corrective measures, while also developing incident response and crisis management strategies for potential emergencies. The Operations Risk and Control Manager has strong analytical skills and is adept at identifying sources of risk or potential crises. He is a decisive thinker who can make timely decisions to address and resolve risk incidents as they emerge, adapting to the evolving regulatory landscape.',
330
+ 'The Operations Compliance Manager is responsible for overseeing compliance and audit processes for the organization while ensuring alignment with various industry standards and practices. He/She will evaluate low-risk operational activities and existing compliance issues to enhance the compliance framework and streamline audit processes. He conducts reviews to assess compliance violations and suggests improvements, while also creating compliance training and awareness programs for all employees. The Operations Compliance Manager possesses strong organizational skills and is effective in identifying areas of improvement or compliance gaps. He is a strategic planner who can implement changes to enhance compliance measures over time, adapting to the shifting market trends.',
331
+ ]
332
+ embeddings = model.encode(sentences)
333
+ print(embeddings.shape)
334
+ # [3, 768]
335
+
336
+ # Get the similarity scores for the embeddings
337
+ similarities = model.similarity(embeddings, embeddings)
338
+ print(similarities)
339
+ # tensor([[1.0000, 0.9713, 0.4877],
340
+ # [0.9713, 1.0000, 0.4780],
341
+ # [0.4877, 0.4780, 1.0000]])
342
+ ```
343
+
344
+ <!--
345
+ ### Direct Usage (Transformers)
346
+
347
+ <details><summary>Click to see the direct usage in Transformers</summary>
348
+
349
+ </details>
350
+ -->
351
+
352
+ <!--
353
+ ### Downstream Usage (Sentence Transformers)
354
+
355
+ You can finetune this model on your own dataset.
356
+
357
+ <details><summary>Click to expand</summary>
358
+
359
+ </details>
360
+ -->
361
+
362
+ <!--
363
+ ### Out-of-Scope Use
364
+
365
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
366
+ -->
367
+
368
+ <!--
369
+ ## Bias, Risks and Limitations
370
+
371
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
372
+ -->
373
+
374
+ <!--
375
+ ### Recommendations
376
+
377
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
378
+ -->
379
+
380
+ ## Training Details
381
+
382
+ ### Training Dataset
383
+
384
+ #### ssf-train-valid-full-synthetic-batch10
385
+
386
+ * Dataset: [ssf-train-valid-full-synthetic-batch10](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-batch10) at [b687585](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-batch10/tree/b68758513f8ec1b0c3891bcd284e05a599f51bce)
387
+ * Size: 4,524 training samples
388
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
389
+ * Approximate statistics based on the first 1000 samples:
390
+ | | anchor | positive | negative |
391
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
392
+ | type | string | string | string |
393
+ | details | <ul><li>min: 58 tokens</li><li>mean: 168.51 tokens</li><li>max: 403 tokens</li></ul> | <ul><li>min: 58 tokens</li><li>mean: 162.99 tokens</li><li>max: 366 tokens</li></ul> | <ul><li>min: 19 tokens</li><li>mean: 136.01 tokens</li><li>max: 368 tokens</li></ul> |
394
+ * Samples:
395
+ | anchor | positive | negative |
396
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
397
+ | <code>The Multi-Utility Operations Team Leader leads the day-to-day power plant operations by assigning tasks to junior team members, performs high voltage switching operational works and drives the rectification of all major plant faults, defects and outages. He/She supervises the first line maintenance works. He develops staff capabilities through on-the-job training and coaching. He monitors Permits-to-Work procedures, and ensures works are done according to Safe System of Work (SSoW) practices. In times of emergency, he facilitates the implementation of emergency response plans and relevant safety procedures. He also supervises the Emergency Response Team on site incident management. He works at the power plant station and may be required to perform shift work. He possesses good leadership and interpersonal skills in leading the operations teams. He is also systematic and able to respond to situations quickly in times of faults or outages.</code> | <code>The Multi-Utility Operations Team Leader is responsible for managing the daily operations of the power plant by delegating tasks to junior team members, executing high voltage switching operations, and addressing all significant plant faults, defects, and outages. He/She oversees first line maintenance activities and enhances staff capabilities through on-the-job training and coaching. He monitors Permits-to-Work procedures to ensure compliance with Safe System of Work (SSoW) practices. In emergencies, he facilitates the execution of emergency response plans and relevant safety protocols, while also supervising the Emergency Response Team during on-site incidents. He works at the power plant station and may be required to perform shift work. He demonstrates strong leadership and interpersonal skills in guiding the operations teams and is systematic, responding swiftly to faults or outages.</code> | <code>The Multi-Utility Operations Team Supervisor manages the daily logistics for the distribution center by assigning tasks to assistant staff, oversees low voltage electrical installation projects, and addresses all minor warehouse issues and delays. He/She coordinates routine inventory checks and enhances staff efficiency through training sessions and workshops. He monitors compliance with shipping regulations and ensures operations adhere to standard operating procedures (SOP). In critical situations, he facilitates the execution of logistical plans and relevant operational protocols, while also supervising the Inventory Management Team during stock assessments. He works at the distribution center and may be required to perform regular office hours. He demonstrates excellent organizational and communication skills in managing the logistics teams and is methodical, adapting quickly to challenges or delays.</code> |
398
+ | <code>The Technician (Component Repair & OverhaulMechanical) performs maintenance, repair and overhaul (MRO) tasks for aircraft components in accordance with technical manuals and standard operating procedures (SOPs). He/She examines parts for maintenance, repair or replacement. He/She troubleshoots component defects and takes corrective actions to restore components to the desired performance requirements. He also performs special processes and repair of composite structures, and documents all completed tasks. He may be authorised by the organisation to perform quality control functions, including inspection of incoming materials and outgoing serviced items, and registration of non-conformances. He may also be authorised to perform level 1 non-destructive testing (NDT) functions under supervision, perform evaluations for acceptance or rejection of aircraft components, and record results as specified in the work instructions. He complies with airworthiness and legislative requirements, and t...</code> | <code>The Technician (Component Repair & Overhaul Mechanical) is responsible for performing maintenance, repair, and overhaul (MRO) activities on aircraft components according to technical manuals and standard operating procedures (SOPs). He/She inspects parts for maintenance, repair, or replacement needs, troubleshoots component defects, and implements corrective actions to ensure components meet performance standards. Additionally, he/she carries out special processes and repairs of composite structures while documenting all completed tasks. The technician may also be authorized to conduct quality control functions, such as inspecting incoming materials and outgoing serviced items, as well as registering non-conformances. Furthermore, he/she may perform level 1 non-destructive testing (NDT) functions under supervision, evaluate aircraft components for acceptance or rejection, and record results as outlined in work instructions. He/She adheres to airworthiness and legislative requirements, ...</code> | <code>The Chef prepares gourmet meals and creates unique recipes for a fine dining restaurant. He/She manages kitchen staff, ensures food safety standards are met, and collaborates with suppliers to source fresh ingredients. Additionally, he/she designs menus that highlight seasonal produce and oversees the presentation of dishes to enhance customer experience. The chef conducts food tastings and works to innovate culinary techniques, while maintaining a clean and organized kitchen environment. He/She may also participate in promotional events to showcase the restaurant's offerings and engage with guests.</code> |
399
+ | <code>The Relationship Management Director - Small and Medium Enterprises is responsible for defining strategies for team members to achieve mass sales acquisition. He/She provides oversight to due diligence, compliance and Anti-Money Laundering (AML) processes carried out by team members. He sets policies and guidelines for ongoing support processes pertaining to credit responsibilities. He guides his team to achieve their performance targets and ensures they have the training necessary to deliver on their responsibilities. The Relationship Management Director - Small and Medium Enterprises is a strong leader who provides mentoring and coaching to his team members to allow them to succeed in their roles. He is a strong communicator with internal and external stakeholders. He is always looking for opportunities to provide enhanced services to clients. He uses analytics and problem solving capabilities to foster an environment that will yield results. He is accountable for the defined standar...</code> | <code>The Relationship Management Director - Small and Medium Enterprises is tasked with developing strategies that enable team members to achieve significant sales growth. He/She supervises the due diligence, compliance, and Anti-Money Laundering (AML) procedures executed by the team. He establishes policies and guidelines for ongoing support processes related to credit responsibilities. He mentors his team to meet their performance goals and ensures they receive the necessary training to fulfill their duties. The Relationship Management Director - Small and Medium Enterprises is an effective leader who provides guidance and support to help his team thrive in their positions. He excels in communication with both internal and external stakeholders. He consistently seeks opportunities to enhance client services. He leverages analytics and problem-solving skills to create a results-oriented environment. He is responsible for upholding the standards he sets for his team.</code> | <code>The Relationship Management Director - Large Enterprises is responsible for creating strategies for team members to achieve substantial market share. He/She oversees the financial audits, regulatory compliance, and Anti-Bribery measures conducted by team members. He formulates policies and frameworks for ongoing management processes relating to financial responsibilities. He directs his team to exceed their sales targets and ensures they have the resources needed to perform their duties. The Relationship Management Director - Large Enterprises is a proactive leader who offers training and support to his team members to enable them to excel in their functions. He is an effective communicator with clients and vendors. He frequently identifies opportunities to improve operational efficiencies. He utilizes data analysis and strategic planning to cultivate an environment that fosters success. He is responsible for the established benchmarks he sets for his team.</code> |
400
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
401
+ ```json
402
+ {
403
+ "scale": 20.0,
404
+ "similarity_fct": "cos_sim",
405
+ "gather_across_devices": false
406
+ }
407
+ ```
408
+
409
+ ### Evaluation Dataset
410
+
411
+ #### ssf-train-valid-full-synthetic-batch10
412
+
413
+ * Dataset: [ssf-train-valid-full-synthetic-batch10](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-batch10) at [b687585](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-batch10/tree/b68758513f8ec1b0c3891bcd284e05a599f51bce)
414
+ * Size: 1,131 evaluation samples
415
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
416
+ * Approximate statistics based on the first 1000 samples:
417
+ | | anchor | positive | negative |
418
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
419
+ | type | string | string | string |
420
+ | details | <ul><li>min: 66 tokens</li><li>mean: 169.88 tokens</li><li>max: 349 tokens</li></ul> | <ul><li>min: 63 tokens</li><li>mean: 163.43 tokens</li><li>max: 329 tokens</li></ul> | <ul><li>min: 22 tokens</li><li>mean: 135.7 tokens</li><li>max: 327 tokens</li></ul> |
421
+ * Samples:
422
+ | anchor | positive | negative |
423
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
424
+ | <code>The Assistant Equipment Engineer applies engineering principles and techniques to support equipment engineering processes in a manufacturing environment to meet organisational objectives. He/She also assists in analysing equipment maintenance issues. In addition, the Assistant Equipment Engineer participates in equipment improvement projects, and partakes in the development of maintenance plans in accordance with organisational objectives. The Assistant Equipment Engineer is required to have strong communication skills, good teamwork and an analytical mind to perform his role well to achieve the desired organisational outcomes.</code> | <code>The Assistant Equipment Engineer utilizes engineering principles and techniques to enhance equipment engineering processes within a manufacturing setting, aligning with organizational goals. He/She also aids in evaluating equipment maintenance challenges. Furthermore, the Assistant Equipment Engineer engages in equipment enhancement initiatives and contributes to the formulation of maintenance strategies in line with organizational objectives. Strong communication skills, effective teamwork, and analytical thinking are essential for the Assistant Equipment Engineer to succeed in achieving the desired organizational results.</code> | <code>The Assistant Mechanical Engineer employs design principles and techniques to assist mechanical engineering tasks in a construction environment to fulfill project requirements. He/She also helps in reviewing machinery performance issues. Additionally, the Assistant Mechanical Engineer takes part in machinery optimization projects and contributes to the creation of operational strategies that meet project goals. Strong leadership abilities, effective collaboration, and critical thinking are necessary for the Assistant Mechanical Engineer to excel in reaching the intended project outcomes.</code> |
425
+ | <code>The Brokerage Supervisor/ Freight Supervisor is responsible for liaising with customers, logistics operators and customs officials and supervising the custom clearance/freight forwarding operations to ensure goods are cleared through customs or quarantine in accordance with import and export laws and regulations. Analytical and systematic, he/she is required to supervise a freight operations team to execute operations in a timely manner to meet business and customers' requirements. He/She is also expected to work with internal and external stakeholders to accomplish his work.</code> | <code>The Brokerage Supervisor/Freight Supervisor is tasked with coordinating with customers, logistics providers, and customs authorities while overseeing the customs clearance and freight forwarding processes to ensure that goods comply with import and export regulations. With a strong analytical and systematic approach, he/she leads a freight operations team to execute tasks promptly, meeting both business and customer needs. Additionally, he/she collaborates with internal and external stakeholders to achieve work objectives.</code> | <code>The Freight Operations Manager is responsible for interacting with suppliers, transportation companies, and regulatory agencies while managing the delivery and logistics services to guarantee that products adhere to supply chain protocols. With a focus on detail-oriented and organized practices, he/she directs a logistics team to carry out operations efficiently, fulfilling both company and supplier expectations. Furthermore, he/she engages with internal and external partners to fulfill his/her duties.</code> |
426
+ | <code>The Production Planner is responsible for managing and executing production plans and schedules to ensure that products are delivered to customers on time and within schedule. He/She plans for the entire production supply chain from feedstock to production, storage and distribution, and analyses production data to optimise production and inventory control. The Production Planner coordinates with the maintenance planning team to align production targets with the planning of maintenance and turnaround schedules. He supports the reporting of plant production status and raw materials inventories, and highlights issues that may affect production output. He monitors feedstock movement to ensure minimal interruption to the production schedule. In addition, he identifies opportunities for continuous improvement in the organisations supply chain operations. The Production Planner works closely with the production, maintenance planning, sales and logistics teams, and interfaces with suppliers an...</code> | <code>The Production Planner is tasked with overseeing and implementing production schedules to guarantee timely delivery of products to customers. He/She is responsible for planning the complete production supply chain, from the initial feedstock to production, storage, and distribution, while analyzing production data to enhance production efficiency and inventory management. The Production Planner collaborates with the maintenance planning team to synchronize production objectives with maintenance and turnaround schedules. He supports the reporting of plant production status and raw material inventories, addressing any issues that could impact production output. He ensures smooth feedstock movement to minimize disruptions to the production timeline and identifies opportunities for ongoing improvements in the organization's supply chain operations. The Production Planner works in close partnership with the production, maintenance planning, sales, and logistics teams, while also engaging wi...</code> | <code>The Software Developer creates applications and software solutions tailored to meet client needs, focusing on coding, debugging, and testing software programs. He/She collaborates with cross-functional teams to design user-friendly interfaces and enhance user experience. The Software Developer is responsible for maintaining and updating existing software, ensuring optimal performance and security standards are met. He conducts code reviews and provides technical support to other team members while staying updated on the latest industry trends and technologies.</code> |
427
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
428
+ ```json
429
+ {
430
+ "scale": 20.0,
431
+ "similarity_fct": "cos_sim",
432
+ "gather_across_devices": false
433
+ }
434
+ ```
435
+
436
+ ### Training Hyperparameters
437
+ #### Non-Default Hyperparameters
438
+
439
+ - `eval_strategy`: epoch
440
+ - `per_device_train_batch_size`: 32
441
+ - `per_device_eval_batch_size`: 16
442
+ - `gradient_accumulation_steps`: 16
443
+ - `learning_rate`: 2e-05
444
+ - `num_train_epochs`: 5
445
+ - `lr_scheduler_type`: cosine
446
+ - `warmup_ratio`: 0.1
447
+ - `bf16`: True
448
+ - `tf32`: False
449
+ - `load_best_model_at_end`: True
450
+ - `batch_sampler`: no_duplicates
451
+
452
+ #### All Hyperparameters
453
+ <details><summary>Click to expand</summary>
454
+
455
+ - `overwrite_output_dir`: False
456
+ - `do_predict`: False
457
+ - `eval_strategy`: epoch
458
+ - `prediction_loss_only`: True
459
+ - `per_device_train_batch_size`: 32
460
+ - `per_device_eval_batch_size`: 16
461
+ - `per_gpu_train_batch_size`: None
462
+ - `per_gpu_eval_batch_size`: None
463
+ - `gradient_accumulation_steps`: 16
464
+ - `eval_accumulation_steps`: None
465
+ - `torch_empty_cache_steps`: None
466
+ - `learning_rate`: 2e-05
467
+ - `weight_decay`: 0.0
468
+ - `adam_beta1`: 0.9
469
+ - `adam_beta2`: 0.999
470
+ - `adam_epsilon`: 1e-08
471
+ - `max_grad_norm`: 1.0
472
+ - `num_train_epochs`: 5
473
+ - `max_steps`: -1
474
+ - `lr_scheduler_type`: cosine
475
+ - `lr_scheduler_kwargs`: {}
476
+ - `warmup_ratio`: 0.1
477
+ - `warmup_steps`: 0
478
+ - `log_level`: passive
479
+ - `log_level_replica`: warning
480
+ - `log_on_each_node`: True
481
+ - `logging_nan_inf_filter`: True
482
+ - `save_safetensors`: True
483
+ - `save_on_each_node`: False
484
+ - `save_only_model`: False
485
+ - `restore_callback_states_from_checkpoint`: False
486
+ - `no_cuda`: False
487
+ - `use_cpu`: False
488
+ - `use_mps_device`: False
489
+ - `seed`: 42
490
+ - `data_seed`: None
491
+ - `jit_mode_eval`: False
492
+ - `use_ipex`: False
493
+ - `bf16`: True
494
+ - `fp16`: False
495
+ - `fp16_opt_level`: O1
496
+ - `half_precision_backend`: auto
497
+ - `bf16_full_eval`: False
498
+ - `fp16_full_eval`: False
499
+ - `tf32`: False
500
+ - `local_rank`: 0
501
+ - `ddp_backend`: None
502
+ - `tpu_num_cores`: None
503
+ - `tpu_metrics_debug`: False
504
+ - `debug`: []
505
+ - `dataloader_drop_last`: False
506
+ - `dataloader_num_workers`: 0
507
+ - `dataloader_prefetch_factor`: None
508
+ - `past_index`: -1
509
+ - `disable_tqdm`: False
510
+ - `remove_unused_columns`: True
511
+ - `label_names`: None
512
+ - `load_best_model_at_end`: True
513
+ - `ignore_data_skip`: False
514
+ - `fsdp`: []
515
+ - `fsdp_min_num_params`: 0
516
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
517
+ - `fsdp_transformer_layer_cls_to_wrap`: None
518
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
519
+ - `deepspeed`: None
520
+ - `label_smoothing_factor`: 0.0
521
+ - `optim`: adamw_torch_fused
522
+ - `optim_args`: None
523
+ - `adafactor`: False
524
+ - `group_by_length`: False
525
+ - `length_column_name`: length
526
+ - `ddp_find_unused_parameters`: None
527
+ - `ddp_bucket_cap_mb`: None
528
+ - `ddp_broadcast_buffers`: False
529
+ - `dataloader_pin_memory`: True
530
+ - `dataloader_persistent_workers`: False
531
+ - `skip_memory_metrics`: True
532
+ - `use_legacy_prediction_loop`: False
533
+ - `push_to_hub`: False
534
+ - `resume_from_checkpoint`: None
535
+ - `hub_model_id`: None
536
+ - `hub_strategy`: every_save
537
+ - `hub_private_repo`: None
538
+ - `hub_always_push`: False
539
+ - `hub_revision`: None
540
+ - `gradient_checkpointing`: False
541
+ - `gradient_checkpointing_kwargs`: None
542
+ - `include_inputs_for_metrics`: False
543
+ - `include_for_metrics`: []
544
+ - `eval_do_concat_batches`: True
545
+ - `fp16_backend`: auto
546
+ - `push_to_hub_model_id`: None
547
+ - `push_to_hub_organization`: None
548
+ - `mp_parameters`:
549
+ - `auto_find_batch_size`: False
550
+ - `full_determinism`: False
551
+ - `torchdynamo`: None
552
+ - `ray_scope`: last
553
+ - `ddp_timeout`: 1800
554
+ - `torch_compile`: False
555
+ - `torch_compile_backend`: None
556
+ - `torch_compile_mode`: None
557
+ - `include_tokens_per_second`: False
558
+ - `include_num_input_tokens_seen`: False
559
+ - `neftune_noise_alpha`: None
560
+ - `optim_target_modules`: None
561
+ - `batch_eval_metrics`: False
562
+ - `eval_on_start`: False
563
+ - `use_liger_kernel`: False
564
+ - `liger_kernel_config`: None
565
+ - `eval_use_gather_object`: False
566
+ - `average_tokens_across_devices`: False
567
+ - `prompts`: None
568
+ - `batch_sampler`: no_duplicates
569
+ - `multi_dataset_batch_sampler`: proportional
570
+ - `router_mapping`: {}
571
+ - `learning_rate_mapping`: {}
572
+
573
+ </details>
574
+
575
+ ### Training Logs
576
+ | Epoch | Step | Training Loss | Validation Loss |
577
+ |:-------:|:------:|:-------------:|:---------------:|
578
+ | 1.0 | 9 | 0.1048 | 0.0043 |
579
+ | 2.0 | 18 | 0.0042 | 0.0021 |
580
+ | 3.0 | 27 | 0.0018 | 0.0016 |
581
+ | 4.0 | 36 | 0.0019 | 0.0014 |
582
+ | **5.0** | **45** | **0.0021** | **0.0014** |
583
+
584
+ * The bold row denotes the saved checkpoint.
585
+
586
+ ### Framework Versions
587
+ - Python: 3.12.11
588
+ - Sentence Transformers: 5.1.0
589
+ - Transformers: 4.55.0
590
+ - PyTorch: 2.8.0+cu128
591
+ - Accelerate: 1.10.0
592
+ - Datasets: 4.0.0
593
+ - Tokenizers: 0.21.4
594
+
595
+ ## Citation
596
+
597
+ ### BibTeX
598
+
599
+ #### Sentence Transformers
600
+ ```bibtex
601
+ @inproceedings{reimers-2019-sentence-bert,
602
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
603
+ author = "Reimers, Nils and Gurevych, Iryna",
604
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
605
+ month = "11",
606
+ year = "2019",
607
+ publisher = "Association for Computational Linguistics",
608
+ url = "https://arxiv.org/abs/1908.10084",
609
+ }
610
+ ```
611
+
612
+ #### MultipleNegativesRankingLoss
613
+ ```bibtex
614
+ @misc{henderson2017efficient,
615
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
616
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
617
+ year={2017},
618
+ eprint={1705.00652},
619
+ archivePrefix={arXiv},
620
+ primaryClass={cs.CL}
621
+ }
622
+ ```
623
+
624
+ <!--
625
+ ## Glossary
626
+
627
+ *Clearly define terms in order to be accessible across audiences.*
628
+ -->
629
+
630
+ <!--
631
+ ## Model Card Authors
632
+
633
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
634
+ -->
635
+
636
+ <!--
637
+ ## Model Card Contact
638
+
639
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
640
+ -->
config.json ADDED
@@ -0,0 +1,45 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "ModernBertModel"
4
+ ],
5
+ "attention_bias": false,
6
+ "attention_dropout": 0.0,
7
+ "bos_token_id": 50281,
8
+ "classifier_activation": "gelu",
9
+ "classifier_bias": false,
10
+ "classifier_dropout": 0.0,
11
+ "classifier_pooling": "mean",
12
+ "cls_token_id": 50281,
13
+ "decoder_bias": true,
14
+ "deterministic_flash_attn": false,
15
+ "embedding_dropout": 0.0,
16
+ "eos_token_id": 50282,
17
+ "global_attn_every_n_layers": 3,
18
+ "global_rope_theta": 160000.0,
19
+ "gradient_checkpointing": false,
20
+ "hidden_activation": "gelu",
21
+ "hidden_size": 768,
22
+ "initializer_cutoff_factor": 2.0,
23
+ "initializer_range": 0.02,
24
+ "intermediate_size": 1152,
25
+ "layer_norm_eps": 1e-05,
26
+ "local_attention": 128,
27
+ "local_rope_theta": 10000.0,
28
+ "max_position_embeddings": 8192,
29
+ "mlp_bias": false,
30
+ "mlp_dropout": 0.0,
31
+ "model_type": "modernbert",
32
+ "norm_bias": false,
33
+ "norm_eps": 1e-05,
34
+ "num_attention_heads": 12,
35
+ "num_hidden_layers": 22,
36
+ "pad_token_id": 50283,
37
+ "position_embedding_type": "absolute",
38
+ "repad_logits_with_grad": false,
39
+ "sep_token_id": 50282,
40
+ "sparse_pred_ignore_index": -100,
41
+ "sparse_prediction": false,
42
+ "torch_dtype": "float32",
43
+ "transformers_version": "4.55.0",
44
+ "vocab_size": 50368
45
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "5.1.0",
4
+ "transformers": "4.55.0",
5
+ "pytorch": "2.8.0+cu128"
6
+ },
7
+ "prompts": {
8
+ "query": "",
9
+ "document": ""
10
+ },
11
+ "default_prompt_name": null,
12
+ "similarity_fn_name": "cosine",
13
+ "model_type": "SentenceTransformer"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e66d9ff9cecfd78eef9597b3f0371a1e08b7099cd5b6dd8f68dc5418cfd56412
3
+ size 596070136
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 8192,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": true,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,945 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "|||IP_ADDRESS|||",
5
+ "lstrip": false,
6
+ "normalized": true,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": false
10
+ },
11
+ "1": {
12
+ "content": "<|padding|>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "50254": {
20
+ "content": " ",
21
+ "lstrip": false,
22
+ "normalized": true,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": false
26
+ },
27
+ "50255": {
28
+ "content": " ",
29
+ "lstrip": false,
30
+ "normalized": true,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": false
34
+ },
35
+ "50256": {
36
+ "content": " ",
37
+ "lstrip": false,
38
+ "normalized": true,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": false
42
+ },
43
+ "50257": {
44
+ "content": " ",
45
+ "lstrip": false,
46
+ "normalized": true,
47
+ "rstrip": false,
48
+ "single_word": false,
49
+ "special": false
50
+ },
51
+ "50258": {
52
+ "content": " ",
53
+ "lstrip": false,
54
+ "normalized": true,
55
+ "rstrip": false,
56
+ "single_word": false,
57
+ "special": false
58
+ },
59
+ "50259": {
60
+ "content": " ",
61
+ "lstrip": false,
62
+ "normalized": true,
63
+ "rstrip": false,
64
+ "single_word": false,
65
+ "special": false
66
+ },
67
+ "50260": {
68
+ "content": " ",
69
+ "lstrip": false,
70
+ "normalized": true,
71
+ "rstrip": false,
72
+ "single_word": false,
73
+ "special": false
74
+ },
75
+ "50261": {
76
+ "content": " ",
77
+ "lstrip": false,
78
+ "normalized": true,
79
+ "rstrip": false,
80
+ "single_word": false,
81
+ "special": false
82
+ },
83
+ "50262": {
84
+ "content": " ",
85
+ "lstrip": false,
86
+ "normalized": true,
87
+ "rstrip": false,
88
+ "single_word": false,
89
+ "special": false
90
+ },
91
+ "50263": {
92
+ "content": " ",
93
+ "lstrip": false,
94
+ "normalized": true,
95
+ "rstrip": false,
96
+ "single_word": false,
97
+ "special": false
98
+ },
99
+ "50264": {
100
+ "content": " ",
101
+ "lstrip": false,
102
+ "normalized": true,
103
+ "rstrip": false,
104
+ "single_word": false,
105
+ "special": false
106
+ },
107
+ "50265": {
108
+ "content": " ",
109
+ "lstrip": false,
110
+ "normalized": true,
111
+ "rstrip": false,
112
+ "single_word": false,
113
+ "special": false
114
+ },
115
+ "50266": {
116
+ "content": " ",
117
+ "lstrip": false,
118
+ "normalized": true,
119
+ "rstrip": false,
120
+ "single_word": false,
121
+ "special": false
122
+ },
123
+ "50267": {
124
+ "content": " ",
125
+ "lstrip": false,
126
+ "normalized": true,
127
+ "rstrip": false,
128
+ "single_word": false,
129
+ "special": false
130
+ },
131
+ "50268": {
132
+ "content": " ",
133
+ "lstrip": false,
134
+ "normalized": true,
135
+ "rstrip": false,
136
+ "single_word": false,
137
+ "special": false
138
+ },
139
+ "50269": {
140
+ "content": " ",
141
+ "lstrip": false,
142
+ "normalized": true,
143
+ "rstrip": false,
144
+ "single_word": false,
145
+ "special": false
146
+ },
147
+ "50270": {
148
+ "content": " ",
149
+ "lstrip": false,
150
+ "normalized": true,
151
+ "rstrip": false,
152
+ "single_word": false,
153
+ "special": false
154
+ },
155
+ "50271": {
156
+ "content": " ",
157
+ "lstrip": false,
158
+ "normalized": true,
159
+ "rstrip": false,
160
+ "single_word": false,
161
+ "special": false
162
+ },
163
+ "50272": {
164
+ "content": " ",
165
+ "lstrip": false,
166
+ "normalized": true,
167
+ "rstrip": false,
168
+ "single_word": false,
169
+ "special": false
170
+ },
171
+ "50273": {
172
+ "content": " ",
173
+ "lstrip": false,
174
+ "normalized": true,
175
+ "rstrip": false,
176
+ "single_word": false,
177
+ "special": false
178
+ },
179
+ "50274": {
180
+ "content": " ",
181
+ "lstrip": false,
182
+ "normalized": true,
183
+ "rstrip": false,
184
+ "single_word": false,
185
+ "special": false
186
+ },
187
+ "50275": {
188
+ "content": " ",
189
+ "lstrip": false,
190
+ "normalized": true,
191
+ "rstrip": false,
192
+ "single_word": false,
193
+ "special": false
194
+ },
195
+ "50276": {
196
+ "content": " ",
197
+ "lstrip": false,
198
+ "normalized": true,
199
+ "rstrip": false,
200
+ "single_word": false,
201
+ "special": false
202
+ },
203
+ "50277": {
204
+ "content": "|||EMAIL_ADDRESS|||",
205
+ "lstrip": false,
206
+ "normalized": true,
207
+ "rstrip": false,
208
+ "single_word": false,
209
+ "special": false
210
+ },
211
+ "50278": {
212
+ "content": "|||PHONE_NUMBER|||",
213
+ "lstrip": false,
214
+ "normalized": true,
215
+ "rstrip": false,
216
+ "single_word": false,
217
+ "special": false
218
+ },
219
+ "50279": {
220
+ "content": "<|endoftext|>",
221
+ "lstrip": false,
222
+ "normalized": false,
223
+ "rstrip": false,
224
+ "single_word": false,
225
+ "special": true
226
+ },
227
+ "50280": {
228
+ "content": "[UNK]",
229
+ "lstrip": false,
230
+ "normalized": false,
231
+ "rstrip": false,
232
+ "single_word": false,
233
+ "special": true
234
+ },
235
+ "50281": {
236
+ "content": "[CLS]",
237
+ "lstrip": false,
238
+ "normalized": false,
239
+ "rstrip": false,
240
+ "single_word": false,
241
+ "special": true
242
+ },
243
+ "50282": {
244
+ "content": "[SEP]",
245
+ "lstrip": false,
246
+ "normalized": false,
247
+ "rstrip": false,
248
+ "single_word": false,
249
+ "special": true
250
+ },
251
+ "50283": {
252
+ "content": "[PAD]",
253
+ "lstrip": false,
254
+ "normalized": false,
255
+ "rstrip": false,
256
+ "single_word": false,
257
+ "special": true
258
+ },
259
+ "50284": {
260
+ "content": "[MASK]",
261
+ "lstrip": true,
262
+ "normalized": false,
263
+ "rstrip": false,
264
+ "single_word": false,
265
+ "special": true
266
+ },
267
+ "50285": {
268
+ "content": "[unused0]",
269
+ "lstrip": false,
270
+ "normalized": true,
271
+ "rstrip": false,
272
+ "single_word": false,
273
+ "special": false
274
+ },
275
+ "50286": {
276
+ "content": "[unused1]",
277
+ "lstrip": false,
278
+ "normalized": true,
279
+ "rstrip": false,
280
+ "single_word": false,
281
+ "special": false
282
+ },
283
+ "50287": {
284
+ "content": "[unused2]",
285
+ "lstrip": false,
286
+ "normalized": true,
287
+ "rstrip": false,
288
+ "single_word": false,
289
+ "special": false
290
+ },
291
+ "50288": {
292
+ "content": "[unused3]",
293
+ "lstrip": false,
294
+ "normalized": true,
295
+ "rstrip": false,
296
+ "single_word": false,
297
+ "special": false
298
+ },
299
+ "50289": {
300
+ "content": "[unused4]",
301
+ "lstrip": false,
302
+ "normalized": true,
303
+ "rstrip": false,
304
+ "single_word": false,
305
+ "special": false
306
+ },
307
+ "50290": {
308
+ "content": "[unused5]",
309
+ "lstrip": false,
310
+ "normalized": true,
311
+ "rstrip": false,
312
+ "single_word": false,
313
+ "special": false
314
+ },
315
+ "50291": {
316
+ "content": "[unused6]",
317
+ "lstrip": false,
318
+ "normalized": true,
319
+ "rstrip": false,
320
+ "single_word": false,
321
+ "special": false
322
+ },
323
+ "50292": {
324
+ "content": "[unused7]",
325
+ "lstrip": false,
326
+ "normalized": true,
327
+ "rstrip": false,
328
+ "single_word": false,
329
+ "special": false
330
+ },
331
+ "50293": {
332
+ "content": "[unused8]",
333
+ "lstrip": false,
334
+ "normalized": true,
335
+ "rstrip": false,
336
+ "single_word": false,
337
+ "special": false
338
+ },
339
+ "50294": {
340
+ "content": "[unused9]",
341
+ "lstrip": false,
342
+ "normalized": true,
343
+ "rstrip": false,
344
+ "single_word": false,
345
+ "special": false
346
+ },
347
+ "50295": {
348
+ "content": "[unused10]",
349
+ "lstrip": false,
350
+ "normalized": true,
351
+ "rstrip": false,
352
+ "single_word": false,
353
+ "special": false
354
+ },
355
+ "50296": {
356
+ "content": "[unused11]",
357
+ "lstrip": false,
358
+ "normalized": true,
359
+ "rstrip": false,
360
+ "single_word": false,
361
+ "special": false
362
+ },
363
+ "50297": {
364
+ "content": "[unused12]",
365
+ "lstrip": false,
366
+ "normalized": true,
367
+ "rstrip": false,
368
+ "single_word": false,
369
+ "special": false
370
+ },
371
+ "50298": {
372
+ "content": "[unused13]",
373
+ "lstrip": false,
374
+ "normalized": true,
375
+ "rstrip": false,
376
+ "single_word": false,
377
+ "special": false
378
+ },
379
+ "50299": {
380
+ "content": "[unused14]",
381
+ "lstrip": false,
382
+ "normalized": true,
383
+ "rstrip": false,
384
+ "single_word": false,
385
+ "special": false
386
+ },
387
+ "50300": {
388
+ "content": "[unused15]",
389
+ "lstrip": false,
390
+ "normalized": true,
391
+ "rstrip": false,
392
+ "single_word": false,
393
+ "special": false
394
+ },
395
+ "50301": {
396
+ "content": "[unused16]",
397
+ "lstrip": false,
398
+ "normalized": true,
399
+ "rstrip": false,
400
+ "single_word": false,
401
+ "special": false
402
+ },
403
+ "50302": {
404
+ "content": "[unused17]",
405
+ "lstrip": false,
406
+ "normalized": true,
407
+ "rstrip": false,
408
+ "single_word": false,
409
+ "special": false
410
+ },
411
+ "50303": {
412
+ "content": "[unused18]",
413
+ "lstrip": false,
414
+ "normalized": true,
415
+ "rstrip": false,
416
+ "single_word": false,
417
+ "special": false
418
+ },
419
+ "50304": {
420
+ "content": "[unused19]",
421
+ "lstrip": false,
422
+ "normalized": true,
423
+ "rstrip": false,
424
+ "single_word": false,
425
+ "special": false
426
+ },
427
+ "50305": {
428
+ "content": "[unused20]",
429
+ "lstrip": false,
430
+ "normalized": true,
431
+ "rstrip": false,
432
+ "single_word": false,
433
+ "special": false
434
+ },
435
+ "50306": {
436
+ "content": "[unused21]",
437
+ "lstrip": false,
438
+ "normalized": true,
439
+ "rstrip": false,
440
+ "single_word": false,
441
+ "special": false
442
+ },
443
+ "50307": {
444
+ "content": "[unused22]",
445
+ "lstrip": false,
446
+ "normalized": true,
447
+ "rstrip": false,
448
+ "single_word": false,
449
+ "special": false
450
+ },
451
+ "50308": {
452
+ "content": "[unused23]",
453
+ "lstrip": false,
454
+ "normalized": true,
455
+ "rstrip": false,
456
+ "single_word": false,
457
+ "special": false
458
+ },
459
+ "50309": {
460
+ "content": "[unused24]",
461
+ "lstrip": false,
462
+ "normalized": true,
463
+ "rstrip": false,
464
+ "single_word": false,
465
+ "special": false
466
+ },
467
+ "50310": {
468
+ "content": "[unused25]",
469
+ "lstrip": false,
470
+ "normalized": true,
471
+ "rstrip": false,
472
+ "single_word": false,
473
+ "special": false
474
+ },
475
+ "50311": {
476
+ "content": "[unused26]",
477
+ "lstrip": false,
478
+ "normalized": true,
479
+ "rstrip": false,
480
+ "single_word": false,
481
+ "special": false
482
+ },
483
+ "50312": {
484
+ "content": "[unused27]",
485
+ "lstrip": false,
486
+ "normalized": true,
487
+ "rstrip": false,
488
+ "single_word": false,
489
+ "special": false
490
+ },
491
+ "50313": {
492
+ "content": "[unused28]",
493
+ "lstrip": false,
494
+ "normalized": true,
495
+ "rstrip": false,
496
+ "single_word": false,
497
+ "special": false
498
+ },
499
+ "50314": {
500
+ "content": "[unused29]",
501
+ "lstrip": false,
502
+ "normalized": true,
503
+ "rstrip": false,
504
+ "single_word": false,
505
+ "special": false
506
+ },
507
+ "50315": {
508
+ "content": "[unused30]",
509
+ "lstrip": false,
510
+ "normalized": true,
511
+ "rstrip": false,
512
+ "single_word": false,
513
+ "special": false
514
+ },
515
+ "50316": {
516
+ "content": "[unused31]",
517
+ "lstrip": false,
518
+ "normalized": true,
519
+ "rstrip": false,
520
+ "single_word": false,
521
+ "special": false
522
+ },
523
+ "50317": {
524
+ "content": "[unused32]",
525
+ "lstrip": false,
526
+ "normalized": true,
527
+ "rstrip": false,
528
+ "single_word": false,
529
+ "special": false
530
+ },
531
+ "50318": {
532
+ "content": "[unused33]",
533
+ "lstrip": false,
534
+ "normalized": true,
535
+ "rstrip": false,
536
+ "single_word": false,
537
+ "special": false
538
+ },
539
+ "50319": {
540
+ "content": "[unused34]",
541
+ "lstrip": false,
542
+ "normalized": true,
543
+ "rstrip": false,
544
+ "single_word": false,
545
+ "special": false
546
+ },
547
+ "50320": {
548
+ "content": "[unused35]",
549
+ "lstrip": false,
550
+ "normalized": true,
551
+ "rstrip": false,
552
+ "single_word": false,
553
+ "special": false
554
+ },
555
+ "50321": {
556
+ "content": "[unused36]",
557
+ "lstrip": false,
558
+ "normalized": true,
559
+ "rstrip": false,
560
+ "single_word": false,
561
+ "special": false
562
+ },
563
+ "50322": {
564
+ "content": "[unused37]",
565
+ "lstrip": false,
566
+ "normalized": true,
567
+ "rstrip": false,
568
+ "single_word": false,
569
+ "special": false
570
+ },
571
+ "50323": {
572
+ "content": "[unused38]",
573
+ "lstrip": false,
574
+ "normalized": true,
575
+ "rstrip": false,
576
+ "single_word": false,
577
+ "special": false
578
+ },
579
+ "50324": {
580
+ "content": "[unused39]",
581
+ "lstrip": false,
582
+ "normalized": true,
583
+ "rstrip": false,
584
+ "single_word": false,
585
+ "special": false
586
+ },
587
+ "50325": {
588
+ "content": "[unused40]",
589
+ "lstrip": false,
590
+ "normalized": true,
591
+ "rstrip": false,
592
+ "single_word": false,
593
+ "special": false
594
+ },
595
+ "50326": {
596
+ "content": "[unused41]",
597
+ "lstrip": false,
598
+ "normalized": true,
599
+ "rstrip": false,
600
+ "single_word": false,
601
+ "special": false
602
+ },
603
+ "50327": {
604
+ "content": "[unused42]",
605
+ "lstrip": false,
606
+ "normalized": true,
607
+ "rstrip": false,
608
+ "single_word": false,
609
+ "special": false
610
+ },
611
+ "50328": {
612
+ "content": "[unused43]",
613
+ "lstrip": false,
614
+ "normalized": true,
615
+ "rstrip": false,
616
+ "single_word": false,
617
+ "special": false
618
+ },
619
+ "50329": {
620
+ "content": "[unused44]",
621
+ "lstrip": false,
622
+ "normalized": true,
623
+ "rstrip": false,
624
+ "single_word": false,
625
+ "special": false
626
+ },
627
+ "50330": {
628
+ "content": "[unused45]",
629
+ "lstrip": false,
630
+ "normalized": true,
631
+ "rstrip": false,
632
+ "single_word": false,
633
+ "special": false
634
+ },
635
+ "50331": {
636
+ "content": "[unused46]",
637
+ "lstrip": false,
638
+ "normalized": true,
639
+ "rstrip": false,
640
+ "single_word": false,
641
+ "special": false
642
+ },
643
+ "50332": {
644
+ "content": "[unused47]",
645
+ "lstrip": false,
646
+ "normalized": true,
647
+ "rstrip": false,
648
+ "single_word": false,
649
+ "special": false
650
+ },
651
+ "50333": {
652
+ "content": "[unused48]",
653
+ "lstrip": false,
654
+ "normalized": true,
655
+ "rstrip": false,
656
+ "single_word": false,
657
+ "special": false
658
+ },
659
+ "50334": {
660
+ "content": "[unused49]",
661
+ "lstrip": false,
662
+ "normalized": true,
663
+ "rstrip": false,
664
+ "single_word": false,
665
+ "special": false
666
+ },
667
+ "50335": {
668
+ "content": "[unused50]",
669
+ "lstrip": false,
670
+ "normalized": true,
671
+ "rstrip": false,
672
+ "single_word": false,
673
+ "special": false
674
+ },
675
+ "50336": {
676
+ "content": "[unused51]",
677
+ "lstrip": false,
678
+ "normalized": true,
679
+ "rstrip": false,
680
+ "single_word": false,
681
+ "special": false
682
+ },
683
+ "50337": {
684
+ "content": "[unused52]",
685
+ "lstrip": false,
686
+ "normalized": true,
687
+ "rstrip": false,
688
+ "single_word": false,
689
+ "special": false
690
+ },
691
+ "50338": {
692
+ "content": "[unused53]",
693
+ "lstrip": false,
694
+ "normalized": true,
695
+ "rstrip": false,
696
+ "single_word": false,
697
+ "special": false
698
+ },
699
+ "50339": {
700
+ "content": "[unused54]",
701
+ "lstrip": false,
702
+ "normalized": true,
703
+ "rstrip": false,
704
+ "single_word": false,
705
+ "special": false
706
+ },
707
+ "50340": {
708
+ "content": "[unused55]",
709
+ "lstrip": false,
710
+ "normalized": true,
711
+ "rstrip": false,
712
+ "single_word": false,
713
+ "special": false
714
+ },
715
+ "50341": {
716
+ "content": "[unused56]",
717
+ "lstrip": false,
718
+ "normalized": true,
719
+ "rstrip": false,
720
+ "single_word": false,
721
+ "special": false
722
+ },
723
+ "50342": {
724
+ "content": "[unused57]",
725
+ "lstrip": false,
726
+ "normalized": true,
727
+ "rstrip": false,
728
+ "single_word": false,
729
+ "special": false
730
+ },
731
+ "50343": {
732
+ "content": "[unused58]",
733
+ "lstrip": false,
734
+ "normalized": true,
735
+ "rstrip": false,
736
+ "single_word": false,
737
+ "special": false
738
+ },
739
+ "50344": {
740
+ "content": "[unused59]",
741
+ "lstrip": false,
742
+ "normalized": true,
743
+ "rstrip": false,
744
+ "single_word": false,
745
+ "special": false
746
+ },
747
+ "50345": {
748
+ "content": "[unused60]",
749
+ "lstrip": false,
750
+ "normalized": true,
751
+ "rstrip": false,
752
+ "single_word": false,
753
+ "special": false
754
+ },
755
+ "50346": {
756
+ "content": "[unused61]",
757
+ "lstrip": false,
758
+ "normalized": true,
759
+ "rstrip": false,
760
+ "single_word": false,
761
+ "special": false
762
+ },
763
+ "50347": {
764
+ "content": "[unused62]",
765
+ "lstrip": false,
766
+ "normalized": true,
767
+ "rstrip": false,
768
+ "single_word": false,
769
+ "special": false
770
+ },
771
+ "50348": {
772
+ "content": "[unused63]",
773
+ "lstrip": false,
774
+ "normalized": true,
775
+ "rstrip": false,
776
+ "single_word": false,
777
+ "special": false
778
+ },
779
+ "50349": {
780
+ "content": "[unused64]",
781
+ "lstrip": false,
782
+ "normalized": true,
783
+ "rstrip": false,
784
+ "single_word": false,
785
+ "special": false
786
+ },
787
+ "50350": {
788
+ "content": "[unused65]",
789
+ "lstrip": false,
790
+ "normalized": true,
791
+ "rstrip": false,
792
+ "single_word": false,
793
+ "special": false
794
+ },
795
+ "50351": {
796
+ "content": "[unused66]",
797
+ "lstrip": false,
798
+ "normalized": true,
799
+ "rstrip": false,
800
+ "single_word": false,
801
+ "special": false
802
+ },
803
+ "50352": {
804
+ "content": "[unused67]",
805
+ "lstrip": false,
806
+ "normalized": true,
807
+ "rstrip": false,
808
+ "single_word": false,
809
+ "special": false
810
+ },
811
+ "50353": {
812
+ "content": "[unused68]",
813
+ "lstrip": false,
814
+ "normalized": true,
815
+ "rstrip": false,
816
+ "single_word": false,
817
+ "special": false
818
+ },
819
+ "50354": {
820
+ "content": "[unused69]",
821
+ "lstrip": false,
822
+ "normalized": true,
823
+ "rstrip": false,
824
+ "single_word": false,
825
+ "special": false
826
+ },
827
+ "50355": {
828
+ "content": "[unused70]",
829
+ "lstrip": false,
830
+ "normalized": true,
831
+ "rstrip": false,
832
+ "single_word": false,
833
+ "special": false
834
+ },
835
+ "50356": {
836
+ "content": "[unused71]",
837
+ "lstrip": false,
838
+ "normalized": true,
839
+ "rstrip": false,
840
+ "single_word": false,
841
+ "special": false
842
+ },
843
+ "50357": {
844
+ "content": "[unused72]",
845
+ "lstrip": false,
846
+ "normalized": true,
847
+ "rstrip": false,
848
+ "single_word": false,
849
+ "special": false
850
+ },
851
+ "50358": {
852
+ "content": "[unused73]",
853
+ "lstrip": false,
854
+ "normalized": true,
855
+ "rstrip": false,
856
+ "single_word": false,
857
+ "special": false
858
+ },
859
+ "50359": {
860
+ "content": "[unused74]",
861
+ "lstrip": false,
862
+ "normalized": true,
863
+ "rstrip": false,
864
+ "single_word": false,
865
+ "special": false
866
+ },
867
+ "50360": {
868
+ "content": "[unused75]",
869
+ "lstrip": false,
870
+ "normalized": true,
871
+ "rstrip": false,
872
+ "single_word": false,
873
+ "special": false
874
+ },
875
+ "50361": {
876
+ "content": "[unused76]",
877
+ "lstrip": false,
878
+ "normalized": true,
879
+ "rstrip": false,
880
+ "single_word": false,
881
+ "special": false
882
+ },
883
+ "50362": {
884
+ "content": "[unused77]",
885
+ "lstrip": false,
886
+ "normalized": true,
887
+ "rstrip": false,
888
+ "single_word": false,
889
+ "special": false
890
+ },
891
+ "50363": {
892
+ "content": "[unused78]",
893
+ "lstrip": false,
894
+ "normalized": true,
895
+ "rstrip": false,
896
+ "single_word": false,
897
+ "special": false
898
+ },
899
+ "50364": {
900
+ "content": "[unused79]",
901
+ "lstrip": false,
902
+ "normalized": true,
903
+ "rstrip": false,
904
+ "single_word": false,
905
+ "special": false
906
+ },
907
+ "50365": {
908
+ "content": "[unused80]",
909
+ "lstrip": false,
910
+ "normalized": true,
911
+ "rstrip": false,
912
+ "single_word": false,
913
+ "special": false
914
+ },
915
+ "50366": {
916
+ "content": "[unused81]",
917
+ "lstrip": false,
918
+ "normalized": true,
919
+ "rstrip": false,
920
+ "single_word": false,
921
+ "special": false
922
+ },
923
+ "50367": {
924
+ "content": "[unused82]",
925
+ "lstrip": false,
926
+ "normalized": true,
927
+ "rstrip": false,
928
+ "single_word": false,
929
+ "special": false
930
+ }
931
+ },
932
+ "clean_up_tokenization_spaces": true,
933
+ "cls_token": "[CLS]",
934
+ "extra_special_tokens": {},
935
+ "mask_token": "[MASK]",
936
+ "model_input_names": [
937
+ "input_ids",
938
+ "attention_mask"
939
+ ],
940
+ "model_max_length": 8192,
941
+ "pad_token": "[PAD]",
942
+ "sep_token": "[SEP]",
943
+ "tokenizer_class": "PreTrainedTokenizerFast",
944
+ "unk_token": "[UNK]"
945
+ }