Ermiyas commited on
Commit
d5fefb7
·
verified ·
1 Parent(s): b35fbbb

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 384,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,711 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - dense
7
+ - generated_from_trainer
8
+ - dataset_size:1900
9
+ - loss:CosineSimilarityLoss
10
+ base_model: sentence-transformers/all-MiniLM-L6-v2
11
+ widget:
12
+ - source_sentence: ' Hi there, I''m writing a piece about a worker who shared a post
13
+ on Reddit questioning "What is a polite and professional way to explain to my
14
+ employer that since they''ve decided to cut our salary by 60%, that I''ll only
15
+ be completing 40% of all assigned tasks from now on?" Read the full context of
16
+ the story here at https://www.reddit.com/r/antiwork/comments/10oaben/what_is_a_polite_and_professional_way_to_explain/
17
+ OR pasted below:
18
+
19
+
20
+ READER POST BEGINS
21
+
22
+
23
+ My employers have decided that since we''re transitioning into a new company,
24
+ they deserve raises since they''ll be doing twice the amount of work. I work for
25
+ a pretty small non-profit, having said that ALL OF US have had to do twice the
26
+ amount of work we signed on for. I''m only contracted to work 40 hours of work
27
+ a week, but recently I''ve had to work 60 hours a week and have even been called
28
+ in for work ON MY DAYS OFF. But in spite of all of that, my employer has announced
29
+ that the rest of us will be losing vision and dental benefits and be required
30
+ to pay for health insurance out of pocket. Oh, and because we need to save money
31
+ for the new company, thw rest of us will be getting pay cuts.
32
+
33
+
34
+ READER POST ENDS
35
+
36
+
37
+ It would be great if a career expert or life coach can share comment on whether
38
+ the employee is approaching the situation in the best way. Could they be risking
39
+ getting fired? Would this sort of tactic be effective in getting the employer
40
+ to respond in a way that benefits the worker? Be great to get any brief comments
41
+ (200 or so words at max) | Career Expert or Life Coach Needed for Comment on
42
+ Employee Doing Only 40% of Their Work After 60% Pay Cut '
43
+ sentences:
44
+ - 'Hi Sydney, In response to your recent query regarding co-parenting apps, I''d
45
+ like to offer the insights below from New York family law attorney Atty Bruggemann,
46
+ Esq., partner at Dimopoulos Bruggemann P.C. (www.dimolaw.com). Atty specializes
47
+ in child custody, support and co-parenting matters and would be happy to offer
48
+ her perspective. Please see below for her thoughts: The benefits of co-parenting
49
+ apps: Co-parenting apps have been on the rise given the increase in co-parenting
50
+ or shared custody arrangements awarded by the courts even when the parents involved
51
+ fail to communicate effectively. While the failure of parents to get along and
52
+ communicate has historically been seen as a barrier to a shared custody arrangement
53
+ for a multitude of reasons, courts are now willing to take a chance on these arrangements.
54
+ With the advancement of technology there is now the ability to control communications
55
+ and the abuse that can come along with having full access through social media,
56
+ text messaging, emails, and apps. These apps consolidate all the mediums of communication
57
+ into one and allow parents to trust the communications, calendars, and financial
58
+ aspects of their relationship. The apps accomplish several things to help parents
59
+ in difficult co-parenting situations: - Tracking communications in one place.
60
+ Often keeping texts, emails, and other communications become difficult when all
61
+ mediums are permitted to be used. All communications, receipts, scheduling happens
62
+ in one place in these apps and helps to consolidate communications. - Accountability.
63
+ During the custody litigation everyone is on their best behavior. Many issues
64
+ arise post judgment because people begin to take matters into their own hands.
65
+ These apps save all communications and the communications cannot be edited. These
66
+ apps create that sense of eyes on the behavior of the parents. Further, all communications
67
+ can be downloaded and saved to provide as evidence should the parents needs to
68
+ go back to court on any issues. - Cuts down on harassing communications by not
69
+ giving each parent full and unfettered access to one another. Often there is a
70
+ prohibition from contacting one another outside the app unless a third party is
71
+ copied, such as a doctor or a teacher. Conversely, also stops phone call and text
72
+ blocking or claims that one parent did not receive a communication. - Calendaring:
73
+ There can be no confusion on when the children are with each parent and when the
74
+ vacation schedules are if the parents put the regular and vacation/holiday schedules
75
+ in the app calendar function. - Expense tracking: Many of the apps have the capability
76
+ to track expenses for the parents and record payments. This helps to cut down
77
+ on back-and-forth communications about who owes what and when the goal of the
78
+ apps is to organize and streamline interactions between parties who have proven
79
+ they cannot communicate effectively. While these apps cannot solve all issues
80
+ between parents, they can help to keep parents focused on the most important issue,
81
+ the wellbeing of their children. Here''s some background on Atty: Atty Bruggemann,
82
+ Esq. has handled some of the most intricate divorce cases on record. With over
83
+ fourteen years of experience, she has established herself as a trusted authority,
84
+ providing strategic counsel and innovative solutions when matters seem insurmountable.
85
+ Ms. Bruggemann specializes in navigating complex matrimonial cases involving high-net-worth
86
+ individuals, professional athletes, and celebrities, consistently securing favorable
87
+ outcomes that serve her clients'' best interests. Ms. Bruggemann’s extensive knowledge
88
+ and skill in child custody and support matters ensures compassionate advocacy.
89
+ While she prioritizes amicable resolutions, Ms. Bruggemann has a proven track
90
+ record of success in highly contentious custody disputes and is prepared to vigorously
91
+ litigate when necessary to safeguard the best interest of the children involved.
92
+ Her expertise further extends to the drafting and negotiation of prenuptial, postnuptial,
93
+ separation, and settlement agreements. Clients frequently call on her for advice
94
+ when contemplating a divorce, preparing for mediation, and for second opinions
95
+ on their matters. Recognized by Top Women Attorneys in New York and Super Lawyers’
96
+ Rising Stars, Ms. Bruggemann has published articles in The New York Law Journal
97
+ and New York Family Law Monthly. She is a member of the American Bar Association
98
+ and the New York State Bar Association. Ms. Bruggemann received a B.A. from Emerson
99
+ College, cum laude, and a J.D. from New York Law School, cum laude. She is admitted
100
+ to practice law in New York. Thank you for your consideration. Best, Kelly Lee
101
+ Co-Communications (585) 764-4661 (cell) klee@cocommunications.com'
102
+ - 'Hi Lindsay, Here''s a real estate investment story that your readers may find
103
+ interesting. I''m a pharmacist by profession, so when I started investing in real
104
+ estate I didn''t know what to look for in a property – or what to avoid. The first
105
+ investment property I ever bought was a 100-year-old house. It was cheap, and
106
+ I thought I had snagged a great deal. But I soon realized I had gotten myself
107
+ into a project that needed a lot more than I had originally bargained for. Between
108
+ the broken pipes and the rat and flea infestations, my new property needed $40,000
109
+ in repairs and improvements. But here are a few things I learned from that first
110
+ investment: 1. Always vet a property before buying it. If you uncover problems
111
+ that aren''t deal breakers, you''ll be in a much better position to negotiate
112
+ the sale price if you know what''s wrong and how much repairs would cost. 2. Invest
113
+ in properties that are big enough to allow you to maximize the number of bedrooms
114
+ and bathrooms. That way, you can rent out individual rooms to college students
115
+ and end up earning a lot more for the same property (versus renting the entire
116
+ house to one person) 3. Expensive mistakes can help you if you take the time to
117
+ learn from them. That''s what I did, and it''s helped me invest in properties
118
+ that offer a high ROI. Despite that early investment mistake, I chose to keep
119
+ going. And now, I can choose to retire thanks to my investment portfolio – even
120
+ though I''m still in my early thirties. Please let me know if you''d like to hear
121
+ more. Website: https://www.newbierealestateinvesting.com/ Warmly, Ryan'
122
+ - It is difficult to comment on the situation without knowing the specific details
123
+ of the manager in question. However, in general, it is not recommended to approach
124
+ a request for a raise or better benefits in a confrontational or ultimatum-style
125
+ manner, as this can create tension and negatively affect the employee's relationship
126
+ with their employer. It can also risk the employee being seen as difficult or
127
+ unreasonable, which could result in negative consequences such as not receiving
128
+ the desired raise or even being fired. A better approach would be to schedule
129
+ a meeting with the employer and have an open and honest conversation about the
130
+ employee's contribution to the company, their career goals, and their reasons
131
+ for requesting a raise or better benefits. The employee can present data, such
132
+ as market research or their performance metrics, to support their request and
133
+ show their value to the company. It's important to remember that the outcome of
134
+ these discussions is not always guaranteed, and the employer may not be able to
135
+ meet the employee's request. However, approaching the situation in a professional
136
+ and solution-focused manner can increase the chances of reaching a mutually beneficial
137
+ agreement and maintaining a positive relationship with the employer.
138
+ - source_sentence: ' I''m putting together a piece on "Times Using Your Debit Card
139
+ Will Cost You More." It''s similar to this piece about debit cards ( https://bestlifeonline.com/never-use-debit-card-purchases-news/
140
+ ), and is pretty self-explanatory! Could be overdraft fees, higher rates, etc...
141
+
142
+
143
+ FYI: For most pieces, we simply collect a recommendation and then 2-3 sentences
144
+ about why it''s been selected it as we present them in slideshow form. | Finance
145
+ and consumer experts needed for piece on "times when debit cards cost you more" '
146
+ sentences:
147
+ - Hi Zach, Hope all is well! As you're looking for financial experts to discuss
148
+ times using your debit card will cost you more, I'd love to offer Courtney Alev,
149
+ Credit Karma's Consumer Financial Advocate. In her role, Courtney is responsible
150
+ for building Credit Karma's next generation of product features focused on helping
151
+ members know, grow and protect their wealth. She'd be happy to share her insights
152
+ on the topic. Let me know if you'd be interested in getting Courtney's insights,
153
+ happy to share any specific questions you have. Best, Morgan
154
+ - Hi Tabitha, I hope your day is going well! Per you query, i'd love to offer expert
155
+ insights from Vanessa Johnson, Director of Instructor Training. Vanessa can answer
156
+ the questions you need for your article or any others you may have. Just let me
157
+ know and I'd be happy to coordinate! Best, Gabe Rosenberg grosenberg@fishmanpr.com
158
+ - 'Hi Rachel! I hope this message finds you well. I’m reaching out to because I
159
+ believe we''ve got a unique device that would be an exact match for your upcoming
160
+ coverage. Our company, Kineon Labs, has engineered a safe, non-invasive, red light
161
+ therapy device called the Move+ which was built from the ground up to relieve
162
+ joint pain and support recovery. The Move + was launched on Indiegogo and received
163
+ 100% funding in just 24 hrs. If you or someone you know has suffered a joint injury
164
+ or chronic pain, you know the most common way to treat that pain is to apply ice
165
+ or take over the counter medication. Recent studies have shown icing to be more
166
+ detrimental to your recovery and traditional medications like NSAID’s aren’t a
167
+ healthy long-term strategy. The Move+ was developed by a team of physiotherapists,
168
+ scientists, medical professionals and product geeks, the sleek, portable device
169
+ uses the latest scientific research and combines powerful lasers and LEDs to deliver
170
+ deeper penetration and optimal results. The Move+ helps those suffering from chronic
171
+ joint pain to live a happier and healthier life– because everyone should be able
172
+ to move comfortably and pain-free. If you’d like to learn more about the product,
173
+ please check out: https://kineon.io/products/knee If interested, I’d be happy
174
+ to coordinate an interview with the founders of Kineon Labs, and/or provide a
175
+ sample for you to try out. Thank you. I look forward to hearing back from you.
176
+ Kind regards, Analyn Maer'
177
+ - source_sentence: " Looking for regtech/ fintech regulation industry members to take\
178
+ \ part in a webinar on Entity Centric AML.\n\nThe webinar is about the need to\
179
+ \ take a broader look at activity and behaviour when assessing suspicion. The\
180
+ \ way to do this is to focus on the subject entity and not on the individual events/\
181
+ \ transactions which have occurred. We call this Entity Centric AML, by taking\
182
+ \ a more contextual view, taking into account all information on the entity, from\
183
+ \ both internal and external data sources, suspicious activity for each entity\
184
+ \ can be more clearly defined and therefore monitored for and detected on.\n \n\
185
+ Discussion points:\nDo we need to take a different approach?\nWhat is an Entity\
186
+ \ Centric approach\nWhat are the benefits of this approach\nHow do we achieve\
187
+ \ this approach?\n\nLooking for speaker suggestions, get in touch with any questions\
188
+ \ etc\n | Fintech/regtech industry member or AML expert for Entity Centric AML\
189
+ \ Webinar "
190
+ sentences:
191
+ - ' '
192
+ - Hey AK! My name is Brian Gallegos, and I am representing Generation Tux, US formalwear
193
+ platform specializing in at home fitting with doorstep delivery. Generation is
194
+ Tux is about to launch a line of female suits to cater to female presenting individuals.
195
+ We are hoping to speak out on our support and the LGBT+ community, and I think
196
+ that your podcast might be a great opportunity. I am representing Pablo Villalpando
197
+ directly, but we have a great team from Generation Tux that I'm sure would LOVE
198
+ to engage. Fingers crossed to hearing back! -Brian G. 🌈
199
+ - 'Hi Polly, Hope you''re well. Having looked at your request for someone from the
200
+ regtech/fintech industry to take part in this webinar, I believe that our Head
201
+ of Compliance, Kirsty Coughlan, would be a great fit. She''s a great public speaker
202
+ and would be able to give a lot of insight into entity-centric AML, as well as
203
+ from a crypto perspective. Please find her bio below: Kirsty is an experienced
204
+ professional with over 16 years experience within the compliance and financial
205
+ services industry. Notably, during the last 13 years, she has worked exclusively
206
+ in the foreign exchange and global payments space, holding senior compliance positions
207
+ fighting financial crime. She currently works as Head of Compliance at leading
208
+ crypto payments platform, Wirex, where she holds MLRO status and is the primary
209
+ individual responsible for all KYC/AML/financial crime-related issues and risk
210
+ management. With this role, she provides ongoing cross-company support and training
211
+ to the Compliance team and global business from a commercial, risk and compliant
212
+ perspective. Let me know if you need any more info! Thanks, Lottie'
213
+ - source_sentence: " Cucumbers are a staple in classic Mediterranean side dishes and\
214
+ \ condiments, from Israeli salads to Tzaziki and dill pickles. However, beyond\
215
+ \ that, they are a secret weapon for a variety of recipes. \n\nI am looking for\
216
+ \ chefs, cookbook authors and even nutritionists to share recipes with great photos\
217
+ \ as well as unexpected uses for them, types of cucumbers and their use, and health\
218
+ \ benefits that have made cucumbers a staple in the summertime diets? Do they\
219
+ \ have healing properties or the ability to cool us down when it gets hot?\n\n\
220
+ Looking forward to some fresh slices of information!\n | Cucumbers: Nothing’s\
221
+ \ Cooler in Summer Recipes "
222
+ sentences:
223
+ - 'Hi Mandy, I am a small business owner with two retail stores. In addition, our
224
+ gourmet food products are distributed in 42 states. Here is my real-world experience
225
+ in reply to your questions: "Our BOP covers makes it simple for us since everything
226
+ is contained in one policy including general liability and property insurance.
227
+ However, as will any small business, uncertainty is part to being in business.
228
+ For example, several years ago one of our retail store fell victim to a breaking
229
+ and entering. Fortunately, in addition we have addition options including crime
230
+ and business interruption. So with one call to our agent, we were covered the
231
+ costs to repair damaged windows, walls and missing cash. " Andy LaPointe www.TraverseBayFarms.com
232
+ Traverse Bay Farms - Winner of 48+ National Food Awards'
233
+ - Hi there! Longtime insurance product manager now insurance agent here. Lots of
234
+ thoughts on gap insurance! Kate Terry, CEO, Surround Insurance Cars have gotten
235
+ a lot more expensive over the past few years, and as a result, people are taking
236
+ out longer auto loans so they can afford the monthly payment on a new car. However,
237
+ this comes with a risk - new cars depreciate rapidly. By the end of the first
238
+ year of ownership, a car may be worth 20% less than what you paid for it. Unfortunately,
239
+ if you have long car loan, say 72 or 84 months, the balance on your loan may be
240
+ more than your car is worth. If you total it in a car crash, you still owe the
241
+ loan, even though you no longer have a driveable car, and you probably need to
242
+ buy a new car as well. This situation is what gap insurance prevents. Gap insurance,
243
+ which you can get through your auto insurance company or sometimes from the dealer,
244
+ pays out the difference between your loan balance and the actual cash value of
245
+ your car, minus the deductible on your insurance policy. That lets you pay off
246
+ your auto loan and just focus on finding a new car. Gap insurance will usually
247
+ cost you under $100/year if you add it to your car insurance. You'll probably
248
+ have to maintain full coverage on your car as well, which is comprehensive and
249
+ collision coverage in addition to liability. Most lenders require full coverage
250
+ anyway, though, so this isn't really an additional requirement. And you should
251
+ pay attention to any limitations on payouts. Sometimes, there are limits to how
252
+ large a gap will be covered, so you'll want to take that into account.
253
+ - Hi Elyse, I hope this note finds you well. Just a short note to introduce myself.
254
+ I am an expert on the Middle East conflict and I am on the ground in Judea (the
255
+ "West Bank"). I can answer your questions on the Arab-Israeli conflict, discuss
256
+ history, the Israel/Hamas war, address issues of international law, and more.
257
+ So much of the news is distorted and agenda-driven media. If you are looking for
258
+ an unapologetic, truthful analysis of what's happening on the ground, I can be
259
+ very helpful. If you ever want to increase your basic knowledge on the subject,
260
+ I'm happy to brief you on key historical developments that got us to where we
261
+ are today or answer any other questions about the conflict that you always wanted
262
+ to know and just never found good answers readily available. I look forward to
263
+ working with you in the near future. Kindest regards, Kalman Michoel
264
+ - source_sentence: ' We''re working on a story about negotiating a raise at work during
265
+ a time of rising inflation. We need these queries answered for the story.
266
+
267
+
268
+ 1. What impact, if any, is skyrocketing inflation having on salaries and compensation
269
+ on the job?
270
+
271
+
272
+ 2. Is it reasonable for an employee to ask for a raise at a time of high inflation?
273
+ Or should job responsibilities be the only factor when asking for a raise?
274
+
275
+
276
+ 3. What are your best tips for negotiating a raise at a time of high inflation?
277
+ What strategies works and what doesn''t?
278
+
279
+
280
+ Thanks. | Looking for negotiation, career and financial experts for an article
281
+ on negotiating a raise at work at a time of high inflation '
282
+ sentences:
283
+ - 'Hey Amy Walters, Thank you for reaching out. Dr. Sandeep Nayak, a Surgical Oncologist
284
+ (DNB, MRCS UK, Fellowship in Laparoscopic and Robotic Surgical Oncology), provides
285
+ his professional insight into the claim of curing cancer through dietary changes:
286
+ "A claim like this needs to be examined carefully. While diet and nutrition play
287
+ a crucial role in overall health and can support recovery during and after cancer
288
+ treatment, there is no scientific evidence to suggest that dietary changes alone
289
+ can cure cancer. The case mentioned, where surgery was performed and a plant-based
290
+ diet followed, likely reflects the combined effect of medical intervention and
291
+ lifestyle improvements. Surgery is often a curative treatment for localized cancers,
292
+ but attributing the cure solely to dietary changes is misleading. What diet does
293
+ offer is an enhancement to the body’s ability to heal and recover. For example,
294
+ a diet rich in vegetables, legumes, and whole foods provides antioxidants and
295
+ anti-inflammatory properties that can support the immune system and potentially
296
+ reduce the risk of cancer recurrence. However, this should complement, not replace,
297
+ proven cancer therapies like surgery, chemotherapy, or immunotherapy. It’s important
298
+ for patients to understand that cancer is multifactorial, involving genetic, environmental,
299
+ and lifestyle factors. While maintaining a healthy diet is essential for reducing
300
+ risks and supporting recovery, it is not a standalone cure. Bold claims like this
301
+ can deter patients from seeking evidence-based treatments, which could have life-saving
302
+ outcomes." Dr. Nayak emphasizes that diet, while a powerful tool in promoting
303
+ health, is one part of a larger treatment plan. "The best approach combines advanced
304
+ medical treatments with lifestyle modifications for long-term well-being and prevention.
305
+ Patients should always consult their oncologist or healthcare provider before
306
+ making decisions about cancer care based on anecdotal reports." If you’d like
307
+ additional commentary or specific examples, please feel free to reach out. Here
308
+ are Dr. Nayak’s details for attribution: Dr. Sandeep Nayak DNB (General Surgery),
309
+ DNB (Surgical Oncology), MRCS (UK), MNAMS (General Surgery) Fellowship in Laparoscopic
310
+ and Robotic Surgical Oncology Profile: Dr. Sandeep Nayak Website: MACS for Cancer'
311
+ - ' '
312
+ - Hello, This is just a test pitch that is being submitted by the Qwoted team. Apologies
313
+ for any confusion in your inbox. All the best, Shelby Bridges
314
+ pipeline_tag: sentence-similarity
315
+ library_name: sentence-transformers
316
+ metrics:
317
+ - pearson_cosine
318
+ - spearman_cosine
319
+ model-index:
320
+ - name: SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
321
+ results:
322
+ - task:
323
+ type: semantic-similarity
324
+ name: Semantic Similarity
325
+ dataset:
326
+ name: sts dev
327
+ type: sts-dev
328
+ metrics:
329
+ - type: pearson_cosine
330
+ value: 0.9237640704578296
331
+ name: Pearson Cosine
332
+ - type: spearman_cosine
333
+ value: 0.817098656291897
334
+ name: Spearman Cosine
335
+ ---
336
+
337
+ # SentenceTransformer based on sentence-transformers/all-MiniLM-L6-v2
338
+
339
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) on the csv dataset. It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
340
+
341
+ ## Model Details
342
+
343
+ ### Model Description
344
+ - **Model Type:** Sentence Transformer
345
+ - **Base model:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) <!-- at revision c9745ed1d9f207416be6d2e6f8de32d1f16199bf -->
346
+ - **Maximum Sequence Length:** 256 tokens
347
+ - **Output Dimensionality:** 384 dimensions
348
+ - **Similarity Function:** Cosine Similarity
349
+ - **Training Dataset:**
350
+ - csv
351
+ <!-- - **Language:** Unknown -->
352
+ <!-- - **License:** Unknown -->
353
+
354
+ ### Model Sources
355
+
356
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
357
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
358
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
359
+
360
+ ### Full Model Architecture
361
+
362
+ ```
363
+ SentenceTransformer(
364
+ (0): Transformer({'max_seq_length': 256, 'do_lower_case': False, 'architecture': 'BertModel'})
365
+ (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
366
+ (2): Normalize()
367
+ )
368
+ ```
369
+
370
+ ## Usage
371
+
372
+ ### Direct Usage (Sentence Transformers)
373
+
374
+ First install the Sentence Transformers library:
375
+
376
+ ```bash
377
+ pip install -U sentence-transformers
378
+ ```
379
+
380
+ Then you can load this model and run inference.
381
+ ```python
382
+ from sentence_transformers import SentenceTransformer
383
+
384
+ # Download from the 🤗 Hub
385
+ model = SentenceTransformer("Ermiyas/embedding-1")
386
+ # Run inference
387
+ sentences = [
388
+ " We're working on a story about negotiating a raise at work during a time of rising inflation. We need these queries answered for the story.\n\n1. What impact, if any, is skyrocketing inflation having on salaries and compensation on the job?\n\n2. Is it reasonable for an employee to ask for a raise at a time of high inflation? Or should job responsibilities be the only factor when asking for a raise?\n\n3. What are your best tips for negotiating a raise at a time of high inflation? What strategies works and what doesn't?\n\nThanks. | Looking for negotiation, career and financial experts for an article on negotiating a raise at work at a time of high inflation ",
389
+ ' ',
390
+ 'Hey Amy Walters, Thank you for reaching out. Dr. Sandeep Nayak, a Surgical Oncologist (DNB, MRCS UK, Fellowship in Laparoscopic and Robotic Surgical Oncology), provides his professional insight into the claim of curing cancer through dietary changes: "A claim like this needs to be examined carefully. While diet and nutrition play a crucial role in overall health and can support recovery during and after cancer treatment, there is no scientific evidence to suggest that dietary changes alone can cure cancer. The case mentioned, where surgery was performed and a plant-based diet followed, likely reflects the combined effect of medical intervention and lifestyle improvements. Surgery is often a curative treatment for localized cancers, but attributing the cure solely to dietary changes is misleading. What diet does offer is an enhancement to the body’s ability to heal and recover. For example, a diet rich in vegetables, legumes, and whole foods provides antioxidants and anti-inflammatory properties that can support the immune system and potentially reduce the risk of cancer recurrence. However, this should complement, not replace, proven cancer therapies like surgery, chemotherapy, or immunotherapy. It’s important for patients to understand that cancer is multifactorial, involving genetic, environmental, and lifestyle factors. While maintaining a healthy diet is essential for reducing risks and supporting recovery, it is not a standalone cure. Bold claims like this can deter patients from seeking evidence-based treatments, which could have life-saving outcomes." Dr. Nayak emphasizes that diet, while a powerful tool in promoting health, is one part of a larger treatment plan. "The best approach combines advanced medical treatments with lifestyle modifications for long-term well-being and prevention. Patients should always consult their oncologist or healthcare provider before making decisions about cancer care based on anecdotal reports." If you’d like additional commentary or specific examples, please feel free to reach out. Here are Dr. Nayak’s details for attribution: Dr. Sandeep Nayak DNB (General Surgery), DNB (Surgical Oncology), MRCS (UK), MNAMS (General Surgery) Fellowship in Laparoscopic and Robotic Surgical Oncology Profile: Dr. Sandeep Nayak Website: MACS for Cancer',
391
+ ]
392
+ embeddings = model.encode(sentences)
393
+ print(embeddings.shape)
394
+ # [3, 384]
395
+
396
+ # Get the similarity scores for the embeddings
397
+ similarities = model.similarity(embeddings, embeddings)
398
+ print(similarities)
399
+ # tensor([[ 1.0000, -0.0021, 0.4980],
400
+ # [ -0.0021, 1.0000, 0.0003],
401
+ # [ 0.4980, 0.0003, 1.0000]])
402
+ ```
403
+
404
+ <!--
405
+ ### Direct Usage (Transformers)
406
+
407
+ <details><summary>Click to see the direct usage in Transformers</summary>
408
+
409
+ </details>
410
+ -->
411
+
412
+ <!--
413
+ ### Downstream Usage (Sentence Transformers)
414
+
415
+ You can finetune this model on your own dataset.
416
+
417
+ <details><summary>Click to expand</summary>
418
+
419
+ </details>
420
+ -->
421
+
422
+ <!--
423
+ ### Out-of-Scope Use
424
+
425
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
426
+ -->
427
+
428
+ ## Evaluation
429
+
430
+ ### Metrics
431
+
432
+ #### Semantic Similarity
433
+
434
+ * Dataset: `sts-dev`
435
+ * Evaluated with [<code>EmbeddingSimilarityEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.EmbeddingSimilarityEvaluator)
436
+
437
+ | Metric | Value |
438
+ |:--------------------|:-----------|
439
+ | pearson_cosine | 0.9238 |
440
+ | **spearman_cosine** | **0.8171** |
441
+
442
+ <!--
443
+ ## Bias, Risks and Limitations
444
+
445
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
446
+ -->
447
+
448
+ <!--
449
+ ### Recommendations
450
+
451
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
452
+ -->
453
+
454
+ ## Training Details
455
+
456
+ ### Training Dataset
457
+
458
+ #### csv
459
+
460
+ * Dataset: csv
461
+ * Size: 1,900 training samples
462
+ * Columns: <code>request</code>, <code>pitch</code>, and <code>score</code>
463
+ * Approximate statistics based on the first 1000 samples:
464
+ | | request | pitch | score |
465
+ |:--------|:-------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------|:----------------------------------------------------------------|
466
+ | type | string | string | float |
467
+ | details | <ul><li>min: 16 tokens</li><li>mean: 120.15 tokens</li><li>max: 256 tokens</li></ul> | <ul><li>min: 2 tokens</li><li>mean: 160.7 tokens</li><li>max: 256 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.75</li><li>max: 85.0</li></ul> |
468
+ * Samples:
469
+ | request | pitch | score |
470
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------|
471
+ | <code>What can be done to improve diversity for financial advisors? | What are tangible ways that the financial advisory industry can increase diversity?</code> | <code>What can be done to improve diversity for financial advisors? What are tangible ways that the financial advisory industry can increase diversity?<br><br>Even as an eternal optimist, I'll start with the business case: diverse teams are more likely to outperform on the bottom line. McKinsey can give you the stats. With that out of the way, let's talk about action. "Diversity & Inclusion" is often taken to mean hiring statistics, but increasing diversity in the financial advisory industry requires work to build a talent pipeline for advisors and to build better services from advisors. A history of exclusionary practices has resulted in our status quo; today, financial advisors are often assumed to be well-off themselves and to serve a well-off clientele. Changing this history is going to take time, and improving diversity for financial advisors and for the industry won't happen overnight. <br><br>The industry should start by investing time to build a diverse customer base and to meet their needs. On ...</code> | <code>0.95</code> |
472
+ | <code> UPDATE: Medical Doctors who are also Geriatricians | Topic: Baby Boomers <br>I want to speak with US medical experts (for 15 min) support the medical needs of those born from 1946 and 1964 AKA "Boomers" <br><br>Previous: <br>or those in professions that support their healthcare and retirement concerns. Aging life Specialists and Care Managers and Geriatricians to the front, please.<br>Someone who can chat with me about the nuance in this demographic and what that are currently facing. <br><br>-Please don't send ready-made quotes for this one. <br><br>-Looking for experts who can speak to the health and social concerns of this demographic. It would be ideal to connect with an expert who can speak for 15 minutes on the 11 or 12 of next week.<br><br>Here are my primary questions about Baby Boomers: <br>What Role do Baby Boomers play in family dynamics today and why does it matter?<br>What are challenges baby boomers are facing when it comes to their health?<br>What are challenges baby boomers are facing when it comes to senio...</code> | <code>Hi Yolande,<br><br>I'd like to introduce you to Ann Lilly as a potential source for you. Ann is the Brand Lead for House Doctors, a leading home improvement and handyman service, and just last year, she helped launch the Aging in Place program. This program prioritizes fall prevention for the aging population, helping so seniors can live independently and safely in their homes with a solution that addresses their unique needs.<br><br>One in Four senior citizens in the U.S. suffer a fall each year. With nearly 90% of seniors expressing a desire to remain in their own homes as they age, Ann is helping to ensure that seniors can continue living in their homes with the comfort and safety they deserve.<br><br>KEY STATS:<br>• 1 in 4 seniors in the U.S. falls each year<br>• Falls result in 3 million emergency department visits annually.<br>• 76% of remodelers have seen a significant or moderate increase in requests for aging-in-place features in the last five years.<br>• The most common aging-i...</code> | <code>0.8</code> |
473
+ | <code>Looking to speak to a researcher that has been in academia 20+ years | Looking to have a quick conversation with a researcher that has been in academia 20+ years and uses Twitter to share their research.</code> | <code> </code> | <code>0.0</code> |
474
+ * Loss: [<code>CosineSimilarityLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosinesimilarityloss) with these parameters:
475
+ ```json
476
+ {
477
+ "loss_fct": "torch.nn.modules.loss.MSELoss"
478
+ }
479
+ ```
480
+
481
+ ### Evaluation Dataset
482
+
483
+ #### csv
484
+
485
+ * Dataset: csv
486
+ * Size: 316 evaluation samples
487
+ * Columns: <code>request</code>, <code>pitch</code>, and <code>score</code>
488
+ * Approximate statistics based on the first 316 samples:
489
+ | | request | pitch | score |
490
+ |:--------|:-------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:----------------------------------------------------------------|
491
+ | type | string | string | float |
492
+ | details | <ul><li>min: 14 tokens</li><li>mean: 120.31 tokens</li><li>max: 256 tokens</li></ul> | <ul><li>min: 2 tokens</li><li>mean: 161.22 tokens</li><li>max: 256 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.66</li><li>max: 0.95</li></ul> |
493
+ * Samples:
494
+ | request | pitch | score |
495
+ |:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------|
496
+ | <code> I am working on an article for USA Today's Blueprint (Business Section) on the best onboarding practices. Looking for an HR professional to weigh in on the following: <br><br>1. What are some top onboarding practices? For instance, keep new hires engaged before their joining date, set up team meet and greets, set clear goals and objectives, check in often, collect feedback to refine onboarding practices. <br><br>2. Why is it important to implement these best onboarding practices? How does it help with productivity, retention, general morale and well-being in the workplace? <br><br>3. What are some challenges to look out for during onboarding? <br><br>4. What are some tools to help with the onboarding process? | Human resources professional needed for insights on best onboarding and worker retention practices </code> | <code>Hey Jackie, I work with Keith Harper, the founder and CEO of NY-based Above & Beyond Talent Acquisition. A&B is on a mission to increase executive diversity among the Fortune 500 and along the way Keith has helped hundreds of companies find and successfully onboard candidates. They actually covered some onboarding strategies that maximize retention in a recent blog post here: https://aandbtalent.com/importance-of-mentorship-during-new-employee-onboarding/ I think Keith would be a fantastic resource and I'd be happy to connect you both if you'd like to schedule some time to speak with him. Very best, Bruce</code> | <code>0.8</code> |
497
+ | <code> I’m working on a TravelSavvy feature on the Best Spring Break Destinations for Families With Young Kids—places where parents can relax, kids can play, and everyone gets the most out of their vacation. Whether it's a beach escape with calm waters (for newbie swimmers), a resort packed with kid-friendly activities, or an off-the-beaten-path gem perfect that's safe for family adventures, I want to hear about it! If you rep a destination (hotel, resort, tourism board) that offers an unforgettable spring break experience for families, send me your pitch! Please include links to book, off-peak and peak price ranges, and details on what makes it kid-friendly! I'll respond to those I plan to include. | Best Spring Break Destinations for Families With Young Kids </code> | <code>Hi Ysolt, Hope you're doing well! For your story on the best Spring Break Destinations for Families with Young Kids, I wanted to put forward Cornwall, England for potential consideration. Destination: Cornwall, England Why: Cornwall is the ultimate British beach holiday location in the Spring as the weather gets noticeably warmer here much earlier than in other parts of Britain. The golden sand and turquoise water rivals that of the Caribbean. The quaint fishing villages and stunning beaches make for the perfect family break. During the summer months there will be ice cream on almost every corner, restaurants with delicious locally sourced fish dishes and coves to explore at low tide. It is beautiful corner of England that also offers fantastic car free biking trails (The Camel Trail) ideal for all ages and abilities. Coastal walks providing dramatic cliff tops, film locations from Poldark and more miles of sandy beaches for sandcastles or sunbathing! Quote from Gaby Cecil, Commercial ...</code> | <code>0.89</code> |
498
+ | <code> Hi there, I'm looking for a telecoms engineer to discuss ways to connect to a satellite during a service outage, and share are more similar tips. | Looking for a telecoms engineer </code> | <code>Hi Maria, Cool story you're working on here. I work with Benchmark Electronics, an advanced manufacturing and engineering company based in Arizona. They do A LOT of telecom work (https://www.bench.com/next-gen-communications) and I'd love to find an engineer for you to speak with there. I have one question for you on this though, is the listed deadline you have for collecting sources or is it for getting answers to the questions you have? If its the former, I can package these up for some folks and work to get them to you but if its the latter the turn around may be too tight. Please let me know either way.</code> | <code>0.8</code> |
499
+ * Loss: [<code>CosineSimilarityLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosinesimilarityloss) with these parameters:
500
+ ```json
501
+ {
502
+ "loss_fct": "torch.nn.modules.loss.MSELoss"
503
+ }
504
+ ```
505
+
506
+ ### Training Hyperparameters
507
+ #### Non-Default Hyperparameters
508
+
509
+ - `eval_strategy`: steps
510
+ - `per_device_train_batch_size`: 32
511
+ - `per_device_eval_batch_size`: 64
512
+ - `gradient_accumulation_steps`: 2
513
+ - `learning_rate`: 0.0001
514
+ - `num_train_epochs`: 7
515
+ - `warmup_ratio`: 0.2
516
+ - `load_best_model_at_end`: True
517
+
518
+ #### All Hyperparameters
519
+ <details><summary>Click to expand</summary>
520
+
521
+ - `overwrite_output_dir`: False
522
+ - `do_predict`: False
523
+ - `eval_strategy`: steps
524
+ - `prediction_loss_only`: True
525
+ - `per_device_train_batch_size`: 32
526
+ - `per_device_eval_batch_size`: 64
527
+ - `per_gpu_train_batch_size`: None
528
+ - `per_gpu_eval_batch_size`: None
529
+ - `gradient_accumulation_steps`: 2
530
+ - `eval_accumulation_steps`: None
531
+ - `torch_empty_cache_steps`: None
532
+ - `learning_rate`: 0.0001
533
+ - `weight_decay`: 0.0
534
+ - `adam_beta1`: 0.9
535
+ - `adam_beta2`: 0.999
536
+ - `adam_epsilon`: 1e-08
537
+ - `max_grad_norm`: 1.0
538
+ - `num_train_epochs`: 7
539
+ - `max_steps`: -1
540
+ - `lr_scheduler_type`: linear
541
+ - `lr_scheduler_kwargs`: {}
542
+ - `warmup_ratio`: 0.2
543
+ - `warmup_steps`: 0
544
+ - `log_level`: passive
545
+ - `log_level_replica`: warning
546
+ - `log_on_each_node`: True
547
+ - `logging_nan_inf_filter`: True
548
+ - `save_safetensors`: True
549
+ - `save_on_each_node`: False
550
+ - `save_only_model`: False
551
+ - `restore_callback_states_from_checkpoint`: False
552
+ - `no_cuda`: False
553
+ - `use_cpu`: False
554
+ - `use_mps_device`: False
555
+ - `seed`: 42
556
+ - `data_seed`: None
557
+ - `jit_mode_eval`: False
558
+ - `use_ipex`: False
559
+ - `bf16`: False
560
+ - `fp16`: False
561
+ - `fp16_opt_level`: O1
562
+ - `half_precision_backend`: auto
563
+ - `bf16_full_eval`: False
564
+ - `fp16_full_eval`: False
565
+ - `tf32`: None
566
+ - `local_rank`: 0
567
+ - `ddp_backend`: None
568
+ - `tpu_num_cores`: None
569
+ - `tpu_metrics_debug`: False
570
+ - `debug`: []
571
+ - `dataloader_drop_last`: False
572
+ - `dataloader_num_workers`: 0
573
+ - `dataloader_prefetch_factor`: None
574
+ - `past_index`: -1
575
+ - `disable_tqdm`: False
576
+ - `remove_unused_columns`: True
577
+ - `label_names`: None
578
+ - `load_best_model_at_end`: True
579
+ - `ignore_data_skip`: False
580
+ - `fsdp`: []
581
+ - `fsdp_min_num_params`: 0
582
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
583
+ - `fsdp_transformer_layer_cls_to_wrap`: None
584
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
585
+ - `deepspeed`: None
586
+ - `label_smoothing_factor`: 0.0
587
+ - `optim`: adamw_torch
588
+ - `optim_args`: None
589
+ - `adafactor`: False
590
+ - `group_by_length`: False
591
+ - `length_column_name`: length
592
+ - `ddp_find_unused_parameters`: None
593
+ - `ddp_bucket_cap_mb`: None
594
+ - `ddp_broadcast_buffers`: False
595
+ - `dataloader_pin_memory`: True
596
+ - `dataloader_persistent_workers`: False
597
+ - `skip_memory_metrics`: True
598
+ - `use_legacy_prediction_loop`: False
599
+ - `push_to_hub`: False
600
+ - `resume_from_checkpoint`: None
601
+ - `hub_model_id`: None
602
+ - `hub_strategy`: every_save
603
+ - `hub_private_repo`: None
604
+ - `hub_always_push`: False
605
+ - `hub_revision`: None
606
+ - `gradient_checkpointing`: False
607
+ - `gradient_checkpointing_kwargs`: None
608
+ - `include_inputs_for_metrics`: False
609
+ - `include_for_metrics`: []
610
+ - `eval_do_concat_batches`: True
611
+ - `fp16_backend`: auto
612
+ - `push_to_hub_model_id`: None
613
+ - `push_to_hub_organization`: None
614
+ - `mp_parameters`:
615
+ - `auto_find_batch_size`: False
616
+ - `full_determinism`: False
617
+ - `torchdynamo`: None
618
+ - `ray_scope`: last
619
+ - `ddp_timeout`: 1800
620
+ - `torch_compile`: False
621
+ - `torch_compile_backend`: None
622
+ - `torch_compile_mode`: None
623
+ - `include_tokens_per_second`: False
624
+ - `include_num_input_tokens_seen`: False
625
+ - `neftune_noise_alpha`: None
626
+ - `optim_target_modules`: None
627
+ - `batch_eval_metrics`: False
628
+ - `eval_on_start`: False
629
+ - `use_liger_kernel`: False
630
+ - `liger_kernel_config`: None
631
+ - `eval_use_gather_object`: False
632
+ - `average_tokens_across_devices`: False
633
+ - `prompts`: None
634
+ - `batch_sampler`: batch_sampler
635
+ - `multi_dataset_batch_sampler`: proportional
636
+ - `router_mapping`: {}
637
+ - `learning_rate_mapping`: {}
638
+
639
+ </details>
640
+
641
+ ### Training Logs
642
+ | Epoch | Step | Training Loss | Validation Loss | sts-dev_spearman_cosine |
643
+ |:----------:|:-------:|:-------------:|:---------------:|:-----------------------:|
644
+ | -1 | -1 | - | - | 0.7685 |
645
+ | 0.3333 | 10 | 11.2084 | 0.0593 | 0.7714 |
646
+ | 0.6667 | 20 | 0.0316 | 0.0308 | 0.7794 |
647
+ | 1.0 | 30 | 0.0245 | 0.0334 | 0.7870 |
648
+ | 1.3333 | 40 | 0.0136 | 0.0246 | 0.7978 |
649
+ | 1.6667 | 50 | 11.0605 | 0.0233 | 0.8154 |
650
+ | 2.0 | 60 | 0.0155 | 0.0234 | 0.8161 |
651
+ | 2.3333 | 70 | 0.0125 | 0.0244 | 0.8331 |
652
+ | 2.6667 | 80 | 11.0853 | 0.0230 | 0.8264 |
653
+ | 3.0 | 90 | 0.0116 | 0.0225 | 0.8261 |
654
+ | 3.3333 | 100 | 0.0071 | 0.0226 | 0.8279 |
655
+ | 3.6667 | 110 | 11.0368 | 0.0227 | 0.8165 |
656
+ | 4.0 | 120 | 0.0072 | 0.0226 | 0.8145 |
657
+ | **4.3333** | **130** | **0.005** | **0.0222** | **0.8194** |
658
+ | 4.6667 | 140 | 0.0054 | 0.0225 | 0.8193 |
659
+ | 5.0 | 150 | 11.034 | 0.0226 | 0.8103 |
660
+ | 5.3333 | 160 | 0.0036 | 0.0225 | 0.8150 |
661
+ | 5.6667 | 170 | 0.0038 | 0.0228 | 0.8193 |
662
+ | 6.0 | 180 | 11.0309 | 0.0224 | 0.8168 |
663
+ | 6.3333 | 190 | 11.0297 | 0.0224 | 0.8163 |
664
+ | 6.6667 | 200 | 0.0029 | 0.0225 | 0.8165 |
665
+ | 7.0 | 210 | 0.0026 | 0.0225 | 0.8171 |
666
+
667
+ * The bold row denotes the saved checkpoint.
668
+
669
+ ### Framework Versions
670
+ - Python: 3.11.13
671
+ - Sentence Transformers: 5.0.0
672
+ - Transformers: 4.55.0
673
+ - PyTorch: 2.6.0+cu124
674
+ - Accelerate: 1.9.0
675
+ - Datasets: 4.0.0
676
+ - Tokenizers: 0.21.4
677
+
678
+ ## Citation
679
+
680
+ ### BibTeX
681
+
682
+ #### Sentence Transformers
683
+ ```bibtex
684
+ @inproceedings{reimers-2019-sentence-bert,
685
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
686
+ author = "Reimers, Nils and Gurevych, Iryna",
687
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
688
+ month = "11",
689
+ year = "2019",
690
+ publisher = "Association for Computational Linguistics",
691
+ url = "https://arxiv.org/abs/1908.10084",
692
+ }
693
+ ```
694
+
695
+ <!--
696
+ ## Glossary
697
+
698
+ *Clearly define terms in order to be accessible across audiences.*
699
+ -->
700
+
701
+ <!--
702
+ ## Model Card Authors
703
+
704
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
705
+ -->
706
+
707
+ <!--
708
+ ## Model Card Contact
709
+
710
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
711
+ -->
config.json ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "BertModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "classifier_dropout": null,
7
+ "gradient_checkpointing": false,
8
+ "hidden_act": "gelu",
9
+ "hidden_dropout_prob": 0.1,
10
+ "hidden_size": 384,
11
+ "initializer_range": 0.02,
12
+ "intermediate_size": 1536,
13
+ "layer_norm_eps": 1e-12,
14
+ "max_position_embeddings": 512,
15
+ "model_type": "bert",
16
+ "num_attention_heads": 12,
17
+ "num_hidden_layers": 6,
18
+ "pad_token_id": 0,
19
+ "position_embedding_type": "absolute",
20
+ "torch_dtype": "float32",
21
+ "transformers_version": "4.55.0",
22
+ "type_vocab_size": 2,
23
+ "use_cache": true,
24
+ "vocab_size": 30522
25
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "5.0.0",
4
+ "transformers": "4.55.0",
5
+ "pytorch": "2.6.0+cu124"
6
+ },
7
+ "model_type": "SentenceTransformer",
8
+ "prompts": {
9
+ "query": "",
10
+ "document": ""
11
+ },
12
+ "default_prompt_name": null,
13
+ "similarity_fn_name": "cosine"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a6bc245f81456a983779f90d820df3372a1049d853978efa3a068c238103fcf1
3
+ size 90864192
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 256,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": false,
45
+ "cls_token": "[CLS]",
46
+ "do_basic_tokenize": true,
47
+ "do_lower_case": true,
48
+ "extra_special_tokens": {},
49
+ "mask_token": "[MASK]",
50
+ "max_length": 128,
51
+ "model_max_length": 256,
52
+ "never_split": null,
53
+ "pad_to_multiple_of": null,
54
+ "pad_token": "[PAD]",
55
+ "pad_token_type_id": 0,
56
+ "padding_side": "right",
57
+ "sep_token": "[SEP]",
58
+ "stride": 0,
59
+ "strip_accents": null,
60
+ "tokenize_chinese_chars": true,
61
+ "tokenizer_class": "BertTokenizer",
62
+ "truncation_side": "right",
63
+ "truncation_strategy": "longest_first",
64
+ "unk_token": "[UNK]"
65
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff