ar9av commited on
Commit
19a5ed9
·
verified ·
1 Parent(s): f626f5a

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,834 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - dense
7
+ - generated_from_trainer
8
+ - dataset_size:10000
9
+ - loss:DenoisingAutoEncoderLoss
10
+ base_model: google-bert/bert-base-uncased
11
+ widget:
12
+ - source_sentence: . the land and management meeting of 5th is 7:01 the order Um,
13
+ is roll . we have us, Nicole, and, . Uh, the is the of of October, . Do have,,
14
+ are there any, All right,, all favor approving your that is Um to participation
15
+ Is would like see Miss> hear>> Yes,>>,>, so I'm sorry uh echoing? we . We hear
16
+ little echo, but we you . Uh f first all, record, Diane Uh
17
+ sentences:
18
+ - 'All right. Good evening and welcome to
19
+
20
+ the land use and building management
21
+
22
+ committee meeting of November 5th, 2025. It is 7:01 and I am calling the meeting
23
+
24
+ to order. Um, first item is roll call. Tonight we have with us council members
25
+
26
+ Jim Fyer, Nicole Edy, Nicole Ays, and
27
+
28
+ myself, Barbara Smith. Uh, next item on the agenda is the
29
+
30
+ acceptance of the minutes of our meeting
31
+
32
+ of October 1, 2025. Do I have a motion,
33
+
34
+ Miss Edy, um, are there any corrections,
35
+
36
+ All right, seeing none, all in favor of
37
+
38
+ approving the minutes, please raise your
39
+
40
+ Okay, that is unanimous. Um, moving on to public participation. Is there anyone
41
+
42
+ here who would like to speak? I see Miss
43
+
44
+ >> Yes, good. Can you hear me? >> Yes, we can. >> Yes, we can. >> I am in a hallway,
45
+ so I''m sorry for any
46
+
47
+ uh echoing. I hope it''s Can you still
48
+
49
+ understand me? >> Yes, we can. We hear a little echo, but
50
+
51
+ we can still hear you. Uh anyway, f first of all, good evening. For the record,
52
+ my name is Diane
53
+
54
+ Laurisella. Uh'
55
+ - I'll now call to order the November 6, 2025 meeting of Mayor and Council. I'm
56
+ going to call on the Mayor Pro Tem Andy Gibbs to lead us in prayer. Please remain
57
+ standing for the pledge. Body heads, please. Dear me, Father, just thank you for
58
+ this evening. Thank you for bringing us all together. Lord, I just ask you that
59
+ as we look at the different things on our agenda tonight, that you help us to
60
+ make the best decision for our community to move to move our community forward.
61
+ Lord, I just ask you, Lord, to continue to put your arms around us for protection.
62
+ To put your arms around us, your protection. I thank you, Lord, for sending your
63
+ son to die on the cross for our sins, Lord. I thank you for the things that you've
64
+ given us in our life, the mercy of the grace that you've extended to us each and
65
+ every day to allow us to be able to wake up and walk among other people and help
66
+ us to serve others as you want us to serve, Lord. And I thank you for everything
67
+ you do for us. Your name, I pra
68
+ - In the NOSERT room today, the link can be found on the Town of Orleans website.
69
+ And calling the meeting to order. Any agenda changes? None this evening, this
70
+ afternoon, sorry. Is there anyone for citizens speak? No one additional online.
71
+ Okay, seeing none, we're going to go into priority business and we are going to
72
+ start with our assistant superintendent. Wonderful. Thank you very much. Thank
73
+ you. So I'm here today to share two things, our homeschooling process and then
74
+ just an update on our grant. We can't hear you. Yeah. So sorry to enter. I was
75
+ just going to interrupt. Just make sure that you pull this mic up and for everybody.
76
+ Is this better? Okay, Parker. Sorry about that. Sorry. So I'm here today. Thank
77
+ you for that. Just to talk about our homeschooling process and our grant process
78
+ as well. So I'll go over our homeschooling process. I know of detailed information
79
+ in the memo and I've included some links to our forms so you'll be able to see
80
+ it and I'll just give a high-level ov
81
+ - source_sentence: Um, and think these looking perhaps some feedback from what issues
82
+ that need And you know just a the formula ., it based did work 1990 so and've
83
+ and that address then uh they did student act was meant uh needs your communities
84
+ urban Right now though large with towns . where are being because need of bill
85
+ . especially the you'm to little . when hold harmless and you it certified would
86
+ t
87
+ sentences:
88
+ - 'Um, and so I think these are the minds
89
+
90
+ that they''re looking to perhaps get some
91
+
92
+ feedback from about what are the the
93
+
94
+ issues that need to be tackled. And I
95
+
96
+ think that, you know, just a couple
97
+
98
+ things about the funding formula. Look,
99
+
100
+ it''s based based they did the work back
101
+
102
+ in 1990 or so and they''ve made
103
+
104
+ adjustments and they''ve made made
105
+
106
+ probably two I think primary adjustments
107
+
108
+ since that time. One was to address
109
+
110
+ vocational needs and then uh they did
111
+
112
+ the student opportunity act which was
113
+
114
+ meant to address uh the needs of
115
+
116
+ students in uh your gateway communities
117
+
118
+ in urban areas. Right now though,
119
+
120
+ there''s a large contingent of districts
121
+
122
+ in with towns such as Rutland, Paxton,
123
+
124
+ Hold it. Um where they are being asked
125
+
126
+ because of this old formula, you need to
127
+
128
+ put more of the bill. Um and then
129
+
130
+ especially in the context, and I know
131
+
132
+ you asked this question, I''m going to
133
+
134
+ talk about it a little bit. Um when
135
+
136
+ districts fall into hold harmless, and
137
+
138
+ you know, it hasn''t been certified
139
+
140
+ officially, but I would t'
141
+ - 'Good evening and welcome to the uh
142
+
143
+ November 5th board meeting of the Topeka
144
+
145
+ Development Corporation Board of
146
+
147
+ Directors. I''ll call this meeting to
148
+
149
+ If we could stand for the pledge of
150
+
151
+ allegiance, please. I pledge algiance to the flag of the
152
+
153
+ United States of America and to the
154
+
155
+ republic for which it stands. One nation
156
+
157
+ under God, indivisible, with liberty and
158
+
159
+ justice for all. >> Okay. Madame clerk, could call the role,
160
+
161
+ >> President Padilla,
162
+
163
+ >> here. Vice President Dolbler
164
+
165
+ >> Directors Hiller
166
+
167
+ >> here. Banks
168
+
169
+ and Hoer
170
+
171
+ >> All right. We have seven present. Seven present. We''ll move on to item
172
+
173
+ number four on the agenda. Madam Clerk,
174
+
175
+ >> for approval of the August 12, 2025
176
+
177
+ meeting minutes. >> Okay. I hope uh everyone has had an
178
+
179
+ opportunity to look at the minutes
180
+
181
+ from August meeting. >> Motion to approve by Councilman Banks,
182
+
183
+ second by Councilman Miller. Take the
184
+
185
+ roll call, please. >> All right. Uh President Padilla
186
+
187
+ >> here. I Yes. Vic'
188
+ - You seem loud today. All right, it's 8 a.m. I'll go ahead and call this meeting
189
+ to order. Today is Tuesday, November 8th, the meeting in the formal boardroom
190
+ with the Genoaxi Public Service Center. First on our agenda are minutes. We have
191
+ none and no claims. All right, great. We'll move right into our regular agenda.
192
+ The board will meet with elected officials and the department heads discuss the
193
+ following topics. We'll start with fiscal year 2027 budget calendar initiatives
194
+ and guidelines. Good morning, Sarah. Good morning, Sarah Barrows, Budget Director.
195
+ The board asked that we just go over a few things for the FY27 budget. Kind of
196
+ hard to believe we're already there, but it is time. We met yesterday with the
197
+ board and talked about some guidelines and initiatives that they could include
198
+ for the FY27 budget. A lot of them you'll be very familiar with. 25% ending fund
199
+ balance, things like that. The one thing that they have yet to discuss tomorrow
200
+ is the placeholder that we'll include fo
201
+ - source_sentence: we're of . That is year a good time, think's . So let's get back
202
+ So we're having it quickly first ring . think they got the picture . I if of know
203
+ we're to go this Each and every have the handout given you, City of Ridge fiscal
204
+ year 2026 . If, . You again,'ll revising it You guys will of next with Mandy,
205
+ there anything we need do Speak Mandy got a said ca us sorry Council disagrees
206
+ me you scheduled to adopt a 2026
207
+ sentences:
208
+ - 'So, I''d like to make a motion to come
209
+
210
+ out of non-public session. >> Okay, we have a motion to come out of
211
+
212
+ non-public and a second. All in favor? I
213
+
214
+ >> opposed. Okay, motion carries. 70. >> Um, and I''d also like to make a motion
215
+
216
+ to seal the minutes indefinitely. >> Second that. >> Motion to seal the minutes
217
+ indefinitely
218
+
219
+ and a second. Uh, all in favor? Any
220
+
221
+ Great. Motion carry 70. >> Good evening. Welcome to the Jerry Town
222
+
223
+ Council meeting for Tuesday, November
224
+
225
+ 4th, 2025. >> Mr. Chair, excuse me. If we could make
226
+
227
+ sure everyone''s microphone is turned on,
228
+
229
+ green light at the base of the stem. Thank you. >> Everyone''s microphone on.
230
+ >> All right. council Spencer, please lead us pledge
231
+
232
+ algiance to the flag of the United
233
+
234
+ States of America to the republic for it
235
+
236
+ stands one nation
237
+
238
+ indivisible with liberty and justice for
239
+
240
+ >> Thank you. Uh, prevent emergency exits
241
+
242
+ in the back of the room. Mr. The Havens
243
+
244
+ are glad to help you. Um,
245
+
246
+ do a roll ca'
247
+ - 'I pledge allegiance to the flag of the
248
+
249
+ United States of America to the republic
250
+
251
+ for it stands one nationy
252
+
253
+ and justice for all. Please remain standing for a moment of
254
+
255
+ silent reflection for all the service
256
+
257
+ men and women throughout the world and
258
+
259
+ for those who have passed away in our
260
+
261
+ community, especially our former mayor,
262
+
263
+ David J. Wenszel. Thank you. >> Roll call, please. >> Mr. King,
264
+
265
+ >> Mr. Schuster,
266
+
267
+ >> Dr. Mr. Rothschild
268
+
269
+ >> Mr. McCandrew
270
+
271
+ >> Mr. Small
272
+
273
+ >> Dispatch with three of the minutes. >> Third order 3A correspondence dated
274
+
275
+ October 28th, 2025 from the Interite
276
+
277
+ Bike Coalition Executive Director Kuba
278
+
279
+ James regarding support for the downtown
280
+
281
+ Scranton Walkability Streetscape
282
+
283
+ project. 3B. Letters of support from
284
+
285
+ local businesses for the walkability and
286
+
287
+ streetscape project ordinances. 3C. Letter from Fire Tree Limited received
288
+
289
+ October 27th, 2025 regarding residential
290
+
291
+ re-entry center services. >> Are there any comments on any of the
292
+
293
+ thir'
294
+ - Yeah, we're kind of talking about everything. That is. Any time of the year is
295
+ a good time to catch fish. Okay, I think it's been about five minutes. So let's
296
+ get started back. So we're having one next one. We're doing it quickly. So this
297
+ is the first ring. I think they got the picture. Okay. Okay, I think if you can
298
+ just kind of, I know we're not going to go over this. Each and every one of you
299
+ in here should have the handout that's given to you, which says, City of Blue
300
+ Ridge, proposed fiscal year budget for 2026. If you will, please take the opportunity.
301
+ You know, this is, again, we'll be revising it. You guys will at the beginning
302
+ of next year, hopefully, with better numbers. Mandy, is there anything we need
303
+ to do with this tonight? Speak into the microphone, Mandy. I got a phone call
304
+ that said they can't hear us. I'm sorry. Council does have an option, unless Chuck
305
+ disagrees with me. I'm sorry to put you on the spot. You are scheduled to adopt
306
+ a 2026 budget next week. I believe, I
307
+ - source_sentence: The City Buffalo Planning ⚫ November 17, NY 14202 http //www.buffalony.gov
308
+ Morrell Regular Planning Board 4:00 PM Hall, Sq., NY . called order 4:06 PM by
309
+ Vice Chairman Schwartz Name Absent   Cynthia Schwartz  Martha Scott Billman
310
+  Arthur      Samantha  Meeting be Via Videoconferencing and Anyone Wishes
311
+ 912 City or Videoconferencing ., or Call 1-415-655-0001 Meeting 173 2665 Password
312
+ PublicComment (Not for) II . Approval of Minutes 1 . - Meetin
313
+ sentences:
314
+ - "THE CITY OF BUFFALO \n \nThe City of Buffalo Planning Board \n912 City Hall \n\
315
+ Minutes ⚫ November 17, 2025 \n \nBuffalo, NY 14202 \n \n \nhttp://www.buffalony.gov\
316
+ \ \n \n \nJames Morrell \nRegular Planning Board Meeting \nOSP Conference Room\
317
+ \ \n4:00 PM \n \n912 City Hall, 65 Niagara Sq., Buffalo, NY 14202 \nI. Call to\
318
+ \ Order \n The meeting was called to order at 4:06 PM by Vice Chairman Cynthia\
319
+ \ Schwartz \nAttendee Name \nPresent \nAbsent \nLate \nArrived \nJames Morrell\
320
+ \ \n \n \n \n \nCynthia Schwartz \n \n \n \n \nMartha Lamparelli \n \n\
321
+  \n \n \nScott Billman \n \n \n \n \nArthur Hall \n \n \n \n \nLen Sciolino\
322
+ \ \n \n \n \n \nSamantha Soltis \n \n \n \n \nThis Meeting Will be Held\
323
+ \ Via Videoconferencing and In-Person. Anyone who Wishes to Attend \nMay Do So\
324
+ \ in Room 912 City Hall or Via Videoconferencing. to Join Online, Go to \nWww.Webex.Com\
325
+ \ or Call 1-415-655-0001; Meeting Number 173 504 2665 Password: \nPublicComment\
326
+ \ (Not Required for Phone) \nII. Approval of Meeting Minutes \n1. Planning Board\
327
+ \ - Regular Planning Board Meetin"
328
+ - 'p.m. and I''m calling the Hamilton
329
+
330
+ Planning Board meeting to order. This meeting is being recorded. Uh to aid the
331
+ transcriptionist, I''m
332
+
333
+ going to take roles. So when I call your
334
+
335
+ name, would you please indicate that
336
+
337
+ you''re present? Pat Norton,
338
+
339
+ >> Jonathan Poor,
340
+
341
+ >> Jonathan Poor, present. >> Amal Dawquist,
342
+
343
+ >> Amal Dawquist, present. >> Darcy Dale,
344
+
345
+ >> Darcy Dale, present. >> And Marne Crouch, present. And I note
346
+
347
+ for the record that Beth her is absent. Um, her father passed away and we''re
348
+
349
+ very sorry to hear that. And, uh,
350
+
351
+ perhaps Bill uh will uh appear. I know
352
+
353
+ Matt Hamill, he was on the Cape, right? It''s unlikely he''ll be here and I don''t
354
+
355
+ believe we''ve heard from Jeff Austin. >> Uh the first item on our agenda is to
356
+
357
+ review the minutes from October 21st. Um
358
+
359
+ do I have a motion to approve those uh
360
+
361
+ >> So So moved. >> Do I have a second? Any discussion about
362
+
363
+ the minutes? >> Uh Darcy, I think you found a few typos. Yeah, just a coup'
364
+ - 'We the citizens of the city of Gilroy as
365
+
366
+ we come to this room tonight as a
367
+
368
+ complex human crowd carrying burdens,
369
+
370
+ hope, joy, and trusting that you alone
371
+
372
+ can hold us together. God, we pray for
373
+
374
+ your love and your peace. Help us to
375
+
376
+ trust you, Lord, more than our opinions,
377
+
378
+ our fears, and even our accomplishments. May we turn to you for your truth. Lastly,
379
+ fill us with grace, Lord, as
380
+
381
+ these leaders make decisions tonight
382
+
383
+ that will affect each one of us. Continue to remind us that all that this
384
+
385
+ day is done, all that is done here this
386
+
387
+ day and all that is accomplished is for
388
+
389
+ the greater glory of you. Please bless
390
+
391
+ these proceedings. We ask for these
392
+
393
+ things in your name. Amen. Thank you. Thank you, pastor. Moving on to item
394
+
395
+ 1.6. Six. Orders of the day we have
396
+
397
+ none. Moving on to item 1.7, employee
398
+
399
+ Finance director Harjo Singha will
400
+
401
+ introduce a new employee. Thank you. Thank you, Mayor. Um, tonight I have the
402
+
403
+ privilege of introducing Naelli
404
+
405
+ Martinez. Uh, she joins us in the
406
+
407
+ utility billing team uh'
408
+ - source_sentence: right o'clock Oh Jack here . Man brought team 3rd November . going'll
409
+ to . We Pledge of and . And we got guests Manager Brantley'll of . the . I appeal
410
+ flag the United States of and to Republic which,, for . Let . God, come before
411
+ this grateful live wonderful community We serve our community members in our respective
412
+ capacities morning and give us eyes and full hearts, Lord, as handle business
413
+ Thank you your, In Christ name amen Thank Price the pledge . First item is
414
+ sentences:
415
+ - All right, it's nine o'clock. Oh, Jack's here too. Man, y'all brought the whole
416
+ team. 9 o'clock, November 3rd, November already. Gosh. Time keeps on going. We'll
417
+ call the meeting to order. We'll open with the Pledge of Allegiance and prayer.
418
+ And since we've got special guests, I'll ask Town Manager Brantley Price if he'll
419
+ lead us in the Pledge of Allegiance. And I'll say the prayer. I appeal to the
420
+ flag of the United States of America and to the Republic for which it stands,
421
+ one nation under God, indivisible, with liberty and justice for all. Let's pray
422
+ together. God, we come before you this morning with grateful hearts to live in
423
+ such a wonderful community. We're grateful for the opportunity to serve our community
424
+ members in our respective capacities here this morning. Be with us and give us
425
+ clear eyes and full hearts, Lord, as we handle the people's business. Thank you
426
+ for your many, many blessings. In Christ's name, amen. Amen. Thank you, Manager
427
+ Price, for the pledge. First item is
428
+ - 'Notice of Meeting of the
429
+
430
+ Planning and Zoning Commission
431
+
432
+ of the City of Georgetown, Texas
433
+
434
+ November 4, 2025
435
+
436
+ The Georgetown Planning and Zoning Commission will meet on November 4, 2025 at
437
+ 6:00 PM at Council
438
+
439
+ and Court Building, 510 W. 9th Street. The City of Georgetown is committed to
440
+ compliance with the Americans with Disabilities Act (ADA). If you
441
+
442
+ require assistance in participating at a public meeting due to a disability, as
443
+ defined under the ADA,
444
+
445
+ reasonable assistance, adaptations, or accommodations will be provided upon request.
446
+ Please contact the
447
+
448
+ City Secretary''s Office, at least three (3) days prior to the scheduled meeting
449
+ date, at (512) 930-3652 or
450
+
451
+ City Hall at 808 Martin Luther King Jr. Street, Georgetown, TX 78626 for additional
452
+ information; TTY users
453
+
454
+ route through Relay Texas at 711. Notice of Cancellation
455
+
456
+ This regularly scheduled meeting of the Board is cancelled. Please check the posted
457
+ agendas for next
458
+
459
+ month''s meeting. Certificate of Posting
460
+
461
+ Certificate of Posting
462
+
463
+ I, Robyn'
464
+ - 'for Wednesday, November 12th
465
+
466
+ at 7:05 p.m. What you want? This meeting is being recorded. report anything. I
467
+ have
468
+
469
+ Yeah. agenda
470
+
471
+ but there were other items
472
+
473
+ >> that didn''t get to this print Right. >> Caught you off guard there. >> I know
474
+ you did. I''m not. Oh, so it
475
+
476
+ worked very well. >> what did what did we end up
477
+
478
+ because of the day? >> that''s all right. You''re fine. >> All right. So, we''re
479
+ still going to go
480
+
481
+ a wedding for
482
+
483
+ I think we''re gonna
484
+
485
+ want to go. All right. No problem. 1.7. First one is Parker Hill
486
+
487
+ Well, it''s already been
488
+
489
+ Conservation Commission. Yeah. And since it''s not supposed to be
490
+
491
+ what snacks,
492
+
493
+ >> By the way, I I did autographs. put your
494
+
495
+ your name on your agenda and stuff and
496
+
497
+ You mind just announcing
498
+
499
+ that you''ve arrived and what time? Did provide? Guess we should wait. kind of
500
+ that was
501
+
502
+ >> right. So what doesn''t go into the
503
+
504
+ general account instead
505
+
506
+ of meeting
507
+
508
+ I thought we had a
509
+
510
+ Can you read it off? Oh, you
511
+
512
+ You just repeat the number.'
513
+ pipeline_tag: sentence-similarity
514
+ library_name: sentence-transformers
515
+ ---
516
+
517
+ # SentenceTransformer based on google-bert/bert-base-uncased
518
+
519
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
520
+
521
+ ## Model Details
522
+
523
+ ### Model Description
524
+ - **Model Type:** Sentence Transformer
525
+ - **Base model:** [google-bert/bert-base-uncased](https://huggingface.co/google-bert/bert-base-uncased) <!-- at revision 86b5e0934494bd15c9632b12f734a8a67f723594 -->
526
+ - **Maximum Sequence Length:** 256 tokens
527
+ - **Output Dimensionality:** 768 dimensions
528
+ - **Similarity Function:** Cosine Similarity
529
+ <!-- - **Training Dataset:** Unknown -->
530
+ <!-- - **Language:** Unknown -->
531
+ <!-- - **License:** Unknown -->
532
+
533
+ ### Model Sources
534
+
535
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
536
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
537
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
538
+
539
+ ### Full Model Architecture
540
+
541
+ ```
542
+ SentenceTransformer(
543
+ (0): Transformer({'max_seq_length': 256, 'do_lower_case': False, 'architecture': 'BertModel'})
544
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
545
+ )
546
+ ```
547
+
548
+ ## Usage
549
+
550
+ ### Direct Usage (Sentence Transformers)
551
+
552
+ First install the Sentence Transformers library:
553
+
554
+ ```bash
555
+ pip install -U sentence-transformers
556
+ ```
557
+
558
+ Then you can load this model and run inference.
559
+ ```python
560
+ from sentence_transformers import SentenceTransformer
561
+
562
+ # Download from the 🤗 Hub
563
+ model = SentenceTransformer("ar9av/tsdae-civic-bert")
564
+ # Run inference
565
+ sentences = [
566
+ "right o'clock Oh Jack here . Man brought team 3rd November . going'll to . We Pledge of and . And we got guests Manager Brantley'll of . the . I appeal flag the United States of and to Republic which,, for . Let . God, come before this grateful live wonderful community We serve our community members in our respective capacities morning and give us eyes and full hearts, Lord, as handle business Thank you your, In Christ name amen Thank Price the pledge . First item is",
567
+ "All right, it's nine o'clock. Oh, Jack's here too. Man, y'all brought the whole team. 9 o'clock, November 3rd, November already. Gosh. Time keeps on going. We'll call the meeting to order. We'll open with the Pledge of Allegiance and prayer. And since we've got special guests, I'll ask Town Manager Brantley Price if he'll lead us in the Pledge of Allegiance. And I'll say the prayer. I appeal to the flag of the United States of America and to the Republic for which it stands, one nation under God, indivisible, with liberty and justice for all. Let's pray together. God, we come before you this morning with grateful hearts to live in such a wonderful community. We're grateful for the opportunity to serve our community members in our respective capacities here this morning. Be with us and give us clear eyes and full hearts, Lord, as we handle the people's business. Thank you for your many, many blessings. In Christ's name, amen. Amen. Thank you, Manager Price, for the pledge. First item is",
568
+ "for Wednesday, November 12th\nat 7:05 p.m. What you want? This meeting is being recorded. report anything. I have\nYeah. agenda\nbut there were other items\n>> that didn't get to this print Right. >> Caught you off guard there. >> I know you did. I'm not. Oh, so it\nworked very well. >> what did what did we end up\nbecause of the day? >> that's all right. You're fine. >> All right. So, we're still going to go\na wedding for\nI think we're gonna\nwant to go. All right. No problem. 1.7. First one is Parker Hill\nWell, it's already been\nConservation Commission. Yeah. And since it's not supposed to be\nwhat snacks,\n>> By the way, I I did autographs. put your\nyour name on your agenda and stuff and\nYou mind just announcing\nthat you've arrived and what time? Did provide? Guess we should wait. kind of that was\n>> right. So what doesn't go into the\ngeneral account instead\nof meeting\nI thought we had a\nCan you read it off? Oh, you\nYou just repeat the number.",
569
+ ]
570
+ embeddings = model.encode(sentences)
571
+ print(embeddings.shape)
572
+ # [3, 768]
573
+
574
+ # Get the similarity scores for the embeddings
575
+ similarities = model.similarity(embeddings, embeddings)
576
+ print(similarities)
577
+ # tensor([[1.0000, 0.8891, 0.8695],
578
+ # [0.8891, 1.0000, 0.8283],
579
+ # [0.8695, 0.8283, 1.0000]])
580
+ ```
581
+
582
+ <!--
583
+ ### Direct Usage (Transformers)
584
+
585
+ <details><summary>Click to see the direct usage in Transformers</summary>
586
+
587
+ </details>
588
+ -->
589
+
590
+ <!--
591
+ ### Downstream Usage (Sentence Transformers)
592
+
593
+ You can finetune this model on your own dataset.
594
+
595
+ <details><summary>Click to expand</summary>
596
+
597
+ </details>
598
+ -->
599
+
600
+ <!--
601
+ ### Out-of-Scope Use
602
+
603
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
604
+ -->
605
+
606
+ <!--
607
+ ## Bias, Risks and Limitations
608
+
609
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
610
+ -->
611
+
612
+ <!--
613
+ ### Recommendations
614
+
615
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
616
+ -->
617
+
618
+ ## Training Details
619
+
620
+ ### Training Dataset
621
+
622
+ #### Unnamed Dataset
623
+
624
+ * Size: 10,000 training samples
625
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
626
+ * Approximate statistics based on the first 1000 samples:
627
+ | | sentence_0 | sentence_1 |
628
+ |:--------|:-----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
629
+ | type | string | string |
630
+ | details | <ul><li>min: 3 tokens</li><li>mean: 91.75 tokens</li><li>max: 202 tokens</li></ul> | <ul><li>min: 13 tokens</li><li>mean: 223.56 tokens</li><li>max: 256 tokens</li></ul> |
631
+ * Samples:
632
+ | sentence_0 | sentence_1 |
633
+ |:-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
634
+ | <code>right I call this town council to order on Recognize is a quorum all those who attendance in those are in online . hello Davis are tonight . Thanks, I We having you . and expeditious tonight a big game JMU at . like have just a wo we've had two works who their a of for Our are with them time Ms. we have?? .? Here?? Here . Mr. Hunter? Here Hardy</code> | <code>All right, I'd like to call this town council meeting to order on this November 12, 2025. Recognize there is a quorum present. Welcome all those who are in attendance in person and those who are tuned in online. A special hello to Dr. Davis's local and state government class students who are here tonight. Thanks for coming, although I think you were probably told you had to. We appreciate having you. We'll try and be expeditious in the people's business tonight because I know there's a big game later this evening between Longwood and JMU here at home. I'd also like to have just a moment. I won't mention names, but we've had two of our public works employees who lost their wives here very recently. And so we'll just have a brief moment of silence for them. Thank you. Our prayers and our hearts are with them at this difficult time. Ms. McKay, can we have a roll call, please? Mrs. Amos? Here. Mr. Reed? Here. Mr. Dwyer? Here. Mr. Parrott? Here. Mr. Yoland. Here. Mr. Hunter? Here. Mr. Hardy</code> |
635
+ | <code>everyone November the City Columbia Board of Chair for meeting like introduce the members: Harding'm, Davis Whittle, Sidney Bang, Duvall also to introduce the staff the, Andrew, Board, Erica Hyan, Deputy and Madeline Land . is special, appeals . for the record, wishing and come the . No can floor When come the podium state your and speak the because meeting is recorded Applicants the board</code> | <code>Welcome, everyone, to the November meeting of the City of Columbia Board of Zoning Appeals. I am Catherine Fenner, Chair for the Board, and will be serving as the chair for today's meeting. I would like to introduce the other members of the board: Josh Harding, I'm sorry, Davis Whittle, Sidney Lanham, Jonathan Bang, and Sherard Duvall. I would also like to introduce the staff that assists the board, Andrew Livingood, Zoning Board Administrator, Erica Hyan, Deputy Zoning Administrator, and Madeline Bowden, Land Use Board Coordinator. The board is charged with hearing applications for special exceptions, variances, and administrative appeals. All testimony is recorded for the record, and anyone wishing to speak will need to be sworn in and come to the podium to speak. No testimony can be taken from the floor. When you come to the podium, state your name and please speak clearly into the microphone because this meeting is being recorded. Applicants with cases before the board are allotted</code> |
636
+ | <code>Corporation 7:30 WEDC Room 250 Highway Texas TO & PLEDGE OF ON ITEMS member the may Board not Agenda of the fill out a form meeting . that comments limited minutes for, six In addition, is not allowed to converse deliberate take on any presented CONSENT AGENDA matters Agenda are be routine by Board will be will not of items . discussion desired, that item will removed from the separately . act upon</code> | <code>Wylie Economic Development Corporation <br>Board Regular Meeting <br>November 19, 2025 – 7:30 AM <br>WEDC Office Conference Room - 250 South Highway 78, Wylie, Texas <br>75098 <br> <br> <br> <br> <br> <br>CALL TO ORDER <br>INVOCATION & PLEDGE OF ALLEGIANCE <br>COMMENTS ON NON-AGENDA ITEMS <br>Any member of the public may address Board regarding an item that is not listed on the Agenda. Members of the public must <br>fill out a form prior to the meeting in order to speak. Board requests that comments be limited to three minutes for an individual, <br>six minutes for a group. In addition, Board is not allowed to converse, deliberate or take action on any matter presented during <br>citizen participation. CONSENT AGENDA <br>All matters listed under the Consent Agenda are considered to be routine by the Board and will be enacted by one motion. There will not be separate discussion of these items. If discussion is desired, that item will be removed from the Consent Agenda <br>and will be considered separately. A. Consider and act upon appr</code> |
637
+ * Loss: [<code>DenoisingAutoEncoderLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#denoisingautoencoderloss)
638
+
639
+ ### Training Hyperparameters
640
+ #### Non-Default Hyperparameters
641
+
642
+ - `num_train_epochs`: 1
643
+ - `multi_dataset_batch_sampler`: round_robin
644
+
645
+ #### All Hyperparameters
646
+ <details><summary>Click to expand</summary>
647
+
648
+ - `overwrite_output_dir`: False
649
+ - `do_predict`: False
650
+ - `eval_strategy`: no
651
+ - `prediction_loss_only`: True
652
+ - `per_device_train_batch_size`: 8
653
+ - `per_device_eval_batch_size`: 8
654
+ - `per_gpu_train_batch_size`: None
655
+ - `per_gpu_eval_batch_size`: None
656
+ - `gradient_accumulation_steps`: 1
657
+ - `eval_accumulation_steps`: None
658
+ - `torch_empty_cache_steps`: None
659
+ - `learning_rate`: 5e-05
660
+ - `weight_decay`: 0.0
661
+ - `adam_beta1`: 0.9
662
+ - `adam_beta2`: 0.999
663
+ - `adam_epsilon`: 1e-08
664
+ - `max_grad_norm`: 1
665
+ - `num_train_epochs`: 1
666
+ - `max_steps`: -1
667
+ - `lr_scheduler_type`: linear
668
+ - `lr_scheduler_kwargs`: {}
669
+ - `warmup_ratio`: 0.0
670
+ - `warmup_steps`: 0
671
+ - `log_level`: passive
672
+ - `log_level_replica`: warning
673
+ - `log_on_each_node`: True
674
+ - `logging_nan_inf_filter`: True
675
+ - `save_safetensors`: True
676
+ - `save_on_each_node`: False
677
+ - `save_only_model`: False
678
+ - `restore_callback_states_from_checkpoint`: False
679
+ - `no_cuda`: False
680
+ - `use_cpu`: False
681
+ - `use_mps_device`: False
682
+ - `seed`: 42
683
+ - `data_seed`: None
684
+ - `jit_mode_eval`: False
685
+ - `bf16`: False
686
+ - `fp16`: False
687
+ - `fp16_opt_level`: O1
688
+ - `half_precision_backend`: auto
689
+ - `bf16_full_eval`: False
690
+ - `fp16_full_eval`: False
691
+ - `tf32`: None
692
+ - `local_rank`: 0
693
+ - `ddp_backend`: None
694
+ - `tpu_num_cores`: None
695
+ - `tpu_metrics_debug`: False
696
+ - `debug`: []
697
+ - `dataloader_drop_last`: False
698
+ - `dataloader_num_workers`: 0
699
+ - `dataloader_prefetch_factor`: None
700
+ - `past_index`: -1
701
+ - `disable_tqdm`: False
702
+ - `remove_unused_columns`: True
703
+ - `label_names`: None
704
+ - `load_best_model_at_end`: False
705
+ - `ignore_data_skip`: False
706
+ - `fsdp`: []
707
+ - `fsdp_min_num_params`: 0
708
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
709
+ - `fsdp_transformer_layer_cls_to_wrap`: None
710
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
711
+ - `parallelism_config`: None
712
+ - `deepspeed`: None
713
+ - `label_smoothing_factor`: 0.0
714
+ - `optim`: adamw_torch
715
+ - `optim_args`: None
716
+ - `adafactor`: False
717
+ - `group_by_length`: False
718
+ - `length_column_name`: length
719
+ - `project`: huggingface
720
+ - `trackio_space_id`: trackio
721
+ - `ddp_find_unused_parameters`: None
722
+ - `ddp_bucket_cap_mb`: None
723
+ - `ddp_broadcast_buffers`: False
724
+ - `dataloader_pin_memory`: True
725
+ - `dataloader_persistent_workers`: False
726
+ - `skip_memory_metrics`: True
727
+ - `use_legacy_prediction_loop`: False
728
+ - `push_to_hub`: False
729
+ - `resume_from_checkpoint`: None
730
+ - `hub_model_id`: None
731
+ - `hub_strategy`: every_save
732
+ - `hub_private_repo`: None
733
+ - `hub_always_push`: False
734
+ - `hub_revision`: None
735
+ - `gradient_checkpointing`: False
736
+ - `gradient_checkpointing_kwargs`: None
737
+ - `include_inputs_for_metrics`: False
738
+ - `include_for_metrics`: []
739
+ - `eval_do_concat_batches`: True
740
+ - `fp16_backend`: auto
741
+ - `push_to_hub_model_id`: None
742
+ - `push_to_hub_organization`: None
743
+ - `mp_parameters`:
744
+ - `auto_find_batch_size`: False
745
+ - `full_determinism`: False
746
+ - `torchdynamo`: None
747
+ - `ray_scope`: last
748
+ - `ddp_timeout`: 1800
749
+ - `torch_compile`: False
750
+ - `torch_compile_backend`: None
751
+ - `torch_compile_mode`: None
752
+ - `include_tokens_per_second`: False
753
+ - `include_num_input_tokens_seen`: no
754
+ - `neftune_noise_alpha`: None
755
+ - `optim_target_modules`: None
756
+ - `batch_eval_metrics`: False
757
+ - `eval_on_start`: False
758
+ - `use_liger_kernel`: False
759
+ - `liger_kernel_config`: None
760
+ - `eval_use_gather_object`: False
761
+ - `average_tokens_across_devices`: True
762
+ - `prompts`: None
763
+ - `batch_sampler`: batch_sampler
764
+ - `multi_dataset_batch_sampler`: round_robin
765
+ - `router_mapping`: {}
766
+ - `learning_rate_mapping`: {}
767
+
768
+ </details>
769
+
770
+ ### Training Logs
771
+ | Epoch | Step | Training Loss |
772
+ |:-----:|:----:|:-------------:|
773
+ | 0.4 | 500 | 4.3532 |
774
+ | 0.8 | 1000 | 3.4547 |
775
+
776
+
777
+ ### Framework Versions
778
+ - Python: 3.11.10
779
+ - Sentence Transformers: 5.2.0
780
+ - Transformers: 4.57.3
781
+ - PyTorch: 2.4.1+cu124
782
+ - Accelerate: 1.12.0
783
+ - Datasets: 4.4.2
784
+ - Tokenizers: 0.22.1
785
+
786
+ ## Citation
787
+
788
+ ### BibTeX
789
+
790
+ #### Sentence Transformers
791
+ ```bibtex
792
+ @inproceedings{reimers-2019-sentence-bert,
793
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
794
+ author = "Reimers, Nils and Gurevych, Iryna",
795
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
796
+ month = "11",
797
+ year = "2019",
798
+ publisher = "Association for Computational Linguistics",
799
+ url = "https://arxiv.org/abs/1908.10084",
800
+ }
801
+ ```
802
+
803
+ #### DenoisingAutoEncoderLoss
804
+ ```bibtex
805
+ @inproceedings{wang-2021-TSDAE,
806
+ title = "TSDAE: Using Transformer-based Sequential Denoising Auto-Encoderfor Unsupervised Sentence Embedding Learning",
807
+ author = "Wang, Kexin and Reimers, Nils and Gurevych, Iryna",
808
+ booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2021",
809
+ month = nov,
810
+ year = "2021",
811
+ address = "Punta Cana, Dominican Republic",
812
+ publisher = "Association for Computational Linguistics",
813
+ pages = "671--688",
814
+ url = "https://arxiv.org/abs/2104.06979",
815
+ }
816
+ ```
817
+
818
+ <!--
819
+ ## Glossary
820
+
821
+ *Clearly define terms in order to be accessible across audiences.*
822
+ -->
823
+
824
+ <!--
825
+ ## Model Card Authors
826
+
827
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
828
+ -->
829
+
830
+ <!--
831
+ ## Model Card Contact
832
+
833
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
834
+ -->
config.json ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "BertModel"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.1,
6
+ "classifier_dropout": null,
7
+ "dtype": "float32",
8
+ "gradient_checkpointing": false,
9
+ "hidden_act": "gelu",
10
+ "hidden_dropout_prob": 0.1,
11
+ "hidden_size": 768,
12
+ "initializer_range": 0.02,
13
+ "intermediate_size": 3072,
14
+ "layer_norm_eps": 1e-12,
15
+ "max_position_embeddings": 512,
16
+ "model_type": "bert",
17
+ "num_attention_heads": 12,
18
+ "num_hidden_layers": 12,
19
+ "pad_token_id": 0,
20
+ "position_embedding_type": "absolute",
21
+ "transformers_version": "4.57.3",
22
+ "type_vocab_size": 2,
23
+ "use_cache": true,
24
+ "vocab_size": 30522
25
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "model_type": "SentenceTransformer",
3
+ "__version__": {
4
+ "sentence_transformers": "5.2.0",
5
+ "transformers": "4.57.3",
6
+ "pytorch": "2.4.1+cu124"
7
+ },
8
+ "prompts": {
9
+ "query": "",
10
+ "document": ""
11
+ },
12
+ "default_prompt_name": null,
13
+ "similarity_fn_name": "cosine"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f0c1270d0165d9bc550d467a3350bb0addd86b288ad3b35e26de32234a81c453
3
+ size 437951328
modules.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ }
14
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 256,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,63 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "clean_up_tokenization_spaces": false,
45
+ "cls_token": "[CLS]",
46
+ "do_lower_case": true,
47
+ "extra_special_tokens": {},
48
+ "mask_token": "[MASK]",
49
+ "max_length": 256,
50
+ "model_max_length": 256,
51
+ "pad_to_multiple_of": null,
52
+ "pad_token": "[PAD]",
53
+ "pad_token_type_id": 0,
54
+ "padding_side": "right",
55
+ "sep_token": "[SEP]",
56
+ "stride": 0,
57
+ "strip_accents": null,
58
+ "tokenize_chinese_chars": true,
59
+ "tokenizer_class": "BertTokenizer",
60
+ "truncation_side": "right",
61
+ "truncation_strategy": "longest_first",
62
+ "unk_token": "[UNK]"
63
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff