File size: 62,118 Bytes
1f25fdc
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- dense
- generated_from_trainer
- dataset_size:88
- loss:CachedMultipleNegativesRankingLoss
base_model: BAAI/bge-base-en-v1.5
widget:
- source_sentence: the nearest technology center under tcsp is too far and fully booked
    so i can t access their advanced machinery skill training or tech advisory for
    upgrading my production line. local msmes like mine are losing competitiveness
    without nearby centers in delhi region. please set up or expand tcsp facilities
    closer and prioritize slots for small units to adopt modern tech fast. issue non-availability
    of nearby technology center under tcsp context the user is reporting that the
    nearest technology center under tcsp is too far and fully booked which is hindering
    access to advanced machinery skill training and tech advisory for upgrading the
    production line and is requesting the establishment or expansion of tcsp facilities
    closer to the delhi region to prioritize slots for small units. details - location
    delhi region requirement establishment or expansion of tcsp facilities priority
    slots for small units
  sentences:
  - Technology, Quality and Institutions. Testing, Quality, Testing Center. this category
    encompasses grievances related to msmes micro small and medium enterprises inability
    to access utilize or rely on government-recognized testing calibration inspection
    or certification services required for regulatory compliance tenders gem listing
    or exports. the category covers a range of issues including delays in the issuance
    of test reports despite samples being submitted and fees paid denial or non-issuance
    of quality or conformity certificates without clear reasons difficulties accessing
    testing or calibration facilities due to - capacity constraints - administrative
    refusal - non-functional equipment procedural and system-level barriers such as
    - unclear or changing documentation requirements - portal mismatches - fees paid
    but testing not scheduled situations where business losses occur due to market
    access being blocked due to pending testing or certification at authorized labs
    or msme testing centers. example issues include testing completed and fees paid
    but test report is not issued even after many weeks quality certification was
    rejected without written reasons despite compliance with guidelines testing center
    is refusing to accept samples citing workload while deadlines are approaching
    fees paid online but testing not scheduled due to portal or procedural issues
    tender or export shipment is blocked because the required test certificate is
    still pending at the testing lab. the purpose of this category is to capture grievances
    related to the operational procedural policy or institutional causes that hinder
    msmes access to government-recognized testing calibration inspection or certification
    services. the category aims to identify and address the root causes of these issues
    including capacity constraints at testing facilities inade
  - Policy and Schemes. Related to MSME Scheme. this category encompasses grievances
    related to central sector schemes directly administered by the ministry of micro
    small and medium enterprises momsme where the ministry itself serves as the implementing
    authority. the category includes schemes such as zero defect zero effect zed credit
    linked capital subsidy scheme clcss lean manufacturing and other centrally managed
    msme support programs. it covers cases where msmes have applied for scheme benefits
    or subsidies received approvals or completed required assessments or certifications
    but the approved financial assistance has not been released or credited. the category
    also captures grievances where claims submitted under ministry-run schemes for
    incentives reimbursements or financial support remain pending for extended periods
    or are rejected without clear or consistent justification. this includes cases
    of rejection due to alleged documentation gaps system-generated ineligibility
    flags disputes over eligible machinery or activities and delays caused by human
    or system-level verification failures. additionally the category includes grievances
    arising from ambiguity or confusion regarding scheme eligibility scope or applicability
    such as uncertainty over mandatory certifications eligibility of second-hand versus
    new machinery applicability to service enterprises or inconsistent interpretations
    of scheme rules by different central or state offices. the category further covers
    portal-related issues affecting scheme access and execution including technical
    errors during registration or document upload login or authentication failures
    contradictory status messages and non-updating dashboards for application claim
    or training progress. these issues typically arise due to system bugs integration
    gaps between multiple portals file format or size restrictions or delays in updating
    portal logic after scheme guideline revisions.
  - Technology, Quality and Institutions. Technology Center System Program (TCSP).
    the technology centre systems programme tcsp is a flagship initiative of the ministry
    of micro small and medium enterprises approved in <NUM> with world bank support
    to significantly strengthen india s msme manufacturing ecosystem through advanced
    technology access and skill development. with a total outlay of about <NUM> <NUM>
    crore initially and later expanded to around <NUM> <NUM> crore the programme modernizes
    <NUM> existing technology centres earlier tool rooms technology development centres
    and establishes <NUM> new technology centres along with <NUM> extension centres
    across the country. these centres are equipped with state-of-the-art infrastructure
    such as cnc machining additive manufacturing and 3d printing robotics and automation
    laser machining electronics system design and manufacturing facilities advanced
    metrology and testing equipment. tcsp aims to enhance msme competitiveness by
    providing localized access to modern manufacturing technologies short- and long-term
    industry-oriented training skilled manpower development prototyping and testing
    services incubation and entrepreneurship support and advisory services. the programme
    also promotes cluster-based development through manufacturing technology partners
    and technology cluster managers supported by a centralized erp system for service
    delivery thereby improving productivity innovation and integration of msmes into
    national and global value chains. examples of common grievances under the tcsp
    include operational delays a newly approved technology centre in an underserved
    state remains partially functional for years due to delayed procurement of machines
    preventing msmes from accessing prototyping and testing facilities. training capacity
    constraints popular courses such as robotics or cnc programming at extension centres
    become oversubscribed resulting in eligible msme workers being denied admission
    despite early registration. weak cluster linkages local technology cluster managers
    fail to connect msmes with oems academic institutions or research bodies limiting
    the intended collaboration and market access benefits. erp system issues msmes
    face repeated difficulties in booking training slots advisory services or accessing
    technical specifications due to downtime or glitches in the centralized erp portal.
    infrastructure shortfalls at extension centres an approved extension centre lacks
    promised metrology or testing equipment restricting its ability to deliver precision
    measurement and quality assurance services to nearby msmes.
- source_sentence: my grievance are related to yes bank ifsc code- yesb0000565 as
    per msme act and rbi policy i have open new digital kyc verified open the current
    ac in yes bank yes bank provide me ckyc no and my ac no are - <NUM> on - <NUM>
    - <NUM> - <NUM> after ac opening without my permission after few days bank close
    my ac and reply after after few days your ac are activate after four month bank
    not activate my ac and return my ac fund available balance through post so i request
    you please order and take action against bank why close digital ckyc verified
    ac by bank without permission and i request you please open new current ac in
    your bank then i have transferred my old returned ac avalanche dd in your new
    current ac please take action against bank as soon as possible. thanks. issue
    closure of current account without permission and request for new account opening
    context the user is reporting that their current account was closed without their
    permission in yes bank and they are requesting action against the bank and opening
    a new current account in the complainant s bank to transfer the returned balance.
    details - account type current account ifsc code yesb0000565 account number <NUM>
    account opening date <NUM> - <NUM> - <NUM> bank yes bank
  sentences:
  - Technology, Quality and Institutions. Building Awareness on Intellectual Property
    Rights (IPR) for the Micro, Small & Medium Enterprises- an NMCP Scheme. the building
    awareness on intellectual property rights ipr for msmes scheme under the national
    manufacturing competitiveness programme nmcp is an initiative of the ministry
    of msme aimed at sensitizing micro small and medium enterprises to the importance
    of protecting and leveraging intellectual property such as patents trademarks
    copyrights industrial designs and geographical indications. the scheme supports
    the organization of awareness and sensitization programmes including seminars
    and workshops with financial assistance up to <NUM> . <NUM> lakh per event conducts
    pilot studies to assess intellectual property needs in specific industrial clusters
    with support up to <NUM> lakh and facilitates the establishment of intellectual
    property facilitation centres ipfcs in collaboration with technical institutions
    such as tifac. it also provides reimbursement support for ipr registration expenses
    including assistance up to <NUM> lakh for geographical indication registration
    and prescribed limits for other ip filings with proposals approved through a structured
    mechanism involving a project implementation committee and a steering committee.
    overall the scheme seeks to integrate ipr awareness into msme business strategy
    encourage innovation protection strengthen cluster-based ip management and enhance
    competitiveness in liberalized domestic and global markets. examples of common
    grievances under the ipr awareness scheme include reimbursement cap limitation
    an msme incurs <NUM> . <NUM> lakh in patent filing and prosecution costs but receives
    reimbursement only up to the maximum admissible limit leaving part of the eligible
    expense unsupported. workshop content inadequacy an ipr seminar organized for
    a manufacturing cluster lacks subject-matter experts and focuses only on basic
    concepts despite prior identification of advanced patenting and commercialization
    needs. pilot study rejection a cluster-level proposal for an ip needs assessment
    study is rejected due to documentation or data gaps even though it is backed by
    an industry association and local msmes. restricted ipfc access an intellectual
    property facilitation centre limits filing and advisory support only to notified
    clusters denying assistance to otherwise eligible msmes outside those clusters.
    processing delays approved reimbursement or support for geographical indication
    registration remains pending for several months after committee approval affecting
    timely protection and commercialization of the product.
  - UAM/Udyam Registration/Certificate related issues. Updation of Email ID/Mobile
    No. Linked to UDYAM Certificate. this category includes grievances related to
    updating or correcting the email id or mobile number associated with an existing
    udyam registration. contact details provided during registration are used for
    communication verification and authentication when accessing the enterprise profile
    on the portal. if these contact details become outdated incorrect or inaccessible
    the enterprise owner may face difficulty receiving otps accessing the portal or
    managing the registration information. common grievances under this category include
    requests to change the registered mobile number or email address because the original
    number is no longer active the sim card has been lost the email account is no
    longer accessible or the contact details were entered incorrectly during registration.
    some complaints arise when the registered contact details belong to an employee
    or consultant who is no longer associated with the enterprise preventing the current
    owner from receiving verification messages. in other cases entrepreneurs report
    that they cannot update contact details because the system requires authentication
    through the old mobile number or email which they no longer have access to. these
    grievances are typically raised by msme owners proprietors partners directors
    of companies or authorized representatives responsible for managing business registrations.
    small business owners who registered their enterprise personally may request updates
    when their phone number or email changes. in some cases accountants consultants
    or administrative staff handling compliance activities may also submit grievances
    when they cannot access the registration due to outdated contact details. this
    category therefore represents issues related specifically to correcting or updating
    communication details associated with an existing udyam certificate.
  - Starter, Credit and Finance. Any other banking issue. this category encompasses
    grievances related to banking operations conduct or services affecting micro small
    and medium enterprises msmes where the issue involves a bank or financial institution
    but does not fall under a specific loan product credit guarantee scheme or named
    government scheme. the category s purpose is to capture operational and service-level
    banking failures that directly disrupt business functioning working capital flow
    compliance obligations or financial credibility of msmes. scope of the category
    the category covers a wide range of issues including account restrictions and
    freezing msme bank accounts being frozen restricted or partially disabled due
    to kyc re-verification identity mismatches internal compliance flags alleged suspicious
    activity account restrictions despite compliance account restrictions continuing
    despite submission of required documents or compliance with bank instructions
    bank charges fees and penalties disputes related to bank charges fees and penalties
    applied without transparency or contrary to agreed account terms including unexpected
    minimum balance penalties annual maintenance charges on zero-balance accounts
    excessive transaction fees penalties levied without prior intimation and subsequently
    refused for reversal despite documented eligibility incorrect credit reporting
    and credit rating impacts grievances related to incorrect credit reporting and
    credit rating impacts including inaccurate reporting of defaults failure to update
    closed or repaid loans delay or refusal to correct cibil or credit bureau records
    wrongly classifying accounts as non-performing assets npas downgrading credit
    ratings due to internal errors bank
- source_sentence: recently we registered ourselves for udyam registration but after
    scanning the qr code verification is failing. issue qr code verification failure
    for udyam registration context the user is reporting that qr code verification
    is failing after registering for udyam registration. details - registration type
    udyam registration verification issue qr code verification failure
  sentences:
  - UAM/Udyam Registration/Certificate related issues. Updation of Address Details.
    this category refers to grievances related to updating or correcting the registered
    address of an enterprise in the udyam registration records. the registered address
    represents the official location of the enterprise and accurate address information
    is important for maintaining correct business records and identifying the location
    of operations. grievances under this category commonly occur when the enterprise
    has changed its business location after registration or when the address recorded
    during registration was entered incorrectly. entrepreneurs may request an update
    if the business has shifted to a new office factory or operational location. some
    grievances arise when the address contains typographical errors incomplete information
    or mismatched details that do not correspond with supporting documents. in other
    cases users may report that they are unable to modify the address details through
    the portal or that the changes they submitted have not been updated in the registration
    record. these grievances are usually raised by msme owners proprietors partners
    of partnership firms directors of companies or authorized representatives responsible
    for maintaining enterprise records. small business owners who relocate their operations
    to a different city or premises may request updates to ensure the registration
    reflects the correct address. similarly compliance officers accountants or consultants
    managing regulatory documentation for the enterprise may submit grievances when
    they identify discrepancies between the registered address and the actual business
    location.
  - Policy and Schemes. DBT / IT desk including Annual Report. dbt it desk including
    the annual report in msme refers to the data dbt wing functioning under the office
    of the development commissioner msme which is responsible for administering direct
    benefit transfer dbt of subsidies under msme schemes managing it and digital infrastructure
    and compiling the ministry s annual report. the wing oversees end-to-end dbt processes
    for scheme reimbursements such as ict and cloud computing subsidies where msmes
    initially incur eligible expenses and subsequently receive reimbursements directly
    into aadhaar-linked bank accounts through the public financial management system
    often after technical verification by agencies like telecommunications consultants
    india limited. it ensures compliance with national dbt standards in coordination
    with the dbt mission and national informatics centre maintains and upgrades msme
    it systems including the udyam registration portal supports cloud-based it adoption
    for msmes undertakes data analytics and mis reporting and onboards schemes to
    the national dbt framework. the wing also prepares the annual report of the ministry
    of msme consolidating performance indicators financial outlays scheme outcomes
    udyam registration trends and macro-level contributions such as msme share in
    gdp and employment which are used for parliament cabinet briefings and policy
    evaluation. while this framework promotes transparency leak-proof subsidy delivery
    evidence-based policymaking and digital efficiency stakeholders frequently raise
    grievances related to dbt execution data accuracy it reliability and reporting
    quality. examples of grievances include msmes experiencing delays in receipt of
    approved ict or cloud service subsidies due to pfms transaction or verification
    glitches reimbursement failures arising from aadhaar bank account linkage mismatches
    despite valid udyam registration inaccuracies or under-reporting of scheme achievements
    udyam registrations or msme gdp contribution in the annual report affecting policy
    advocacy and planning temporary downtime or access issues on udyam or other msme
    it portals during registration or subsidy claim periods and gaps in mis capture
    where scheme data duplications or leakages are not properly reflected in dbt dashboards
    or the annual report prompting appeals for correction and system strengthening.
  - UAM/Udyam Registration/Certificate related issues. QR Code Printed on UDYAM Certificate
    Not Readable (Technical). this category includes grievances related to qr codes
    printed on the udyam certificate that cannot be scanned or read properly. the
    qr code is intended to allow quick verification of the certificate and its associated
    enterprise information. if the qr code cannot be scanned users may face difficulty
    verifying the certificate or sharing it for official purposes. grievances under
    this category typically involve situations where the qr code on the downloaded
    or printed certificate appears blurred distorted or unresponsive when scanned
    with a qr reader. some users report that the qr code does not open any verification
    page after scanning while others find that the scanning application fails to recognize
    the code at all. these issues may arise due to errors during certificate generation
    problems with the downloaded file or printing-related distortions that make the
    qr code unreadable. these grievances are generally raised by msme owners proprietors
    partners directors or authorized representatives who use the udyam certificate
    as official documentation for their enterprise. small business owners who attempt
    to share the certificate for verification purposes may discover that the qr code
    is not functioning correctly. consultants accountants or administrative staff
    responsible for maintaining business documentation may also submit grievances
    when they identify that the qr code on the certificate cannot be scanned or verified.
- source_sentence: respectrd sir i have registered person of pm vishwakarma scheme
    as a broom maker i have also complited the trannung of broom maker at bhusawal
    pm vishwakarma center before three month ago sir but the tool kit of the pm vishwakarma
    scheme rs <NUM> - has not given or got to me so please your request to given of
    scheme benefit as a tool kit and loan ammount of <NUM> - rs issue non-receipt
    of tool kit and loan under pm vishwakarma scheme context the user is reporting
    non-receipt of the tool kit worth rs <NUM> <NUM> and loan amount of rs <NUM> <NUM>
    under the pm vishwakarma scheme despite completing the training three months ago.
    details - scheme pm vishwakarma scheme occupation broom maker training location
    bhusawal pm vishwakarma center amount requested rs <NUM> tool kit rs <NUM> loan
  sentences:
  - UAM/Udyam Registration/Certificate related issues. Migration from UAM to UDYAM.
    this category refers to grievances related to the migration of enterprises registered
    under the earlier udyog aadhaar memorandum uam system to the current udyam registration
    system. the uam registration system was used earlier for msme registration but
    enterprises registered under that system were required to migrate their registration
    details to the newer udyam portal to maintain updated records. during this migration
    process some enterprises encounter difficulties in transferring or verifying their
    existing registration details. grievances under this category typically include
    issues where business owners are unable to complete the migration process from
    uam to udyam due to errors or system restrictions. entrepreneurs may report that
    their uam number is not being recognized by the portal or that the migration process
    stops due to validation errors related to aadhaar pan or enterprise details. some
    users also experience problems when the migrated information does not match the
    original uam registration resulting in incomplete or incorrect enterprise records.
    other grievances may involve duplicate records being generated during migration
    or difficulty accessing the migrated certificate after the process is completed.
    these grievances are usually raised by existing msme owners who had previously
    registered their enterprises under the uam system and are attempting to migrate
    their information to the udyam portal. the complainants may include proprietors
    partners of partnership firms directors of companies or authorized representatives
    responsible for regulatory compliance of the enterprise. in many cases small business
    owners who registered their enterprises several years ago under uam may attempt
    migration when they need updated msme documentation and they raise grievances
    if the migration process cannot be completed successfully.
  - Policy and Schemes. PM Vishwakarma. the pm vishwakarma category encompasses the
    registration skill certification and benefit disbursal processes for artisans
    and craftspeople. the system aims to provide easy registration skill certification
    toolkit incentives credit support and strong market linkage. however operational
    issues eligibility interpretation challenges and bank coordination failures lead
    to breakdowns at the stages of registration certification benefit disbursal and
    bank linkage. common grievance scenarios registration stuck at pending verification
    applicants may experience delays in the registration process with applications
    remaining stuck at pending verification for <NUM> days without any response from
    the local officer. aadhaar-based registration failures aadhaar-based registration
    may fail due to occupation mismatch despite the individual being a traditional
    carpenter for <NUM> years. non-receipt of toolkit incentives artisans and craftspeople
    may not receive the toolkit incentive despite completing skill training and assessment.
    bank refusal of pm vishwakarma loans banks may refuse to provide pm vishwakarma
    loans due to unclear scheme guidelines. incorrect trade listing trades eligible
    under the scheme may not be listed correctly in the portal s dropdown options.
    operational procedural policy and institutional causes operational
  - Policy and Schemes. Related to GST. this category encompasses grievances related
    to operational and procedural frictions under the goods and services tax gst framework
    that directly affect micro small and medium enterprises msmes cash flow invoicing
    and day-to-day business continuity. the category includes the following subcategories
    <NUM> . gst registration issues applications remaining pending verification pan-gst
    name mismatches leading to rejection confusion arising during migration from uam
    udyam-linked records to gst rejection of registration due to pan and gst name
    mismatch non-response from portal support <NUM> . gst refund delays eligible refunds
    especially export-related input tax credit not disbursed within reasonable timelines
    despite correct filings refund status shows processed without actual credit due
    to backend mismatches delayed disbursement of input tax credit refunds for export
    sales refund status shows processed but no amount has been credited due to backend
    mismatch <NUM> . input tax credit itc blockages credits not reflecting because
    supplier invoices are missing on the portal invoices being wrongly flagged as
    ineligible itc reversals triggered by hsn mismatches or delayed supplier compliance
    supplier invoices not reflecting on the gst portal forcing msmes to pay tax from
    their own funds the category primarily captures operational rather than legal
    grievances. while champions does not adjudicate tax disputes it acts as an escalation
    and coordination channel with gstn or relevant tax authorities to resolve delays
    portal errors and process breakdowns impacting msmes. the purpose of this category
    is to address the following - resolve gst registration issues
- source_sentence: the msme portal software keeps crashing during udyam registration
    renewal and scheme applications with error messages and failed uploads every time
    i try. support team gives no help and i can t access my digital certificates or
    track status. this software glitch blocks my business from government benefits
    and loans. please fix the bugs improve server speed and add better error guides
    right away. issue software glitch in msme portal during udyam registration renewal
    and scheme applications context the user is reporting frequent crashes of the
    msme portal software during udyam registration renewal and scheme applications
    resulting in failed uploads error messages and inability to access digital certificates
    or track status which is hindering business access to government benefits and
    loans. details - software msme portal software issue frequent crashes during udyam
    registration renewal and scheme applications error messages failed uploads and
    error messages impact inability to access digital certificates track status and
    access government benefits and loans
  sentences:
  - Starter, Credit and Finance. Loans from Banks. this category loans from banks
    encompasses grievances related to access to credit from banks where micro small
    and medium enterprises msmes have applied for loans and the bottleneck lies at
    the bank level. the scope of this category includes issues involving commercial
    banks regional rural banks rrbs and cooperative banks. it specifically addresses
    situations where the problem is neither related to rbi policy government scheme
    design nor buyer default but arises from bank-side processing handling or decision-making
    of loan applications. the category captures the following scenarios - msmes have
    submitted loan applications along with required documentation complied with bank
    procedures and followed up through branches or portals but the application remains
    pending without a formal decision. - banks keep applications under prolonged under
    process or pending for verification status without issuing deficiency letters
    timelines or written communication. - situations involving repeated or circular
    document demands that effectively stall credit access. - grievances where branch-level
    offices do not forward eligible loan applications to regional or head offices.
    - delays in internal approvals. - avoidance of issuing a clear sanction or rejection
    decision despite prolonged engagement. these cases reflect administrative stalling
    rather than informed credit rejection based on risk or eligibility. the category
    includes the following example issues - i applied for a term loan under the msme
    category and submitted all documents but the bank has kept the application under
    process for several months without any written update. - my loan application status
    has been showing pending for verification on the bank portal for over <NUM> days
    with no deficiency letter issued. - the bank is repeatedly asking for documents
    that were already submitted causing unnecessary delay in loan processing. - the
    branch is not forwarding
  - Technology, Quality and Institutions. Related to NSIC. this category encompasses
    grievances related to the support and facilitation services provided by the national
    small industries corporation nsic to micro small and medium enterprises msmes
    . the scope of this category includes issues arising from the areas of raw material
    assistance market access and risk mitigation through guarantees. specifically
    it covers situations where approved raw material assistance is not released on
    time supplier coordination fails after nsic approval material supplied through
    nsic is delayed or does not meet specifications or documentation and regional
    office processes stall procurement. the category also captures failures in marketing
    support including - delayed or missing inclusion in tenders gem or psu vendor
    listings - late communication of bid opportunities - problems in nsic-sponsored
    exhibitions or buyer-connect programs additionally it includes issues related
    to performance and emd guarantees such as - delayed issuance - incorrect formats
    - non-renewal despite payment - rejection by psus - lack of response when guarantees
    are invoked these grievances typically result in missed orders blocked working
    capital contract delays or loss of business credibility and arise from execution
    coordination or service delivery breakdowns rather than policy interpretation.
    the category is further divided into the following subcategories <NUM> . corporate
    communication single point registration scheme and exhibition consortia and tender
    marketing <NUM> . internal audit and law recovery <NUM> . human resource <NUM>
    . vigilance law recovery <NUM> . international cooperation <NUM> . bank guarantee
    monitoring <NUM> . finance accounts <NUM> . national sc st hub <NUM> . chief vigilance
    officer <NUM> . contract procurement grievance officer <NUM> . digital services
    facilitation and training <NUM> .space marketing cell event management cell <NUM>
    .raw material assistance bank guarantee bill discounting bank tieup csr administration
    <NUM> .technology liaison officer for sc st pwd cmr <NUM> .epf trust superannuation
    pension trust <NUM> .center public information officers cpio <NUM> .company secretary
  - Technology, Quality and Institutions. Software Related. software-related initiatives
    for msmes mainly center on the digital msme scheme under the national manufacturing
    competitiveness programme which promotes adoption of information and communication
    technologies through cloud-based erp crm and accounting software to digitalize
    day-to-day business operations. the scheme combines awareness workshops needs
    assessment and financial support in the form of subsidies covering about <NUM>
    <NUM> of eligible costs subject to a ceiling of <NUM> lakh over two years specifically
    targeting micro and small enterprises. these initiatives are reinforced by complementary
    efforts such as software-enabled facilities under technology centre programmes
    for electronics and esdm sectors digital quality and process parameters under
    zed certification and software-focused modules within entrepreneurship and skill
    development programmes. together these measures aim to standardize workflows automate
    inventory finance and customer management reduce operational inefficiencies and
    inventory holding support online sales and compliance and enhance overall competitiveness
    without requiring heavy upfront investment in hardware. examples of grievances
    include subsidy denial an msme implementing a cloud-based erp costing <NUM> .
    <NUM> lakh receives no reimbursement beyond the <NUM> lakh cap despite meeting
    all eligibility conditions. software ineligibility a cloud application selected
    after needs assessment is later rejected as non-standard or non-approved forcing
    the enterprise to abandon or restart implementation mid-way. inadequate training
    awareness workshops focus only on theoretical benefits of digitalization and fail
    to provide hands-on demonstrations or practical guidance on using erp or crm software.
    post-subsidy continuity issue after the two-year subsidized period ends steep
    renewal or subscription costs make the software unaffordable disrupting business
    operations. needs mismatch an msme assessed for crm requirements is instead provided
    accounting software limiting the usefulness of the digital intervention and affecting
    adoption outcomes.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
- pearson_cosine
- spearman_cosine
model-index:
- name: SentenceTransformer based on BAAI/bge-base-en-v1.5
  results:
  - task:
      type: semantic-similarity
      name: Semantic Similarity
    dataset:
      name: Unknown
      type: unknown
    metrics:
    - type: pearson_cosine
      value: .nan
      name: Pearson Cosine
    - type: spearman_cosine
      value: .nan
      name: Spearman Cosine
---

# SentenceTransformer based on BAAI/bge-base-en-v1.5

This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

## Model Details

### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [BAAI/bge-base-en-v1.5](https://huggingface.co/BAAI/bge-base-en-v1.5) <!-- at revision a5beb1e3e68b9ab74eb54cfd186867f64f240e1a -->
- **Maximum Sequence Length:** 256 tokens
- **Output Dimensionality:** 768 dimensions
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->

### Model Sources

- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)

### Full Model Architecture

```
SentenceTransformer(
  (0): Transformer({'max_seq_length': 256, 'do_lower_case': True, 'architecture': 'BertModel'})
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
```

## Usage

### Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

```bash
pip install -U sentence-transformers
```

Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
    'the msme portal software keeps crashing during udyam registration renewal and scheme applications with error messages and failed uploads every time i try. support team gives no help and i can t access my digital certificates or track status. this software glitch blocks my business from government benefits and loans. please fix the bugs improve server speed and add better error guides right away. issue software glitch in msme portal during udyam registration renewal and scheme applications context the user is reporting frequent crashes of the msme portal software during udyam registration renewal and scheme applications resulting in failed uploads error messages and inability to access digital certificates or track status which is hindering business access to government benefits and loans. details - software msme portal software issue frequent crashes during udyam registration renewal and scheme applications error messages failed uploads and error messages impact inability to access digital certificates track status and access government benefits and loans',
    'Technology, Quality and Institutions. Software Related. software-related initiatives for msmes mainly center on the digital msme scheme under the national manufacturing competitiveness programme which promotes adoption of information and communication technologies through cloud-based erp crm and accounting software to digitalize day-to-day business operations. the scheme combines awareness workshops needs assessment and financial support in the form of subsidies covering about <NUM> <NUM> of eligible costs subject to a ceiling of <NUM> lakh over two years specifically targeting micro and small enterprises. these initiatives are reinforced by complementary efforts such as software-enabled facilities under technology centre programmes for electronics and esdm sectors digital quality and process parameters under zed certification and software-focused modules within entrepreneurship and skill development programmes. together these measures aim to standardize workflows automate inventory finance and customer management reduce operational inefficiencies and inventory holding support online sales and compliance and enhance overall competitiveness without requiring heavy upfront investment in hardware. examples of grievances include subsidy denial an msme implementing a cloud-based erp costing <NUM> . <NUM> lakh receives no reimbursement beyond the <NUM> lakh cap despite meeting all eligibility conditions. software ineligibility a cloud application selected after needs assessment is later rejected as non-standard or non-approved forcing the enterprise to abandon or restart implementation mid-way. inadequate training awareness workshops focus only on theoretical benefits of digitalization and fail to provide hands-on demonstrations or practical guidance on using erp or crm software. post-subsidy continuity issue after the two-year subsidized period ends steep renewal or subscription costs make the software unaffordable disrupting business operations. needs mismatch an msme assessed for crm requirements is instead provided accounting software limiting the usefulness of the digital intervention and affecting adoption outcomes.',
    'Technology, Quality and Institutions. Related to NSIC. this category encompasses grievances related to the support and facilitation services provided by the national small industries corporation nsic to micro small and medium enterprises msmes . the scope of this category includes issues arising from the areas of raw material assistance market access and risk mitigation through guarantees. specifically it covers situations where approved raw material assistance is not released on time supplier coordination fails after nsic approval material supplied through nsic is delayed or does not meet specifications or documentation and regional office processes stall procurement. the category also captures failures in marketing support including - delayed or missing inclusion in tenders gem or psu vendor listings - late communication of bid opportunities - problems in nsic-sponsored exhibitions or buyer-connect programs additionally it includes issues related to performance and emd guarantees such as - delayed issuance - incorrect formats - non-renewal despite payment - rejection by psus - lack of response when guarantees are invoked these grievances typically result in missed orders blocked working capital contract delays or loss of business credibility and arise from execution coordination or service delivery breakdowns rather than policy interpretation. the category is further divided into the following subcategories <NUM> . corporate communication single point registration scheme and exhibition consortia and tender marketing <NUM> . internal audit and law recovery <NUM> . human resource <NUM> . vigilance law recovery <NUM> . international cooperation <NUM> . bank guarantee monitoring <NUM> . finance accounts <NUM> . national sc st hub <NUM> . chief vigilance officer <NUM> . contract procurement grievance officer <NUM> . digital services facilitation and training <NUM> .space marketing cell event management cell <NUM> .raw material assistance bank guarantee bill discounting bank tieup csr administration <NUM> .technology liaison officer for sc st pwd cmr <NUM> .epf trust superannuation pension trust <NUM> .center public information officers cpio <NUM> .company secretary',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.6265, 0.5981],
#         [0.6265, 1.0000, 0.7013],
#         [0.5981, 0.7013, 1.0000]])
```

<!--
### Direct Usage (Transformers)

<details><summary>Click to see the direct usage in Transformers</summary>

</details>
-->

<!--
### Downstream Usage (Sentence Transformers)

You can finetune this model on your own dataset.

<details><summary>Click to expand</summary>

</details>
-->

<!--
### Out-of-Scope Use

*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->

## Evaluation

### Metrics

#### Semantic Similarity

* Evaluated with [<code>EmbeddingSimilarityEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.EmbeddingSimilarityEvaluator)

| Metric              | Value   |
|:--------------------|:--------|
| pearson_cosine      | nan     |
| **spearman_cosine** | **nan** |

<!--
## Bias, Risks and Limitations

*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->

<!--
### Recommendations

*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->

## Training Details

### Training Dataset

#### Unnamed Dataset

* Size: 88 training samples
* Columns: <code>sentence_0</code> and <code>sentence_1</code>
* Approximate statistics based on the first 88 samples:
  |         | sentence_0                                                                           | sentence_1                                                                            |
  |:--------|:-------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|
  | type    | string                                                                               | string                                                                                |
  | details | <ul><li>min: 46 tokens</li><li>mean: 172.95 tokens</li><li>max: 256 tokens</li></ul> | <ul><li>min: 199 tokens</li><li>mean: 253.88 tokens</li><li>max: 256 tokens</li></ul> |
* Samples:
  | sentence_0                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | sentence_1                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
  |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
  | <code>with due respect i md mafijul husen would like to intimate that when i trying to edit my existing udyam registration certificate <udyam_no> certificate of my enterprise viz. md mafijul husen then i failed to enter otp as my earlier mobile number <phone_no> has been changed and the given gmail id is also inactive. hence it is my request to change my mobile number so that i can edit my existing udyam registration <udyam_no> certificate. my pan no is <pan_no> and aadhaar number is <uam_no> . issue update of mobile number and gmail id for udyam registration certificate editing context the user is requesting an update of the mobile number and gmail id associated with the existing udyam registration certificate udyam-wb- <NUM> - <NUM> to facilitate editing of the certificate. details - udyam registration certificate no udyam-wb- <NUM> - <NUM> old mobile no <NUM> old gmail id inactive pan no aetph0941n aadhar no <NUM></code>                                                                                | <code>UAM/Udyam Registration/Certificate related issues. Updation of Email ID/Mobile No. Linked to UDYAM Certificate. this category includes grievances related to updating or correcting the email id or mobile number associated with an existing udyam registration. contact details provided during registration are used for communication verification and authentication when accessing the enterprise profile on the portal. if these contact details become outdated incorrect or inaccessible the enterprise owner may face difficulty receiving otps accessing the portal or managing the registration information. common grievances under this category include requests to change the registered mobile number or email address because the original number is no longer active the sim card has been lost the email account is no longer accessible or the contact details were entered incorrectly during registration. some complaints arise when the registered contact details belong to an employee or consultant who is n...</code> |
  | <code>we had applied for msme registration under the application number <udyam_no> m <NUM> on 22nd march <NUM> . after reviewing the status and considering our circumstances we kindly request that our case be transferred to the micro and small enterprises facilitation council msefc for further processing and resolution. we believe that the msefc councils intervention will help address any concerns or disputes that may have arisen regarding our application. we are hopeful that this request will be processed swiftly and in accordance with the necessary regulations. thank you for your attention to this matter. we look forward to your prompt assistance in facilitating this request. issue request for transfer of msme registration case to msefc context the user is requesting to transfer their msme registration case to the micro and small enterprises facilitation council msefc for further processing and resolution. details - application number udyam-dl- <NUM> - <NUM> m <NUM> application date 22nd m...</code> | <code>Technology, Quality and Institutions. Related to NI-MSME. this category encompasses grievances related to training capacity-building and certification programs administered by the national institute for micro small and medium enterprises ni-msme for micro small and medium enterprises msmes entrepreneurs and their employees. the scope of this category includes issues arising from the delivery of training programs such as repeatedly postponed schedules without prior notification inaccessible online training portals unclear eligibility criteria unavailable trainers insufficient mentoring outdated or non-practical course content additionally this category captures certification-related issues including delayed issuance of certificates certificates issued with incorrect details difficulty verifying certificates online failure to deliver certificates after course completion furthermore the category includes course enrollment and admission disputes such as unjustified rejection of enrollment ...</code> |
  | <code>insurancy company national insurance company limited branch name of insurance company branch if other khamgaon branch date of application <NUM> - <NUM> - <NUM> policy number <NUM> my claim is kept pending even after submitting all the documents after changing all the requirements as changed by various surveyors. issue delayed insurance claim under national insurance company limited context the user is reporting that the insurance claim submitted on <NUM> - <NUM> - <NUM> with policy number <NUM> is still pending despite submission of all required documents as per changes made by various surveyors. details - policy number <NUM> claim submission date <NUM> - <NUM> - <NUM> branch khamgaon</code>                                                                                                                                                                                                                                                                                                                       | <code>Starter, Credit and Finance. Insurance Claim related issues. this category encompasses grievances related to insurance claims associated with various government-backed and private insurance products. the scope includes <NUM> . esic employees state insurance corporation insurance benefits <NUM> . epfo employees provident fund organisation -linked insurance benefits including edli employees deposit linked insurance <NUM> . cgtmse credit guarantee fund trust for micro and small enterprises -linked insurance elements <NUM> . private or general business insurance products where a government department psu public sector undertaking or bank acts as an intermediary or implementing authority the category covers a range of issues including opaque rejection decisions undocumented policy exclusions administrative closure without explanation shifting of risk and liability onto msmes micro small and medium enterprises or employees document and data mismatches across multiple systems such as aadhaar ...</code> |
* Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
  ```json
  {
      "scale": 20.0,
      "similarity_fct": "cos_sim",
      "mini_batch_size": 32,
      "gather_across_devices": false
  }
  ```

### Training Hyperparameters
#### Non-Default Hyperparameters

- `per_device_train_batch_size`: 64
- `per_device_eval_batch_size`: 64
- `num_train_epochs`: 6
- `fp16`: True
- `multi_dataset_batch_sampler`: round_robin

#### All Hyperparameters
<details><summary>Click to expand</summary>

- `do_predict`: False
- `eval_strategy`: no
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 64
- `per_device_eval_batch_size`: 64
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 5e-05
- `weight_decay`: 0.0
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1
- `num_train_epochs`: 6
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: None
- `warmup_ratio`: None
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `enable_jit_checkpoint`: False
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `use_cpu`: False
- `seed`: 42
- `data_seed`: None
- `bf16`: False
- `fp16`: True
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: -1
- `ddp_backend`: None
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: False
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `parallelism_config`: None
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch_fused
- `optim_args`: None
- `group_by_length`: False
- `length_column_name`: length
- `project`: huggingface
- `trackio_space_id`: trackio
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: None
- `hub_always_push`: False
- `hub_revision`: None
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_for_metrics`: []
- `eval_do_concat_batches`: True
- `auto_find_batch_size`: False
- `full_determinism`: False
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `include_num_input_tokens_seen`: no
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `use_liger_kernel`: False
- `liger_kernel_config`: None
- `eval_use_gather_object`: False
- `average_tokens_across_devices`: True
- `use_cache`: False
- `prompts`: None
- `batch_sampler`: batch_sampler
- `multi_dataset_batch_sampler`: round_robin
- `router_mapping`: {}
- `learning_rate_mapping`: {}

</details>

### Training Logs
| Epoch | Step | spearman_cosine |
|:-----:|:----:|:---------------:|
| 1.0   | 2    | nan             |
| 2.0   | 4    | nan             |
| 3.0   | 6    | nan             |
| 4.0   | 8    | nan             |
| 5.0   | 10   | nan             |
| 6.0   | 12   | nan             |


### Framework Versions
- Python: 3.12.12
- Sentence Transformers: 5.2.3
- Transformers: 5.0.0
- PyTorch: 2.10.0+cu128
- Accelerate: 1.12.0
- Datasets: 4.0.0
- Tokenizers: 0.22.2

## Citation

### BibTeX

#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}
```

#### CachedMultipleNegativesRankingLoss
```bibtex
@misc{gao2021scaling,
    title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup},
    author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan},
    year={2021},
    eprint={2101.06983},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}
```

<!--
## Glossary

*Clearly define terms in order to be accessible across audiences.*
-->

<!--
## Model Card Authors

*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->

<!--
## Model Card Contact

*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->