Ambika14 committed on
Commit 15ba9f7 · verified · 1 Parent(s): aa92717

Upload folder using huggingface_hub

1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
+ {
+   "word_embedding_dimension": 768,
+   "pooling_mode_cls_token": false,
+   "pooling_mode_mean_tokens": true,
+   "pooling_mode_max_tokens": false,
+   "pooling_mode_mean_sqrt_len_tokens": false,
+   "pooling_mode_weightedmean_tokens": false,
+   "pooling_mode_lasttoken": false,
+   "include_prompt": true
+ }
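
The pooling config above enables only `pooling_mode_mean_tokens`, so a sentence embedding would be the average of the 768-dimensional token vectors. The following is a minimal pure-Python sketch of that idea, not the sentence-transformers implementation: the helper names `active_pooling_modes` and `mean_pool` and the toy token values are assumptions made for illustration, and skipping padded positions via an attention mask is the usual convention rather than something stated in this commit.

```python
import json

# Same keys and values as 1_Pooling/config.json in this commit.
POOLING_CONFIG = json.loads("""
{
  "word_embedding_dimension": 768,
  "pooling_mode_cls_token": false,
  "pooling_mode_mean_tokens": true,
  "pooling_mode_max_tokens": false,
  "pooling_mode_mean_sqrt_len_tokens": false,
  "pooling_mode_weightedmean_tokens": false,
  "pooling_mode_lasttoken": false,
  "include_prompt": true
}
""")


def active_pooling_modes(config):
    """Return the pooling_mode_* flags that are switched on (hypothetical helper)."""
    return [key for key, value in config.items()
            if key.startswith("pooling_mode_") and value is True]


def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors of one sentence, skipping padded positions."""
    kept = [vec for vec, keep in zip(token_embeddings, attention_mask) if keep]
    if not kept:
        raise ValueError("attention mask selects no tokens")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


# Toy input: two real tokens and one padding token (values are made up).
dim = POOLING_CONFIG["word_embedding_dimension"]
tokens = [[1.0] * dim, [3.0] * dim, [100.0] * dim]
mask = [1, 1, 0]
sentence_embedding = mean_pool(tokens, mask)
```

Because the padded third token is masked out, it does not affect the result; only the first two token vectors are averaged.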
README.md ADDED
@@ -0,0 +1,782 @@
+ ---
+ tags:
+ - sentence-transformers
+ - sentence-similarity
+ - feature-extraction
+ - dense
+ - generated_from_trainer
+ - dataset_size:90
+ - loss:CachedMultipleNegativesRankingLoss
+ base_model: sentence-transformers/all-mpnet-base-v2
+ widget:
+ - source_sentence: when ever i try to get register for udyam registration the registration
+     is not getting completed because their is issue with the selection of longitude
+     and latitude on the website. it says to login on some this website https webgis3.nic.in
+     bharatmaps rest services and they only can able to select the longitude and latitude
+     location please resolve the glitch. issue issue with longitude and latitude selection
+     for udyam registration context the user is experiencing an issue with completing
+     the udyam registration process due to a problem with selecting longitude and latitude
+     on the website which requires login to a separate website https webgis3.nic.in
+     bharatmaps rest services. details - website for longitude and latitude selection
+     https webgis3.nic.in bharatmaps rest services
+   sentences:
+   - UAM/Udyam Registration/Certificate related issues. Update Company/Owner Name Details.
+     this category includes grievances related to corrections or updates to the name
+     of the enterprise or the name of the owner associated with a udyam registration.
+     accurate naming details are important for maintaining correct enterprise records
+     and ensuring that the information recorded in the registration reflects the official
+     business identity. grievances under this category typically arise when the name
+     of the enterprise or the owner s name recorded during registration contains an
+     error or needs to be updated due to changes in the business structure. for example
+     the enterprise name may have been entered incorrectly during registration or the
+     owner s name may not match official identification documents. in some cases the
+     enterprise name may change due to business rebranding conversion of the business
+     structure or correction of typographical errors made during the registration process.
+     users may also report that the system does not allow modification of the name
+     field or that their request to update the name has not been processed. these grievances
+     are generally raised by msme proprietors partners company directors or authorized
+     representatives responsible for managing the enterprise registration. business
+     owners who notice inconsistencies between their official business documents and
+     the name recorded in the udyam registration may request updates to correct the
+     information. in some situations accountants consultants or administrative staff
+     who manage regulatory documentation for the enterprise may also submit grievances
+     when they identify that the enterprise or owner name recorded in the registration
+     requires correction or modification.
+   - Technology, Quality and Institutions. Official Language Related Issues. official
+     language related issues in msme administration concern the implementation of hindi
+     rajbhasha in accordance with the official languages act <NUM> as amended across
+     the ministry of msme its development institutes field offices and attached organizations.
+     this framework mandates progressive use of hindi in official work bilingual hindi
+     english documentation replies in hindi to communications received in hindi availability
+     of hindi-enabled software on computers and regular training in hindi typing and
+     computing for officials. the ministry monitors compliance through official language
+     implementation committees quarterly progress reviews rajbhasha inspections and
+     conferences while ensuring that citizens charters schemes portals and public-facing
+     information are available bilingually. these measures aim to improve accessibility
+     for hindi-speaking msmes enhance transparency and inclusiveness strengthen regional
+     outreach especially in hindi-belt states and fulfill constitutional and administrative
+     obligations without restricting the use of english where required. examples of
+     grievances include non-hindi reply an msme submits an application or grievance
+     in hindi to a development institute but receives a response only in english contrary
+     to official language correspondence rules. bilingual documentation gap key documents
+     such as annual reports scheme guidelines or notices are issued only in english
+     or with incomplete hindi translations limiting accessibility for hindi-speaking
+     stakeholders. training shortfall field office staff are unable to type or process
+     files in hindi despite mandated hindi software and training provisions causing
+     delays in rajbhasha compliance. portal language issue hindi versions of portals
+     like udyam or champions contain missing pages partial translations or technical
+     glitches preventing rural or hindi-only users from completing registrations or
+     filing grievances. awareness and communication lapse regional msmes are not informed
+     in hindi about official language conferences workshops or policy updates leading
+     to missed participation and reduced stakeholder engagement.
+   - UAM/Udyam Registration/Certificate related issues. Issues in Updating Latitude
+     and Longitude Details (Technical). this category covers grievances related to
+     technical issues encountered while entering or updating the latitude and longitude
+     coordinates of the enterprise location in the udyam registration system. these
+     coordinates are used to identify the geographic location of the enterprise and
+     are sometimes required when updating address information or completing registration
+     details. grievances under this category usually arise when the portal does not
+     accept the latitude and longitude values entered by the user or when technical
+     errors prevent the coordinates from being saved. users may report that the location
+     detection feature does not function properly that the system repeatedly shows
+     errors while entering coordinates or that the map interface does not load correctly.
+     in some cases entrepreneurs may also face issues when the location selected on
+     the map does not match their actual address or when the coordinates fail to update
+     despite repeated attempts. these grievances are typically raised by msme owners
+     proprietors partners directors or authorized representatives who are attempting
+     to update enterprise location details in the registration system. small business
+     owners completing registration updates themselves may encounter technical difficulties
+     while entering location coordinates. similarly consultants accountants or administrative
+     staff who assist enterprises with registration or profile updates may submit grievances
+     if the portal prevents them from completing the required location information
+     due to technical errors.
+ - source_sentence: sub fake udyam assist certificate udyam-i-ts-i2- <NUM> generated-
+     reg. dear sir i am sage arun kumar would like to inform you that someone has generated
+     udyam assist certificate vide no. udyam-i-ts-i2- <NUM> registration date <NUM>
+     - <NUM> - <NUM> on my name without my knowledge intimation. i received a text
+     message on <NUM> - <NUM> - <NUM> to my mobile no. <phone_no> stating that udyam
+     certificate is generated. i was shocked and when i checked in udyam portal the
+     certificate was generated on my name sage arun kumar. as per the certificate the
+     details are as mentioned below name sage arun kumar enterprise type - micro major
+     activity-services. dob <NUM> - <NUM> - <NUM> social category general issue retrieval
+     of fake udyam assist certificate context the user is reporting that a fake udyam
+     assist certificate udyam-i-ts-i2- <NUM> was generated on their name without their
+     knowledge or intimation. details - certificate no udyam-i-ts-i2- <NUM> name sage
+     arun kumar enterprise type micro major activity services date of birth <NUM> -
+     <NUM> - <NUM> social category general certificate generation date <NUM> - <NUM>
+     - <NUM> notification date <NUM> - <NUM> - <NUM>
+   sentences:
+   - Policy and Schemes. PM Vishwakarma. the pm vishwakarma category encompasses the
+     registration skill certification and benefit disbursal processes for artisans
+     and craftspeople. the system aims to provide easy registration skill certification
+     toolkit incentives credit support and strong market linkage. however operational
+     issues eligibility interpretation challenges and bank coordination failures lead
+     to breakdowns at the stages of registration certification benefit disbursal and
+     bank linkage. common grievance scenarios registration stuck at pending verification
+     applicants may experience delays in the registration process with applications
+     remaining stuck at pending verification for <NUM> days without any response from
+     the local officer. aadhaar-based registration failures aadhaar-based registration
+     may fail due to occupation mismatch despite the individual being a traditional
+     carpenter for <NUM> years. non-receipt of toolkit incentives artisans and craftspeople
+     may not receive the toolkit incentive despite completing skill training and assessment.
+     bank refusal of pm vishwakarma loans banks may refuse to provide pm vishwakarma
+     loans due to unclear scheme guidelines. incorrect trade listing trades eligible
+     under the scheme may not be listed correctly in the portal s dropdown options.
+     operational procedural policy and institutional causes operational
+   - Technology, Quality and Institutions. Related to NSIC. this category encompasses
+     grievances related to the support and facilitation services provided by the national
+     small industries corporation nsic to micro small and medium enterprises msmes
+     . the scope of this category includes issues arising from the areas of raw material
+     assistance market access and risk mitigation through guarantees. specifically
+     it covers situations where approved raw material assistance is not released on
+     time supplier coordination fails after nsic approval material supplied through
+     nsic is delayed or does not meet specifications or documentation and regional
+     office processes stall procurement. the category also captures failures in marketing
+     support including - delayed or missing inclusion in tenders gem or psu vendor
+     listings - late communication of bid opportunities - problems in nsic-sponsored
+     exhibitions or buyer-connect programs additionally it includes issues related
+     to performance and emd guarantees such as - delayed issuance - incorrect formats
+     - non-renewal despite payment - rejection by psus - lack of response when guarantees
+     are invoked these grievances typically result in missed orders blocked working
+     capital contract delays or loss of business credibility and arise from execution
+     coordination or service delivery breakdowns rather than policy interpretation.
+     the category is further divided into the following subcategories <NUM> . corporate
+     communication single point registration scheme and exhibition consortia and tender
+     marketing <NUM> . internal audit and law recovery <NUM> . human resource <NUM>
+     . vigilance law recovery <NUM> . international cooperation <NUM> . bank guarantee
+     monitoring <NUM> . finance accounts <NUM> . national sc st hub <NUM> . chief vigilance
+     officer <NUM> . contract procurement grievance officer <NUM> . digital services
+     facilitation and training <NUM> .space marketing cell event management cell <NUM>
+     .raw material assistance bank guarantee bill discounting bank tieup csr administration
+     <NUM> .technology liaison officer for sc st pwd cmr <NUM> .epf trust superannuation
+     pension trust <NUM> .center public information officers cpio <NUM> .company secretary
+   - UAM/Udyam Registration/Certificate related issues. Existing / Unauthorized UDYAM
+     Registration Against PAN. this category includes grievances related to updating
+     or correcting the email id or mobile number associated with an existing udyam
+     registration. contact details provided during registration are used for communication
+     verification and authentication when accessing the enterprise profile on the portal.
+     if these contact details become outdated incorrect or inaccessible the enterprise
+     owner may face difficulty receiving otps accessing the portal or managing the
+     registration information. common grievances under this category include requests
+     to change the registered mobile number or email address because the original number
+     is no longer active the sim card has been lost the email account is no longer
+     accessible or the contact details were entered incorrectly during registration.
+     some complaints arise when the registered contact details belong to an employee
+     or consultant who is no longer associated with the enterprise preventing the current
+     owner from receiving verification messages. in other cases entrepreneurs report
+     that they cannot update contact details because the system requires authentication
+     through the old mobile number or email which they no longer have access to. these
+     grievances are typically raised by msme owners proprietors partners directors
+     of companies or authorized representatives responsible for managing business registrations.
+     small business owners who registered their enterprise personally may request updates
+     when their phone number or email changes. in some cases accountants consultants
+     or administrative staff handling compliance activities may also submit grievances
+     when they cannot access the registration due to outdated contact details. this
+     category therefore represents issues related specifically to correcting or updating
+     communication details associated with an existing udyam certificate.
+ - source_sentence: as per the attached letter issue request for reference to attached
+     letter context the user is requesting reference to the attached letter for further
+     details. details - attached with application
+   sentences:
+   - Technology, Quality and Institutions. Testing, Quality, Testing Center. this category
+     encompasses grievances related to msmes micro small and medium enterprises inability
+     to access utilize or rely on government-recognized testing calibration inspection
+     or certification services required for regulatory compliance tenders gem listing
+     or exports. the category covers a range of issues including delays in the issuance
+     of test reports despite samples being submitted and fees paid denial or non-issuance
+     of quality or conformity certificates without clear reasons difficulties accessing
+     testing or calibration facilities due to - capacity constraints - administrative
+     refusal - non-functional equipment procedural and system-level barriers such as
+     - unclear or changing documentation requirements - portal mismatches - fees paid
+     but testing not scheduled situations where business losses occur due to market
+     access being blocked due to pending testing or certification at authorized labs
+     or msme testing centers. example issues include testing completed and fees paid
+     but test report is not issued even after many weeks quality certification was
+     rejected without written reasons despite compliance with guidelines testing center
+     is refusing to accept samples citing workload while deadlines are approaching
+     fees paid online but testing not scheduled due to portal or procedural issues
+     tender or export shipment is blocked because the required test certificate is
+     still pending at the testing lab. the purpose of this category is to capture grievances
+     related to the operational procedural policy or institutional causes that hinder
+     msmes access to government-recognized testing calibration inspection or certification
+     services. the category aims to identify and address the root causes of these issues
+     including capacity constraints at testing facilities inade
+   - Technology, Quality and Institutions. Related to MSME-DFO. this category encompasses
+     grievances related to field-level execution failures at msme development facilitation
+     offices dfos which are responsible for facilitating msme schemes loans subsidies
+     and services. the scope of this category includes field-level execution failures
+     non-responsive dfo officers failure to provide guidance on documentation or procedures
+     inaction on queries submitted through champions or physical visits inspection
+     delays or inconsistencies postponed or repeatedly rescheduled site visits delayed
+     inspection reports unnecessary multiple inspections that stall loan disbursement
+     or subsidy release local facilitation and coordination failures misrouting of
+     applications between offices lack of facilitation for land or utilities approvals
+     unavailability of promised local support services poor coordination between dfos
+     banks psus and state nodal officers resulting in projects remaining stuck despite
+     eligibility or prior approvals example issues dfo officials not responding to
+     phone calls or emails regarding subsidy applications with no guidance provided
+     on required documents on-site inspection for msme projects pending for several
+     months blocking bank loan disbursement inspection scheduled multiple times but
+     cancelled without notice with the inspection report still not issued applications
+     being sent from one local office to another by the dfo without clear instructions
+     or responsibility lack of coordination between dfo and bank delaying loan sanction
+     even after project verification operational procedural policy or institutional
+     causes inadequate communication and coordination between dfos banks psus and state
+     nodal officers inefficient documentation and procedure guidance inaction
+   - Others. Others. this category includes udyam uam registration grievances that
+     cannot be clearly classified under the defined technical categories. it covers
+     complaints where the grievance description or technical summary is invalid incomplete
+     irrelevant vague or lacks sufficient details to identify the specific issue. examples
+     include unclear statements such as udyam not working submissions without key identifiers
+     like urn or pan queries unrelated to registration processes such as scheme eligibility
+     or bank loan inquiries foreign language submissions without translation or attachments
+     shared without proper explanation. the others category ensures that such unclassifiable
+     grievances are not ignored or abandoned. instead they are flagged for manual review
+     and preliminary assessment. during this process reviewers attempt to understand
+     the issue request additional information if necessary and determine whether the
+     grievance can be redirected to a relevant category or requires further technical
+     attention. this approach helps maintain continuity in grievance handling by allowing
+     submissions that do not initially meet classification standards to still enter
+     the review system. it also supports data quality by encouraging clarification
+     and correction of incomplete inputs. by enabling manual triage and follow-up the
+     others category helps ensure that stakeholders receive appropriate guidance and
+     that legitimate concerns are eventually directed to the correct resolution pathway
+     reducing repeated or misclassified submissions.
+ - source_sentence: banks deny <NUM> interest subvention under the scheme for my incremental
+     loan saying my turnover growth doesn t qualify despite meeting criteria. without
+     this finance relief high rates kill my starter profits. enforce scheme benefits
+     for new borrowers. issue non-disbursement of <NUM> interest subvention under interest
+     subvention scheme for incremental credit to msmes <NUM> context the user is reporting
+     that banks are denying the <NUM> interest subvention under the scheme for their
+     incremental loan citing that the turnover growth does not qualify despite meeting
+     the criteria and is requesting enforcement of scheme benefits for new borrowers.
+     details - scheme interest subvention scheme for incremental credit to msmes <NUM>
+     claim denied <NUM> interest subvention reason turnover growth not qualifying despite
+     meeting criteria
+   sentences:
+   - Technology, Quality and Institutions. Related to Tool Rooms. this category encompasses
+     grievances related to the operational and technical services provided by government-supported
+     msme tool rooms. the scope includes issues with access to machinery prototyping
+     facilities manufacturing support and skill-development or training programs. key
+     areas of concern include unavailability of machine time despite confirmed bookings
+     equipment under maintenance or frequent breakdowns high-demand machines consistently
+     overbooked infrastructure promised for msme production support not accessible
+     when required delays cancellations or poor execution of technical training programs
+     non-availability of trainers or technical experts mismatch between published and
+     actual service fees lack of transparency during machine usage or training delivery
+     these grievances directly impact production timelines project execution and workforce
+     upskilling. they arise from service delivery and operational failures rather than
+     administrative management or policy interpretation. example issues include cnc
+     machine booked and confirmed but under maintenance upon arrival resulting in a
+     two-week production delay 3d printing and prototyping facilities shown as available
+     online but fully occupied upon arrival training program for advanced machining
+     postponed multiple times due to trainer unavailability affecting project schedules
+     higher machine usage fees charged than those mentioned in the official tool room
+     rate card additional material and service charges demanded during machine access
+     despite full online pre-payment the operational procedural and institutional causes
+     of these grievances include inadequate maintenance and equipment management insufficient
+     capacity or resource allocation poor communication and transparency inefficient
+     service delivery processes lack of clear policies and procedures inadequate training
+     and capacity building for trainers and technical experts the impact of these grievances
+     on users systems eligibility or implementation includes del
+   - Starter, Credit and Finance. Interest Subvention Scheme for Incremental Credit
+     to MSMEs 2018. the interest subvention scheme for incremental credit to msmes
+     <NUM> is a financial relief initiative of the ministry of micro small and medium
+     enterprises introduced to reduce the cost of formal credit for micro and small
+     enterprises by providing an interest relief of <NUM> per annum on fresh or incremental
+     term loans and working capital facilities subject to an aggregate cap of <NUM>
+     crore per enterprise. the scheme applies to gst-registered and udyam-registered
+     msmes that availed eligible credit from scheduled commercial banks later extended
+     to co-operative banks and select nbfcs with operational support and reimbursement
+     routed through the small industries development bank of india. implemented initially
+     during fy <NUM> <NUM> with a dedicated budgetary allocation the scheme aimed to
+     incentivize formalization through gst and udyam registration lower effective borrowing
+     costs support working capital needs and capacity expansion in manufacturing and
+     service msmes and aid recovery during periods of economic stress without imposing
+     additional collateral requirements. although beneficial in intent several operational
+     issues were reported by stakeholders. examples of grievances include cases where
+     eligible msmes were denied the <NUM> interest relief because banks classified
+     incremental credit as non-fresh despite being sanctioned in the eligible financial
+     year prolonged delays of several months in processing and reimbursement of quarterly
+     subvention claims by lending institutions or the nodal agency adversely impacting
+     enterprise cash flows partial grant of subvention where aggregate borrowings from
+     multiple banks exceeded the <NUM> crore ceiling even though individual loans were
+     otherwise eligible exclusion of loans disbursed by certain lenders during initial
+     phases of the scheme leading to disputes over retrospective applicability and
+     post-disbursal verification issues such as gst data mismatches resulting in recovery
+     or clawback of already credited subvention amounts compelling msmes to approach
+     bank grievance cells or ministry-level redressal mechanisms for resolution.
+   - Technology, Quality and Institutions. Design clinic Scheme - an NMCP Scheme. the
+     design clinic scheme under the national manufacturing competitiveness programme
+     nmcp is an initiative of the ministry of micro small and medium enterprises implemented
+     through the national institute of design ahmedabad to integrate professional design
+     expertise into msme operations and promote innovation-driven manufacturing. the
+     scheme supports msmes that are new to structured design interventions by funding
+     projects related to product design process improvement packaging ergonomics branding
+     and prototype development. financial assistance is provided in the form of grants
+     covering up to <NUM> of the approved project cost subject to a maximum of <NUM>
+     lakh for individual enterprises or small groups and up to <NUM> lakh for larger
+     group projects while student-led design projects supervised by recognized design
+     institutions are supported up to <NUM> of the cost with a ceiling of <NUM> . <NUM>
+     lakh. through design clinics workshops and expert mentoring delivered via regional
+     centers the scheme helps msmes enhance product quality improve market appeal support
+     new product launches strengthen export competitiveness and transition from contract
+     manufacturing to original design manufacturing with higher value addition. examples
+     of common grievances under the design clinic scheme include subsidy cap limitation
+     an msme undertakes a comprehensive packaging and branding redesign costing <NUM>
+     lakh but receives reimbursement only up to the maximum admissible limit leaving
+     a significant portion self-funded. designer eligibility rejection a mutually agreed
+     design consultant is later declared ineligible because the firm is not empanelled
+     forcing the msme to restart the project selection process. reimbursement delays
+     despite timely submission of completion reports and utilization certificates the
+     approved grant is delayed for several months affecting the msme s cash flow. student
+     project funding gap a student-led prototype development project incurs higher
+     costs but reimbursement is restricted to the capped percentage creating a shortfall
+     for the participating msme or institution. regional support gaps msmes in remote
+     or northeastern regions report lack of promised workshops clinics or facilitation
+     support from nearby implementing or outreach centers limiting access to scheme
+     benefits.
+ - source_sentence: loanagreement no isbl00910729978dated26- <NUM> - <NUM> loan payment
+     is pending since <NUM> sep <NUM> . hdfc bank has returned the cheque stating it
+     as alteration under rbi guidelines. pli and nodal agencies contact numbers are
+     found out of service. i am unable to connect with them. i have attached the loan
+     agreement pdf for reference. please support to get the resolution as it is pending
+     since <NUM> sep <NUM> . issue non-receipt of loan payment under dcmsme scheme
+     context the user is reporting non-receipt of loan payment since <NUM> <NUM> <NUM>
+     citing hdfc bank s return of cheque as alteration under rbi guidelines and requesting
+     assistance in resolving the issue. details - loan agreement no isbl00910729978
+     loan agreement date <NUM> <NUM> <NUM> cheque return reason alteration under rbi
+     guidelines attached with application
+   sentences:
+   - Policy and Schemes. Related to GST. this category encompasses grievances related
+     to operational and procedural frictions under the goods and services tax gst framework
+     that directly affect micro small and medium enterprises msmes cash flow invoicing
+     and day-to-day business continuity. the category includes the following subcategories
+     <NUM> . gst registration issues applications remaining pending verification pan-gst
+     name mismatches leading to rejection confusion arising during migration from uam
+     udyam-linked records to gst rejection of registration due to pan and gst name
+     mismatch non-response from portal support <NUM> . gst refund delays eligible refunds
+     especially export-related input tax credit not disbursed within reasonable timelines
+     despite correct filings refund status shows processed without actual credit due
+     to backend mismatches delayed disbursement of input tax credit refunds for export
+     sales refund status shows processed but no amount has been credited due to backend
+     mismatch <NUM> . input tax credit itc blockages credits not reflecting because
+     supplier invoices are missing on the portal invoices being wrongly flagged as
+     ineligible itc reversals triggered by hsn mismatches or delayed supplier compliance
+     supplier invoices not reflecting on the gst portal forcing msmes to pay tax from
+     their own funds the category primarily captures operational rather than legal
+     grievances. while champions does not adjudicate tax disputes it acts as an escalation
+     and coordination channel with gstn or relevant tax authorities to resolve delays
+     portal errors and process breakdowns impacting msmes. the purpose of this category
+     is to address the following - resolve gst registration issues
372
+ - Technology, Quality and Institutions. Related to Scheme of KVIC. this category
373
+ encompasses grievances related to schemes subsidies certifications and implementation
374
+ processes administered by the khadi village industries commission kvic and its
375
+ implementing authorities including state kvic and district industries centre dic
376
+ offices. it specifically addresses issues that originate from kvic or its field-level
377
+ offices excluding problems solely with banks generic msme schemes or non-kvic
378
+ authorities. the category covers a range of issues including <NUM> . delays or
379
+ failures in the release of pmegp margin money subsidies where loans have already
380
+ been sanctioned and units have been set up but kvic has not credited the subsidy
381
+ to the bank due to pending portal actions physical verification delays repeated
382
+ document objections or prolonged under process status without timelines. <NUM>
383
+ . grievances related to khadi subsidies including non-release partial release
384
+ or unexplained reduction of admissible subsidy amounts stoppage of subsidy citing
385
+ non-compliance without sharing inspection reports deviations from prescribed scheme
386
+ norms in determining subsidy eligibility or quantum <NUM> . issues related to
387
+ kvic certification and registration including pending or delayed issuance of khadi
388
+ certificates cancellation of certification without prior notice or stated reasons
389
+ inspection-related delays without clarification delayed renewal of certificates
390
+ that directly affect eligibility for subsidies tenders and market access subcategories
391
+ <NUM> . providing financial assistance to set up new enterprises under pmegp <NUM>
392
+ . providing insurance cover to khadi artisans under aam admi bima yojana <NUM>
393
+ . providing financial assistance to khadi institutions under mda <NUM> . workshed
394
+ scheme for khadi artisans <NUM> . loans under interest subsidy eligibility certificate
395
+ scheme isec <NUM> . mission solar charkha
396
+ - Policy and Schemes. Related to DCMSME Scheme. this category is related to grievances
397
+ under the dcmsme scheme specifically focusing on issues related to access to credit
398
+ from banks for micro small and medium enterprises msmes . the category applies
399
+ to commercial banks regional rural banks rrbs and cooperative banks and covers
400
+ cases where the bottleneck lies entirely at the bank level. it excludes issues
401
+ related to rbi policy government scheme design credit guarantee mechanisms or
402
+ buyer default but rather addresses bank-side processing conditions or conduct
403
+ in extending credit to msmes. the category includes cases where msmes have applied
404
+ for loans submitted required documents and followed up through branches or digital
405
+ portals but the loan application remains pending without a formal sanction or
406
+ rejection decision. it captures administrative stalling such as prolonged under
407
+ process or pending for verification status absence of deficiency letters or timelines
408
+ repeated demands for already-submitted documents and failure of branch offices
409
+ to forward eligible applications to regional or head offices for approval. additionally
410
+ the category covers situations where loans have been formally sanctioned but disbursement
411
+ is delayed or withheld by the bank without valid or documented reasons. it includes
412
+ cases of prolonged non-disbursement despite fulfilment of sanction conditions
413
+ partial disbursement with unexplained withholding of the balance amount delays
414
+ citing internal audits or reviews and imposition of additional post-sanction conditions
415
+ that were not mentioned in the original sanction letter. the category also includes
416
+ grievances related to excessive or unreasonable collateral demands by banks where
417
+ security requirements exceed applicable msme rbi or cgtmse guidelines. this includes
418
+ insistence on collateral despite eligibility for credit guarantee coverage demands
419
+ for disproportionate collateral value rejection of loan applications solely due
420
+ to refusal to provide personal or residential property as security and requirements
421
+ for subcategories <NUM> . tcec division for implementation of the scheme establishement
422
+ of new technology centres extension centres <NUM> . economic analysis <NUM> .
423
+ statistics data division <NUM> . national awards <NUM> . entrepreneurship skill
424
+ development programmes esdp <NUM> . vendor development programme for ancillarisation
425
+ <NUM> . export promotion wto <NUM> . msme policy industry associations related
426
+ issues <NUM> .software related <NUM> . zero defect zero effect zed <NUM> .technology
427
+ center system program tcsp <NUM> . north east region cell ner promotion of msmes
428
+ in ner and sikkim <NUM> .international trade fair itf and international cooperation
429
+ ic <NUM> .support for entrepreneurial and managerial development of smes through
430
+ incubators- an nmcp scheme <NUM> .building awareness on intellectual property
431
+ rights ipr for the micro small medium enterprises- an nmcp scheme <NUM> .lean
432
+ manufacturing competitiveness scheme lmcs <NUM> . design clinic scheme - an nmcp
433
+ scheme <NUM> . pms scheme <NUM> . technology and quality upgradation tequp support
434
+ to msmes- an nmcp scheme <NUM> . digital msme - an nmcp scheme <NUM> .micro small
435
+ enterprises cluster development programme mse-cdp <NUM> .credit linked capital
436
+ subsidy for technology upgradation clcs- tu special clcs for sc st <NUM> .credit
437
+ guarantee fund for micro and smali enterprises cgtmse <NUM> . market development
438
+ assistance mda to msmes
439
+ pipeline_tag: sentence-similarity
440
+ library_name: sentence-transformers
441
+ metrics:
442
+ - pearson_cosine
443
+ - spearman_cosine
444
+ model-index:
445
+ - name: SentenceTransformer based on sentence-transformers/all-mpnet-base-v2
446
+ results:
447
+ - task:
448
+ type: semantic-similarity
449
+ name: Semantic Similarity
450
+ dataset:
451
+ name: Unknown
452
+ type: unknown
453
+ metrics:
454
+ - type: pearson_cosine
455
+ value: .nan
456
+ name: Pearson Cosine
457
+ - type: spearman_cosine
458
+ value: .nan
459
+ name: Spearman Cosine
460
+ ---
461
+
462
+ # SentenceTransformer based on sentence-transformers/all-mpnet-base-v2
463
+
464
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
465
+
466
+ ## Model Details
467
+
468
+ ### Model Description
469
+ - **Model Type:** Sentence Transformer
470
+ - **Base model:** [sentence-transformers/all-mpnet-base-v2](https://huggingface.co/sentence-transformers/all-mpnet-base-v2) <!-- at revision e8c3b32edf5434bc2275fc9bab85f82640a19130 -->
471
+ - **Maximum Sequence Length:** 128 tokens
472
+ - **Output Dimensionality:** 768 dimensions
473
+ - **Similarity Function:** Cosine Similarity
474
+ <!-- - **Training Dataset:** Unknown -->
475
+ <!-- - **Language:** Unknown -->
476
+ <!-- - **License:** Unknown -->
477
+
478
+ ### Model Sources
479
+
480
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
481
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/huggingface/sentence-transformers)
482
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
483
+
484
+ ### Full Model Architecture
485
+
486
+ ```
487
+ SentenceTransformer(
488
+ (0): Transformer({'max_seq_length': 128, 'do_lower_case': False, 'architecture': 'MPNetModel'})
489
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
490
+ (2): Normalize()
491
+ )
492
+ ```
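The `Pooling` and `Normalize` modules listed above can be pictured with a small pure-Python sketch. It assumes mean pooling averages only the non-padding token vectors and that normalization is an L2 rescale to unit length; the function names `mean_pool` and `l2_normalize` are illustrative and not part of the library.

```python
import math


def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors whose mask entry is 1 (padding is skipped)."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vector):
                total[i] += value
    return [value / max(count, 1) for value in total]


def l2_normalize(vector):
    """Rescale a vector to unit length; a zero vector is returned unchanged."""
    norm = math.sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector] if norm > 0 else list(vector)


# Toy input: three 2-d token vectors, the last one is padding.
tokens = [[1.0, 2.0], [3.0, 6.0], [100.0, 100.0]]
mask = [1, 1, 0]
sentence_embedding = l2_normalize(mean_pool(tokens, mask))
print(sentence_embedding)
```

Because the final vectors have unit length, the cosine similarity between two embeddings reduces to their dot product.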
493
+
494
+ ## Usage
495
+
496
+ ### Direct Usage (Sentence Transformers)
497
+
498
+ First install the Sentence Transformers library:
499
+
500
+ ```bash
501
+ pip install -U sentence-transformers
502
+ ```
503
+
504
+ Then you can load this model and run inference.
505
+ ```python
506
+ from sentence_transformers import SentenceTransformer
507
+
508
+ # Download from the 🤗 Hub
509
+ model = SentenceTransformer("sentence_transformers_model_id")
510
+ # Run inference
511
+ sentences = [
512
+ 'loanagreement no isbl00910729978dated26- <NUM> - <NUM> loan payment is pending since <NUM> sep <NUM> . hdfc bank has returned the cheque stating it as alteration under rbi guidelines. pli and nodal agencies contact numbers are found out of service. i am unable to connect with them. i have attached the loan agreement pdf for reference. please support to get the resolution as it is pending since <NUM> sep <NUM> . issue non-receipt of loan payment under dcmsme scheme context the user is reporting non-receipt of loan payment since <NUM> <NUM> <NUM> citing hdfc bank s return of cheque as alteration under rbi guidelines and requesting assistance in resolving the issue. details - loan agreement no isbl00910729978 loan agreement date <NUM> <NUM> <NUM> cheque return reason alteration under rbi guidelines attached with application',
513
+ 'Policy and Schemes. Related to DCMSME Scheme. this category is related to grievances under the dcmsme scheme specifically focusing on issues related to access to credit from banks for micro small and medium enterprises msmes . the category applies to commercial banks regional rural banks rrbs and cooperative banks and covers cases where the bottleneck lies entirely at the bank level. it excludes issues related to rbi policy government scheme design credit guarantee mechanisms or buyer default but rather addresses bank-side processing conditions or conduct in extending credit to msmes. the category includes cases where msmes have applied for loans submitted required documents and followed up through branches or digital portals but the loan application remains pending without a formal sanction or rejection decision. it captures administrative stalling such as prolonged under process or pending for verification status absence of deficiency letters or timelines repeated demands for already-submitted documents and failure of branch offices to forward eligible applications to regional or head offices for approval. additionally the category covers situations where loans have been formally sanctioned but disbursement is delayed or withheld by the bank without valid or documented reasons. it includes cases of prolonged non-disbursement despite fulfilment of sanction conditions partial disbursement with unexplained withholding of the balance amount delays citing internal audits or reviews and imposition of additional post-sanction conditions that were not mentioned in the original sanction letter. the category also includes grievances related to excessive or unreasonable collateral demands by banks where security requirements exceed applicable msme rbi or cgtmse guidelines. this includes insistence on collateral despite eligibility for credit guarantee coverage demands for disproportionate collateral value rejection of loan applications solely due to refusal to provide personal or residential property as security and requirements for subcategories <NUM> . tcec division for implementation of the scheme establishement of new technology centres extension centres <NUM> . economic analysis <NUM> . statistics data division <NUM> . national awards <NUM> . entrepreneurship skill development programmes esdp <NUM> . vendor development programme for ancillarisation <NUM> . export promotion wto <NUM> . msme policy industry associations related issues <NUM> .software related <NUM> . zero defect zero effect zed <NUM> .technology center system program tcsp <NUM> . north east region cell ner promotion of msmes in ner and sikkim <NUM> .international trade fair itf and international cooperation ic <NUM> .support for entrepreneurial and managerial development of smes through incubators- an nmcp scheme <NUM> .building awareness on intellectual property rights ipr for the micro small medium enterprises- an nmcp scheme <NUM> .lean manufacturing competitiveness scheme lmcs <NUM> . design clinic scheme - an nmcp scheme <NUM> . pms scheme <NUM> . technology and quality upgradation tequp support to msmes- an nmcp scheme <NUM> . digital msme - an nmcp scheme <NUM> .micro small enterprises cluster development programme mse-cdp <NUM> .credit linked capital subsidy for technology upgradation clcs- tu special clcs for sc st <NUM> .credit guarantee fund for micro and smali enterprises cgtmse <NUM> . market development assistance mda to msmes',
514
+ 'Technology, Quality and Institutions. Related to Scheme of KVIC. this category encompasses grievances related to schemes subsidies certifications and implementation processes administered by the khadi village industries commission kvic and its implementing authorities including state kvic and district industries centre dic offices. it specifically addresses issues that originate from kvic or its field-level offices excluding problems solely with banks generic msme schemes or non-kvic authorities. the category covers a range of issues including <NUM> . delays or failures in the release of pmegp margin money subsidies where loans have already been sanctioned and units have been set up but kvic has not credited the subsidy to the bank due to pending portal actions physical verification delays repeated document objections or prolonged under process status without timelines. <NUM> . grievances related to khadi subsidies including non-release partial release or unexplained reduction of admissible subsidy amounts stoppage of subsidy citing non-compliance without sharing inspection reports deviations from prescribed scheme norms in determining subsidy eligibility or quantum <NUM> . issues related to kvic certification and registration including pending or delayed issuance of khadi certificates cancellation of certification without prior notice or stated reasons inspection-related delays without clarification delayed renewal of certificates that directly affect eligibility for subsidies tenders and market access subcategories <NUM> . providing financial assistance to set up new enterprises under pmegp <NUM> . providing insurance cover to khadi artisans under aam admi bima yojana <NUM> . providing financial assistance to khadi institutions under mda <NUM> . workshed scheme for khadi artisans <NUM> . loans under interest subsidy eligibility certificate scheme isec <NUM> . mission solar charkha',
515
+ ]
516
+ embeddings = model.encode(sentences)
517
+ print(embeddings.shape)
518
+ # (3, 768)
519
+
520
+ # Get the similarity scores for the embeddings
521
+ similarities = model.similarity(embeddings, embeddings)
522
+ print(similarities)
523
+ # tensor([[1.0000, 0.5738, 0.4289],
524
+ # [0.5738, 1.0000, 0.5811],
525
+ # [0.4289, 0.5811, 1.0000]])
526
+ ```
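The example above compares one grievance text against candidate category descriptions. If the model is used to route a grievance to its best-matching category (an assumption based on the sample pairs, not something the card states), the ranking step could look like the following sketch. The helper `rank_by_similarity` and the 2-d toy vectors are hypothetical stand-ins for real 768-d embeddings.

```python
def dot(a, b):
    """Dot product; equals cosine similarity when both vectors are unit length."""
    return sum(x * y for x, y in zip(a, b))


def rank_by_similarity(query_vec, candidates):
    """Return (label, score) pairs sorted from most to least similar.

    Assumes every vector is already L2-normalized.
    """
    scored = [(label, dot(query_vec, vec)) for label, vec in candidates.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical unit vectors standing in for model.encode(...) outputs.
grievance = [0.6, 0.8]
categories = {
    "DCMSME Scheme": [0.8, 0.6],
    "Scheme of KVIC": [1.0, 0.0],
    "Related to GST": [0.0, 1.0],
}
ranking = rank_by_similarity(grievance, categories)
print(ranking[0][0])
```

With real embeddings, the first row of the similarity matrix printed above plays the role of the scores, and the highest off-diagonal value in that row identifies the closest category.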
527
+
528
+ <!--
529
+ ### Direct Usage (Transformers)
530
+
531
+ <details><summary>Click to see the direct usage in Transformers</summary>
532
+
533
+ </details>
534
+ -->
535
+
536
+ <!--
537
+ ### Downstream Usage (Sentence Transformers)
538
+
539
+ You can finetune this model on your own dataset.
540
+
541
+ <details><summary>Click to expand</summary>
542
+
543
+ </details>
544
+ -->
545
+
546
+ <!--
547
+ ### Out-of-Scope Use
548
+
549
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
550
+ -->
551
+
552
+ ## Evaluation
553
+
554
+ ### Metrics
555
+
556
+ #### Semantic Similarity
557
+
558
+ * Evaluated with [<code>EmbeddingSimilarityEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.EmbeddingSimilarityEvaluator)
559
+
560
+ | Metric | Value |
561
+ |:--------------------|:--------|
562
+ | pearson_cosine | nan |
563
+ | **spearman_cosine** | **nan** |
564
+
565
+ <!--
566
+ ## Bias, Risks and Limitations
567
+
568
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
569
+ -->
570
+
571
+ <!--
572
+ ### Recommendations
573
+
574
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
575
+ -->
576
+
577
+ ## Training Details
578
+
579
+ ### Training Dataset
580
+
581
+ #### Unnamed Dataset
582
+
583
+ * Size: 90 training samples
584
+ * Columns: <code>sentence_0</code> and <code>sentence_1</code>
585
+ * Approximate statistics based on the first 90 samples:
586
+ | | sentence_0 | sentence_1 |
587
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
588
+ | type | string | string |
589
+ | details | <ul><li>min: 33 tokens</li><li>mean: 116.86 tokens</li><li>max: 128 tokens</li></ul> | <ul><li>min: 128 tokens</li><li>mean: 128.0 tokens</li><li>max: 128 tokens</li></ul> |
590
+ * Samples:
591
+ | sentence_0 | sentence_1 |
592
+ |:-----------|:-----------|
593
+ | <code>the msme portal software keeps crashing during udyam registration renewal and scheme applications with error messages and failed uploads every time i try. support team gives no help and i can t access my digital certificates or track status. this software glitch blocks my business from government benefits and loans. please fix the bugs improve server speed and add better error guides right away. issue software glitch in msme portal during udyam registration renewal and scheme applications context the user is reporting frequent crashes of the msme portal software during udyam registration renewal and scheme applications resulting in failed uploads error messages and inability to access digital certificates or track status which is hindering business access to government benefits and loans. details - software msme portal software issue frequent crashes during udyam registration renewal and scheme applications error messages failed uploads and error messages impact inability to access dig...</code> | <code>Technology, Quality and Institutions. Software Related. software-related initiatives for msmes mainly center on the digital msme scheme under the national manufacturing competitiveness programme which promotes adoption of information and communication technologies through cloud-based erp crm and accounting software to digitalize day-to-day business operations. the scheme combines awareness workshops needs assessment and financial support in the form of subsidies covering about <NUM> <NUM> of eligible costs subject to a ceiling of <NUM> lakh over two years specifically targeting micro and small enterprises. these initiatives are reinforced by complementary efforts such as software-enabled facilities under technology centre programmes for electronics and esdm sectors digital quality and process parameters under zed certification and software-focused modules within entrepreneurship and skill development programmes. together these measures aim to standardize workflows automate inventory fi...</code> |
594
+ | <code>msme scheme guidelines and forms under official language policy are only in hindi or poorly translated english making it hard for me to understand eligibility and apply correctly. i keep making errors in submissions because of confusing language and staff reject them without clear explanations. please provide all msme documents in simple english or bilingual format to help non-hindi speakers like me access schemes easily. issue non-availability of msme scheme guidelines and forms in simple english context the user is reporting difficulty in understanding the eligibility and applying for msme schemes due to the availability of guidelines and forms only in hindi or poorly translated english and is requesting provision of these documents in simple english or bilingual format to facilitate access for non-hindi speakers. details - language issue msme scheme guidelines and forms available only in hindi or poorly translated english request provision of documents in simple english or bilingual...</code> | <code>Technology, Quality and Institutions. Official Language Related Issues. official language related issues in msme administration concern the implementation of hindi rajbhasha in accordance with the official languages act <NUM> as amended across the ministry of msme its development institutes field offices and attached organizations. this framework mandates progressive use of hindi in official work bilingual hindi english documentation replies in hindi to communications received in hindi availability of hindi-enabled software on computers and regular training in hindi typing and computing for officials. the ministry monitors compliance through official language implementation committees quarterly progress reviews rajbhasha inspections and conferences while ensuring that citizens charters schemes portals and public-facing information are available bilingually. these measures aim to improve accessibility for hindi-speaking msmes enhance transparency and inclusiveness strengthen regional ou...</code> |
595
+ | <code>dear sir my uam <udyam_no> has already cancelled but unable to register new firm through my aadhar number <NUM> - <NUM> - <NUM> kindly delete my aadhar number or suggest to how register new firm with same aadhar number issue deletion of aadhar number from udyam registration system context the user is requesting deletion of the aadhar number from the udyam registration system as it is associated with a cancelled udyam registration number and is unable to register a new firm using the same aadhar number. details - udyam registration number udyam-ap- <NUM> - <NUM> aadhar number <NUM> - <NUM> - <NUM></code> | <code>UAM/Udyam Registration/Certificate related issues. After Cancellation, Unable to Register with PAN Details (Technical). this category refers to grievances where an entrepreneur is unable to create a new udyam registration using their pan after an earlier registration has already been cancelled. in such situations the system may continue to recognize the pan as already associated with an existing registration preventing the user from completing a new registration. grievances under this category generally occur when an enterprise previously cancelled its registration due to closure incorrect details or duplication and later attempts to register again using the same pan. users may report that the system still displays a message indicating that a registration already exists for that pan even though the earlier registration was cancelled. some entrepreneurs also encounter errors where the portal does not allow them to proceed with registration because the pan remains linked to the previous ...</code> |
596
+ * Loss: [<code>CachedMultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cachedmultiplenegativesrankingloss) with these parameters:
597
+ ```json
598
+ {
599
+ "scale": 20.0,
600
+ "similarity_fct": "cos_sim",
601
+ "mini_batch_size": 32,
602
+ "gather_across_devices": false
603
+ }
604
+ ```
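The parameters above (`scale` of 20.0 with cosine similarity) describe an in-batch ranking objective. The following pure-Python sketch shows the loss value such an objective computes, assuming each `sentence_1` is the positive for its own `sentence_0` and every other `sentence_1` in the batch acts as a negative. It is not the library's implementation: the cached variant additionally processes the batch in mini-batches (32 here) to save memory, which this sketch leaves out, and the function names are illustrative.

```python
import math


def cosine(a, b):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def multiple_negatives_ranking_loss(anchors, positives, scale=20.0):
    """Mean cross-entropy where positives[i] is the correct match for anchors[i]."""
    losses = []
    for i, anchor in enumerate(anchors):
        logits = [scale * cosine(anchor, positive) for positive in positives]
        peak = max(logits)  # subtract the max for numerical stability
        log_sum_exp = peak + math.log(sum(math.exp(l - peak) for l in logits))
        losses.append(log_sum_exp - logits[i])
    return sum(losses) / len(losses)


# Toy 2-d embeddings: each anchor matches the positive at the same index.
anchors = [[1.0, 0.0], [0.0, 1.0]]
aligned = [[1.0, 0.0], [0.0, 1.0]]
swapped = [[0.0, 1.0], [1.0, 0.0]]

print(multiple_negatives_ranking_loss(anchors, aligned))  # close to zero
print(multiple_negatives_ranking_loss(anchors, swapped))  # close to the scale value
```

The scale multiplies the cosine scores before the softmax, so a larger value penalizes a wrong top match more sharply.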
605
+
606
+ ### Training Hyperparameters
607
+ #### Non-Default Hyperparameters
608
+
609
+ - `per_device_train_batch_size`: 32
610
+ - `per_device_eval_batch_size`: 32
611
+ - `num_train_epochs`: 5
612
+ - `fp16`: True
613
+ - `multi_dataset_batch_sampler`: round_robin
614
+
615
+ #### All Hyperparameters
616
+ <details><summary>Click to expand</summary>
617
+
618
+ - `do_predict`: False
619
+ - `eval_strategy`: no
620
+ - `prediction_loss_only`: True
621
+ - `per_device_train_batch_size`: 32
622
+ - `per_device_eval_batch_size`: 32
623
+ - `gradient_accumulation_steps`: 1
624
+ - `eval_accumulation_steps`: None
625
+ - `torch_empty_cache_steps`: None
626
+ - `learning_rate`: 5e-05
627
+ - `weight_decay`: 0.0
628
+ - `adam_beta1`: 0.9
629
+ - `adam_beta2`: 0.999
630
+ - `adam_epsilon`: 1e-08
631
+ - `max_grad_norm`: 1
632
+ - `num_train_epochs`: 5
633
+ - `max_steps`: -1
634
+ - `lr_scheduler_type`: linear
635
+ - `lr_scheduler_kwargs`: None
636
+ - `warmup_ratio`: None
637
+ - `warmup_steps`: 0
638
+ - `log_level`: passive
639
+ - `log_level_replica`: warning
640
+ - `log_on_each_node`: True
641
+ - `logging_nan_inf_filter`: True
642
+ - `enable_jit_checkpoint`: False
643
+ - `save_on_each_node`: False
644
+ - `save_only_model`: False
645
+ - `restore_callback_states_from_checkpoint`: False
646
+ - `use_cpu`: False
647
+ - `seed`: 42
648
+ - `data_seed`: None
649
+ - `bf16`: False
650
+ - `fp16`: True
651
+ - `bf16_full_eval`: False
652
+ - `fp16_full_eval`: False
653
+ - `tf32`: None
654
+ - `local_rank`: -1
655
+ - `ddp_backend`: None
656
+ - `debug`: []
657
+ - `dataloader_drop_last`: False
658
+ - `dataloader_num_workers`: 0
659
+ - `dataloader_prefetch_factor`: None
660
+ - `disable_tqdm`: False
661
+ - `remove_unused_columns`: True
662
+ - `label_names`: None
663
+ - `load_best_model_at_end`: False
664
+ - `ignore_data_skip`: False
665
+ - `fsdp`: []
666
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
667
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
668
+ - `parallelism_config`: None
669
+ - `deepspeed`: None
670
+ - `label_smoothing_factor`: 0.0
671
+ - `optim`: adamw_torch_fused
672
+ - `optim_args`: None
673
+ - `group_by_length`: False
674
+ - `length_column_name`: length
675
+ - `project`: huggingface
676
+ - `trackio_space_id`: trackio
677
+ - `ddp_find_unused_parameters`: None
678
+ - `ddp_bucket_cap_mb`: None
679
+ - `ddp_broadcast_buffers`: False
680
+ - `dataloader_pin_memory`: True
681
+ - `dataloader_persistent_workers`: False
682
+ - `skip_memory_metrics`: True
683
+ - `push_to_hub`: False
684
+ - `resume_from_checkpoint`: None
685
+ - `hub_model_id`: None
686
+ - `hub_strategy`: every_save
687
+ - `hub_private_repo`: None
688
+ - `hub_always_push`: False
689
+ - `hub_revision`: None
690
+ - `gradient_checkpointing`: False
691
+ - `gradient_checkpointing_kwargs`: None
692
+ - `include_for_metrics`: []
693
+ - `eval_do_concat_batches`: True
694
+ - `auto_find_batch_size`: False
695
+ - `full_determinism`: False
696
+ - `ddp_timeout`: 1800
697
+ - `torch_compile`: False
698
+ - `torch_compile_backend`: None
699
+ - `torch_compile_mode`: None
700
+ - `include_num_input_tokens_seen`: no
701
+ - `neftune_noise_alpha`: None
702
+ - `optim_target_modules`: None
703
+ - `batch_eval_metrics`: False
704
+ - `eval_on_start`: False
705
+ - `use_liger_kernel`: False
706
+ - `liger_kernel_config`: None
707
+ - `eval_use_gather_object`: False
708
+ - `average_tokens_across_devices`: True
709
+ - `use_cache`: False
710
+ - `prompts`: None
711
+ - `batch_sampler`: batch_sampler
712
+ - `multi_dataset_batch_sampler`: round_robin
713
+ - `router_mapping`: {}
714
+ - `learning_rate_mapping`: {}
715
+
716
+ </details>
717
+
718
+ ### Training Logs
719
+ | Epoch | Step | spearman_cosine |
720
+ |:-----:|:----:|:---------------:|
721
+ | 1.0 | 3 | nan |
722
+ | 2.0 | 6 | nan |
723
+ | 3.0 | 9 | nan |
724
+ | 4.0 | 12 | nan |
725
+ | 5.0 | 15 | nan |
726
+
727
+
728
+ ### Framework Versions
729
+ - Python: 3.12.12
730
+ - Sentence Transformers: 5.2.3
731
+ - Transformers: 5.0.0
732
+ - PyTorch: 2.10.0+cu128
733
+ - Accelerate: 1.13.0
734
+ - Datasets: 4.0.0
735
+ - Tokenizers: 0.22.2
736
+
737
+ ## Citation
+
+ ### BibTeX
+
+ #### Sentence Transformers
+ ```bibtex
+ @inproceedings{reimers-2019-sentence-bert,
+     title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
+     author = "Reimers, Nils and Gurevych, Iryna",
+     booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
+     month = "11",
+     year = "2019",
+     publisher = "Association for Computational Linguistics",
+     url = "https://arxiv.org/abs/1908.10084",
+ }
+ ```
+
+ #### CachedMultipleNegativesRankingLoss
+ ```bibtex
+ @misc{gao2021scaling,
+     title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup},
+     author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan},
+     year={2021},
+     eprint={2101.06983},
+     archivePrefix={arXiv},
+     primaryClass={cs.LG}
+ }
+ ```
+
+ <!--
+ ## Glossary
+
+ *Clearly define terms in order to be accessible across audiences.*
+ -->
+
+ <!--
+ ## Model Card Authors
+
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
+ -->
+
+ <!--
+ ## Model Card Contact
+
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
+ -->
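
The Training Logs table in the README reports `spearman_cosine` as `nan` at every evaluation step. The logs do not say why. One well-known way a Spearman correlation becomes undefined is when one of the two inputs is constant (for example, all gold scores identical), because the rank variance is then zero. The sketch below is a plain-Python illustration of that behaviour, not the evaluator's actual code; the function names `ranks` and `spearman` are hypothetical.

```python
import math


def ranks(values):
    """Return 1-based average ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(x, y):
    """Spearman correlation as the Pearson correlation of the ranks."""
    rx, ry = ranks(x), ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        # A constant input has no rank variance, so the correlation is undefined.
        return float("nan")
    return cov / math.sqrt(var_x * var_y)


predicted = [0.1, 0.5, 0.9]
print(spearman(predicted, [1.0, 2.0, 3.0]))  # varying gold scores
print(spearman(predicted, [1.0, 1.0, 1.0]))  # constant gold scores give nan
```

If the evaluation set used here has constant labels or constant predicted similarities, this would produce the `nan` values seen in the table; checking the label distribution of the evaluation data is a reasonable first step.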
config.json ADDED
@@ -0,0 +1,24 @@
+ {
+   "architectures": [
+     "MPNetModel"
+   ],
+   "attention_probs_dropout_prob": 0.1,
+   "bos_token_id": 0,
+   "dtype": "float32",
+   "eos_token_id": 2,
+   "hidden_act": "gelu",
+   "hidden_dropout_prob": 0.1,
+   "hidden_size": 768,
+   "initializer_range": 0.02,
+   "intermediate_size": 3072,
+   "layer_norm_eps": 1e-05,
+   "max_position_embeddings": 514,
+   "model_type": "mpnet",
+   "num_attention_heads": 12,
+   "num_hidden_layers": 12,
+   "pad_token_id": 1,
+   "relative_attention_num_buckets": 32,
+   "tie_word_embeddings": true,
+   "transformers_version": "5.0.0",
+   "vocab_size": 30527
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
+ {
+   "__version__": {
+     "sentence_transformers": "5.2.3",
+     "transformers": "5.0.0",
+     "pytorch": "2.10.0+cu128"
+   },
+   "model_type": "SentenceTransformer",
+   "prompts": {
+     "query": "",
+     "document": ""
+   },
+   "default_prompt_name": null,
+   "similarity_fn_name": "cosine"
+ }
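
The `similarity_fn_name` of `cosine` means embeddings are compared by the cosine of the angle between them. As a minimal sketch (plain Python, with a hypothetical helper name `cosine_similarity`, not the library's implementation), the computation looks like this:

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


print(cosine_similarity([3.0, 4.0], [3.0, 4.0]))  # identical direction
print(cosine_similarity([1.0, 0.0], [0.0, 1.0]))  # orthogonal vectors
```

Because `modules.json` in this commit ends with a `Normalize` module, the stored embeddings should already have unit length, in which case the cosine similarity reduces to a plain dot product.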
model.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6b5f9f0a4d19ff98d3c9cf028128a99c8eae575bf1cf22b68fb23ca620fab331
+ size 437967648
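
The Git LFS pointer gives only the file size, but combined with `"dtype": "float32"` from `config.json` it allows a rough parameter-count estimate. The sketch below assumes every stored tensor is float32 (4 bytes per value) and ignores the small safetensors header, so the result is a slight overestimate.

```python
# Size in bytes, taken from the Git LFS pointer for model.safetensors.
SIZE_BYTES = 437_967_648

# Assumption: all weights are stored as float32, as config.json's "dtype" suggests.
BYTES_PER_PARAM = 4

approx_params = SIZE_BYTES // BYTES_PER_PARAM
print(f"{approx_params:,} parameters (upper bound)")  # 109,491,912 parameters (upper bound)
```

That is roughly 109 million parameters, which is in line with a 12-layer, 768-dimensional MPNet encoder.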
modules.json ADDED
@@ -0,0 +1,20 @@
+ [
+   {
+     "idx": 0,
+     "name": "0",
+     "path": "",
+     "type": "sentence_transformers.models.Transformer"
+   },
+   {
+     "idx": 1,
+     "name": "1",
+     "path": "1_Pooling",
+     "type": "sentence_transformers.models.Pooling"
+   },
+   {
+     "idx": 2,
+     "name": "2",
+     "path": "2_Normalize",
+     "type": "sentence_transformers.models.Normalize"
+   }
+ ]
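
`modules.json` describes a three-stage pipeline: a Transformer produces one vector per token, Pooling collapses them into a single vector (mean pooling, per `1_Pooling/config.json`), and Normalize scales the result to unit length. The following is a minimal plain-Python sketch of the last two stages, assuming the token embeddings are already available; `mean_pool` and `l2_normalize` are hypothetical names, not Sentence Transformers functions.

```python
import math


def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors whose mask entry is 1, skipping padding."""
    dim = len(token_embeddings[0])
    total = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vector):
                total[i] += value
    count = max(count, 1)  # guard against an all-padding input
    return [t / count for t in total]


def l2_normalize(vector):
    """Scale a vector to unit Euclidean length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(v * v for v in vector))
    return [v / norm for v in vector] if norm > 0 else list(vector)


# Toy 2-dimensional "token embeddings"; the last token is padding and is masked out.
tokens = [[1.0, 2.0], [3.0, 6.0], [100.0, 100.0]]
mask = [1, 1, 0]

pooled = mean_pool(tokens, mask)
embedding = l2_normalize(pooled)
print(pooled)
print(embedding)
```

In the real model each token vector has 768 dimensions (`hidden_size` in `config.json`), so the final sentence embedding is a 768-dimensional unit vector.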
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
+ {
+   "max_seq_length": 128,
+   "do_lower_case": false
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
tokenizer_config.json ADDED
@@ -0,0 +1,16 @@
+ {
+   "backend": "tokenizers",
+   "bos_token": "<s>",
+   "cls_token": "<s>",
+   "do_lower_case": true,
+   "eos_token": "</s>",
+   "is_local": false,
+   "mask_token": "<mask>",
+   "model_max_length": 384,
+   "pad_token": "<pad>",
+   "sep_token": "</s>",
+   "strip_accents": null,
+   "tokenize_chinese_chars": true,
+   "tokenizer_class": "MPNetTokenizer",
+   "unk_token": "[UNK]"
+ }
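
Three different length limits appear across the files in this commit: `max_seq_length` of 128 in `sentence_bert_config.json`, `model_max_length` of 384 in `tokenizer_config.json`, and `max_position_embeddings` of 514 in `config.json`. The sketch below assumes the smallest of these governs how much input the model actually sees, which matches the usual role of `max_seq_length` in Sentence Transformers as far as I know. The `truncate` helper is hypothetical and, unlike a real tokenizer, does not preserve special tokens.

```python
# Length limits as they appear in the three config files of this commit.
LIMITS = {
    "sentence_bert_config.json: max_seq_length": 128,
    "tokenizer_config.json: model_max_length": 384,
    "config.json: max_position_embeddings": 514,
}

# Assumption: the tightest limit is the one that takes effect at encode time.
effective_limit = min(LIMITS.values())


def truncate(token_ids, limit=effective_limit):
    """Keep at most `limit` token ids (simplified: ignores special tokens)."""
    return token_ids[:limit]


long_input = list(range(200))  # stand-in for a 200-token sequence
print(effective_limit)
print(len(truncate(long_input)))
```

Under that assumption, text beyond 128 tokens is dropped before embedding, so longer documents would need to be chunked by the caller.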