Ambika14's picture
Upload folder using huggingface_hub
1f25fdc verified
metadata
tags:
  - sentence-transformers
  - sentence-similarity
  - feature-extraction
  - dense
  - generated_from_trainer
  - dataset_size:88
  - loss:CachedMultipleNegativesRankingLoss
base_model: BAAI/bge-base-en-v1.5
widget:
  - source_sentence: >-
      the nearest technology center under tcsp is too far and fully booked so i
      can t access their advanced machinery skill training or tech advisory for
      upgrading my production line. local msmes like mine are losing
      competitiveness without nearby centers in delhi region. please set up or
      expand tcsp facilities closer and prioritize slots for small units to
      adopt modern tech fast. issue non-availability of nearby technology center
      under tcsp context the user is reporting that the nearest technology
      center under tcsp is too far and fully booked which is hindering access to
      advanced machinery skill training and tech advisory for upgrading the
      production line and is requesting the establishment or expansion of tcsp
      facilities closer to the delhi region to prioritize slots for small units.
      details - location delhi region requirement establishment or expansion of
      tcsp facilities priority slots for small units
    sentences:
      - >-
        Technology, Quality and Institutions. Testing, Quality, Testing Center.
        this category encompasses grievances related to msmes micro small and
        medium enterprises inability to access utilize or rely on
        government-recognized testing calibration inspection or certification
        services required for regulatory compliance tenders gem listing or
        exports. the category covers a range of issues including delays in the
        issuance of test reports despite samples being submitted and fees paid
        denial or non-issuance of quality or conformity certificates without
        clear reasons difficulties accessing testing or calibration facilities
        due to - capacity constraints - administrative refusal - non-functional
        equipment procedural and system-level barriers such as - unclear or
        changing documentation requirements - portal mismatches - fees paid but
        testing not scheduled situations where business losses occur due to
        market access being blocked due to pending testing or certification at
        authorized labs or msme testing centers. example issues include testing
        completed and fees paid but test report is not issued even after many
        weeks quality certification was rejected without written reasons despite
        compliance with guidelines testing center is refusing to accept samples
        citing workload while deadlines are approaching fees paid online but
        testing not scheduled due to portal or procedural issues tender or
        export shipment is blocked because the required test certificate is
        still pending at the testing lab. the purpose of this category is to
        capture grievances related to the operational procedural policy or
        institutional causes that hinder msmes access to government-recognized
        testing calibration inspection or certification services. the category
        aims to identify and address the root causes of these issues including
        capacity constraints at testing facilities inade
      - >-
        Policy and Schemes. Related to MSME Scheme. this category encompasses
        grievances related to central sector schemes directly administered by
        the ministry of micro small and medium enterprises momsme where the
        ministry itself serves as the implementing authority. the category
        includes schemes such as zero defect zero effect zed credit linked
        capital subsidy scheme clcss lean manufacturing and other centrally
        managed msme support programs. it covers cases where msmes have applied
        for scheme benefits or subsidies received approvals or completed
        required assessments or certifications but the approved financial
        assistance has not been released or credited. the category also captures
        grievances where claims submitted under ministry-run schemes for
        incentives reimbursements or financial support remain pending for
        extended periods or are rejected without clear or consistent
        justification. this includes cases of rejection due to alleged
        documentation gaps system-generated ineligibility flags disputes over
        eligible machinery or activities and delays caused by human or
        system-level verification failures. additionally the category includes
        grievances arising from ambiguity or confusion regarding scheme
        eligibility scope or applicability such as uncertainty over mandatory
        certifications eligibility of second-hand versus new machinery
        applicability to service enterprises or inconsistent interpretations of
        scheme rules by different central or state offices. the category further
        covers portal-related issues affecting scheme access and execution
        including technical errors during registration or document upload login
        or authentication failures contradictory status messages and
        non-updating dashboards for application claim or training progress.
        these issues typically arise due to system bugs integration gaps between
        multiple portals file format or size restrictions or delays in updating
        portal logic after scheme guideline revisions.
      - >-
        Technology, Quality and Institutions. Technology Center System Program
        (TCSP). the technology centre systems programme tcsp is a flagship
        initiative of the ministry of micro small and medium enterprises
        approved in <NUM> with world bank support to significantly strengthen
        india s msme manufacturing ecosystem through advanced technology access
        and skill development. with a total outlay of about <NUM> <NUM> crore
        initially and later expanded to around <NUM> <NUM> crore the programme
        modernizes <NUM> existing technology centres earlier tool rooms
        technology development centres and establishes <NUM> new technology
        centres along with <NUM> extension centres across the country. these
        centres are equipped with state-of-the-art infrastructure such as cnc
        machining additive manufacturing and 3d printing robotics and automation
        laser machining electronics system design and manufacturing facilities
        advanced metrology and testing equipment. tcsp aims to enhance msme
        competitiveness by providing localized access to modern manufacturing
        technologies short- and long-term industry-oriented training skilled
        manpower development prototyping and testing services incubation and
        entrepreneurship support and advisory services. the programme also
        promotes cluster-based development through manufacturing technology
        partners and technology cluster managers supported by a centralized erp
        system for service delivery thereby improving productivity innovation
        and integration of msmes into national and global value chains. examples
        of common grievances under the tcsp include operational delays a newly
        approved technology centre in an underserved state remains partially
        functional for years due to delayed procurement of machines preventing
        msmes from accessing prototyping and testing facilities. training
        capacity constraints popular courses such as robotics or cnc programming
        at extension centres become oversubscribed resulting in eligible msme
        workers being denied admission despite early registration. weak cluster
        linkages local technology cluster managers fail to connect msmes with
        oems academic institutions or research bodies limiting the intended
        collaboration and market access benefits. erp system issues msmes face
        repeated difficulties in booking training slots advisory services or
        accessing technical specifications due to downtime or glitches in the
        centralized erp portal. infrastructure shortfalls at extension centres
        an approved extension centre lacks promised metrology or testing
        equipment restricting its ability to deliver precision measurement and
        quality assurance services to nearby msmes.
  - source_sentence: >-
      my grievance are related to yes bank ifsc code- yesb0000565 as per msme
      act and rbi policy i have open new digital kyc verified open the current
      ac in yes bank yes bank provide me ckyc no and my ac no are - <NUM> on -
      <NUM> - <NUM> - <NUM> after ac opening without my permission after few
      days bank close my ac and reply after after few days your ac are activate
      after four month bank not activate my ac and return my ac fund available
      balance through post so i request you please order and take action against
      bank why close digital ckyc verified ac by bank without permission and i
      request you please open new current ac in your bank then i have
      transferred my old returned ac avalanche dd in your new current ac please
      take action against bank as soon as possible. thanks. issue closure of
      current account without permission and request for new account opening
      context the user is reporting that their current account was closed
      without their permission in yes bank and they are requesting action
      against the bank and opening a new current account in the complainant s
      bank to transfer the returned balance. details - account type current
      account ifsc code yesb0000565 account number <NUM> account opening date
      <NUM> - <NUM> - <NUM> bank yes bank
    sentences:
      - >-
        Technology, Quality and Institutions. Building Awareness on Intellectual
        Property Rights (IPR) for the Micro, Small & Medium Enterprises- an NMCP
        Scheme. the building awareness on intellectual property rights ipr for
        msmes scheme under the national manufacturing competitiveness programme
        nmcp is an initiative of the ministry of msme aimed at sensitizing micro
        small and medium enterprises to the importance of protecting and
        leveraging intellectual property such as patents trademarks copyrights
        industrial designs and geographical indications. the scheme supports the
        organization of awareness and sensitization programmes including
        seminars and workshops with financial assistance up to <NUM> . <NUM>
        lakh per event conducts pilot studies to assess intellectual property
        needs in specific industrial clusters with support up to <NUM> lakh and
        facilitates the establishment of intellectual property facilitation
        centres ipfcs in collaboration with technical institutions such as
        tifac. it also provides reimbursement support for ipr registration
        expenses including assistance up to <NUM> lakh for geographical
        indication registration and prescribed limits for other ip filings with
        proposals approved through a structured mechanism involving a project
        implementation committee and a steering committee. overall the scheme
        seeks to integrate ipr awareness into msme business strategy encourage
        innovation protection strengthen cluster-based ip management and enhance
        competitiveness in liberalized domestic and global markets. examples of
        common grievances under the ipr awareness scheme include reimbursement
        cap limitation an msme incurs <NUM> . <NUM> lakh in patent filing and
        prosecution costs but receives reimbursement only up to the maximum
        admissible limit leaving part of the eligible expense unsupported.
        workshop content inadequacy an ipr seminar organized for a manufacturing
        cluster lacks subject-matter experts and focuses only on basic concepts
        despite prior identification of advanced patenting and commercialization
        needs. pilot study rejection a cluster-level proposal for an ip needs
        assessment study is rejected due to documentation or data gaps even
        though it is backed by an industry association and local msmes.
        restricted ipfc access an intellectual property facilitation centre
        limits filing and advisory support only to notified clusters denying
        assistance to otherwise eligible msmes outside those clusters.
        processing delays approved reimbursement or support for geographical
        indication registration remains pending for several months after
        committee approval affecting timely protection and commercialization of
        the product.
      - >-
        UAM/Udyam Registration/Certificate related issues. Updation of Email
        ID/Mobile No. Linked to UDYAM Certificate. this category includes
        grievances related to updating or correcting the email id or mobile
        number associated with an existing udyam registration. contact details
        provided during registration are used for communication verification and
        authentication when accessing the enterprise profile on the portal. if
        these contact details become outdated incorrect or inaccessible the
        enterprise owner may face difficulty receiving otps accessing the portal
        or managing the registration information. common grievances under this
        category include requests to change the registered mobile number or
        email address because the original number is no longer active the sim
        card has been lost the email account is no longer accessible or the
        contact details were entered incorrectly during registration. some
        complaints arise when the registered contact details belong to an
        employee or consultant who is no longer associated with the enterprise
        preventing the current owner from receiving verification messages. in
        other cases entrepreneurs report that they cannot update contact details
        because the system requires authentication through the old mobile number
        or email which they no longer have access to. these grievances are
        typically raised by msme owners proprietors partners directors of
        companies or authorized representatives responsible for managing
        business registrations. small business owners who registered their
        enterprise personally may request updates when their phone number or
        email changes. in some cases accountants consultants or administrative
        staff handling compliance activities may also submit grievances when
        they cannot access the registration due to outdated contact details.
        this category therefore represents issues related specifically to
        correcting or updating communication details associated with an existing
        udyam certificate.
      - >-
        Starter, Credit and Finance. Any other banking issue. this category
        encompasses grievances related to banking operations conduct or services
        affecting micro small and medium enterprises msmes where the issue
        involves a bank or financial institution but does not fall under a
        specific loan product credit guarantee scheme or named government
        scheme. the category s purpose is to capture operational and
        service-level banking failures that directly disrupt business
        functioning working capital flow compliance obligations or financial
        credibility of msmes. scope of the category the category covers a wide
        range of issues including account restrictions and freezing msme bank
        accounts being frozen restricted or partially disabled due to kyc
        re-verification identity mismatches internal compliance flags alleged
        suspicious activity account restrictions despite compliance account
        restrictions continuing despite submission of required documents or
        compliance with bank instructions bank charges fees and penalties
        disputes related to bank charges fees and penalties applied without
        transparency or contrary to agreed account terms including unexpected
        minimum balance penalties annual maintenance charges on zero-balance
        accounts excessive transaction fees penalties levied without prior
        intimation and subsequently refused for reversal despite documented
        eligibility incorrect credit reporting and credit rating impacts
        grievances related to incorrect credit reporting and credit rating
        impacts including inaccurate reporting of defaults failure to update
        closed or repaid loans delay or refusal to correct cibil or credit
        bureau records wrongly classifying accounts as non-performing assets
        npas downgrading credit ratings due to internal errors bank
  - source_sentence: >-
      recently we registered ourselves for udyam registration but after scanning
      the qr code verification is failing. issue qr code verification failure
      for udyam registration context the user is reporting that qr code
      verification is failing after registering for udyam registration. details
      - registration type udyam registration verification issue qr code
      verification failure
    sentences:
      - >-
        UAM/Udyam Registration/Certificate related issues. Updation of Address
        Details. this category refers to grievances related to updating or
        correcting the registered address of an enterprise in the udyam
        registration records. the registered address represents the official
        location of the enterprise and accurate address information is important
        for maintaining correct business records and identifying the location of
        operations. grievances under this category commonly occur when the
        enterprise has changed its business location after registration or when
        the address recorded during registration was entered incorrectly.
        entrepreneurs may request an update if the business has shifted to a new
        office factory or operational location. some grievances arise when the
        address contains typographical errors incomplete information or
        mismatched details that do not correspond with supporting documents. in
        other cases users may report that they are unable to modify the address
        details through the portal or that the changes they submitted have not
        been updated in the registration record. these grievances are usually
        raised by msme owners proprietors partners of partnership firms
        directors of companies or authorized representatives responsible for
        maintaining enterprise records. small business owners who relocate their
        operations to a different city or premises may request updates to ensure
        the registration reflects the correct address. similarly compliance
        officers accountants or consultants managing regulatory documentation
        for the enterprise may submit grievances when they identify
        discrepancies between the registered address and the actual business
        location.
      - >-
        Policy and Schemes. DBT / IT desk including Annual Report. dbt it desk
        including the annual report in msme refers to the data dbt wing
        functioning under the office of the development commissioner msme which
        is responsible for administering direct benefit transfer dbt of
        subsidies under msme schemes managing it and digital infrastructure and
        compiling the ministry s annual report. the wing oversees end-to-end dbt
        processes for scheme reimbursements such as ict and cloud computing
        subsidies where msmes initially incur eligible expenses and subsequently
        receive reimbursements directly into aadhaar-linked bank accounts
        through the public financial management system often after technical
        verification by agencies like telecommunications consultants india
        limited. it ensures compliance with national dbt standards in
        coordination with the dbt mission and national informatics centre
        maintains and upgrades msme it systems including the udyam registration
        portal supports cloud-based it adoption for msmes undertakes data
        analytics and mis reporting and onboards schemes to the national dbt
        framework. the wing also prepares the annual report of the ministry of
        msme consolidating performance indicators financial outlays scheme
        outcomes udyam registration trends and macro-level contributions such as
        msme share in gdp and employment which are used for parliament cabinet
        briefings and policy evaluation. while this framework promotes
        transparency leak-proof subsidy delivery evidence-based policymaking and
        digital efficiency stakeholders frequently raise grievances related to
        dbt execution data accuracy it reliability and reporting quality.
        examples of grievances include msmes experiencing delays in receipt of
        approved ict or cloud service subsidies due to pfms transaction or
        verification glitches reimbursement failures arising from aadhaar bank
        account linkage mismatches despite valid udyam registration inaccuracies
        or under-reporting of scheme achievements udyam registrations or msme
        gdp contribution in the annual report affecting policy advocacy and
        planning temporary downtime or access issues on udyam or other msme it
        portals during registration or subsidy claim periods and gaps in mis
        capture where scheme data duplications or leakages are not properly
        reflected in dbt dashboards or the annual report prompting appeals for
        correction and system strengthening.
      - >-
        UAM/Udyam Registration/Certificate related issues. QR Code Printed on
        UDYAM Certificate Not Readable (Technical). this category includes
        grievances related to qr codes printed on the udyam certificate that
        cannot be scanned or read properly. the qr code is intended to allow
        quick verification of the certificate and its associated enterprise
        information. if the qr code cannot be scanned users may face difficulty
        verifying the certificate or sharing it for official purposes.
        grievances under this category typically involve situations where the qr
        code on the downloaded or printed certificate appears blurred distorted
        or unresponsive when scanned with a qr reader. some users report that
        the qr code does not open any verification page after scanning while
        others find that the scanning application fails to recognize the code at
        all. these issues may arise due to errors during certificate generation
        problems with the downloaded file or printing-related distortions that
        make the qr code unreadable. these grievances are generally raised by
        msme owners proprietors partners directors or authorized representatives
        who use the udyam certificate as official documentation for their
        enterprise. small business owners who attempt to share the certificate
        for verification purposes may discover that the qr code is not
        functioning correctly. consultants accountants or administrative staff
        responsible for maintaining business documentation may also submit
        grievances when they identify that the qr code on the certificate cannot
        be scanned or verified.
  - source_sentence: >-
      respectrd sir i have registered person of pm vishwakarma scheme as a broom
      maker i have also complited the trannung of broom maker at bhusawal pm
      vishwakarma center before three month ago sir but the tool kit of the pm
      vishwakarma scheme rs <NUM> - has not given or got to me so please your
      request to given of scheme benefit as a tool kit and loan ammount of <NUM>
      - rs issue non-receipt of tool kit and loan under pm vishwakarma scheme
      context the user is reporting non-receipt of the tool kit worth rs <NUM>
      <NUM> and loan amount of rs <NUM> <NUM> under the pm vishwakarma scheme
      despite completing the training three months ago. details - scheme pm
      vishwakarma scheme occupation broom maker training location bhusawal pm
      vishwakarma center amount requested rs <NUM> tool kit rs <NUM> loan
    sentences:
      - >-
        UAM/Udyam Registration/Certificate related issues. Migration from UAM to
        UDYAM. this category refers to grievances related to the migration of
        enterprises registered under the earlier udyog aadhaar memorandum uam
        system to the current udyam registration system. the uam registration
        system was used earlier for msme registration but enterprises registered
        under that system were required to migrate their registration details to
        the newer udyam portal to maintain updated records. during this
        migration process some enterprises encounter difficulties in
        transferring or verifying their existing registration details.
        grievances under this category typically include issues where business
        owners are unable to complete the migration process from uam to udyam
        due to errors or system restrictions. entrepreneurs may report that
        their uam number is not being recognized by the portal or that the
        migration process stops due to validation errors related to aadhaar pan
        or enterprise details. some users also experience problems when the
        migrated information does not match the original uam registration
        resulting in incomplete or incorrect enterprise records. other
        grievances may involve duplicate records being generated during
        migration or difficulty accessing the migrated certificate after the
        process is completed. these grievances are usually raised by existing
        msme owners who had previously registered their enterprises under the
        uam system and are attempting to migrate their information to the udyam
        portal. the complainants may include proprietors partners of partnership
        firms directors of companies or authorized representatives responsible
        for regulatory compliance of the enterprise. in many cases small
        business owners who registered their enterprises several years ago under
        uam may attempt migration when they need updated msme documentation and
        they raise grievances if the migration process cannot be completed
        successfully.
      - >-
        Policy and Schemes. PM Vishwakarma. the pm vishwakarma category
        encompasses the registration skill certification and benefit disbursal
        processes for artisans and craftspeople. the system aims to provide easy
        registration skill certification toolkit incentives credit support and
        strong market linkage. however operational issues eligibility
        interpretation challenges and bank coordination failures lead to
        breakdowns at the stages of registration certification benefit disbursal
        and bank linkage. common grievance scenarios registration stuck at
        pending verification applicants may experience delays in the
        registration process with applications remaining stuck at pending
        verification for <NUM> days without any response from the local officer.
        aadhaar-based registration failures aadhaar-based registration may fail
        due to occupation mismatch despite the individual being a traditional
        carpenter for <NUM> years. non-receipt of toolkit incentives artisans
        and craftspeople may not receive the toolkit incentive despite
        completing skill training and assessment. bank refusal of pm vishwakarma
        loans banks may refuse to provide pm vishwakarma loans due to unclear
        scheme guidelines. incorrect trade listing trades eligible under the
        scheme may not be listed correctly in the portal s dropdown options.
        operational procedural policy and institutional causes operational
      - >-
        Policy and Schemes. Related to GST. this category encompasses grievances
        related to operational and procedural frictions under the goods and
        services tax gst framework that directly affect micro small and medium
        enterprises msmes cash flow invoicing and day-to-day business
        continuity. the category includes the following subcategories <NUM> .
        gst registration issues applications remaining pending verification
        pan-gst name mismatches leading to rejection confusion arising during
        migration from uam udyam-linked records to gst rejection of registration
        due to pan and gst name mismatch non-response from portal support <NUM>
        . gst refund delays eligible refunds especially export-related input tax
        credit not disbursed within reasonable timelines despite correct filings
        refund status shows processed without actual credit due to backend
        mismatches delayed disbursement of input tax credit refunds for export
        sales refund status shows processed but no amount has been credited due
        to backend mismatch <NUM> . input tax credit itc blockages credits not
        reflecting because supplier invoices are missing on the portal invoices
        being wrongly flagged as ineligible itc reversals triggered by hsn
        mismatches or delayed supplier compliance supplier invoices not
        reflecting on the gst portal forcing msmes to pay tax from their own
        funds the category primarily captures operational rather than legal
        grievances. while champions does not adjudicate tax disputes it acts as
        an escalation and coordination channel with gstn or relevant tax
        authorities to resolve delays portal errors and process breakdowns
        impacting msmes. the purpose of this category is to address the
        following - resolve gst registration issues
  - source_sentence: >-
      the msme portal software keeps crashing during udyam registration renewal
      and scheme applications with error messages and failed uploads every time
      i try. support team gives no help and i can t access my digital
      certificates or track status. this software glitch blocks my business from
      government benefits and loans. please fix the bugs improve server speed
      and add better error guides right away. issue software glitch in msme
      portal during udyam registration renewal and scheme applications context
      the user is reporting frequent crashes of the msme portal software during
      udyam registration renewal and scheme applications resulting in failed
      uploads error messages and inability to access digital certificates or
      track status which is hindering business access to government benefits and
      loans. details - software msme portal software issue frequent crashes
      during udyam registration renewal and scheme applications error messages
      failed uploads and error messages impact inability to access digital
      certificates track status and access government benefits and loans
    sentences:
      - >-
        Starter, Credit and Finance. Loans from Banks. this category loans from
        banks encompasses grievances related to access to credit from banks
        where micro small and medium enterprises msmes have applied for loans
        and the bottleneck lies at the bank level. the scope of this category
        includes issues involving commercial banks regional rural banks rrbs and
        cooperative banks. it specifically addresses situations where the
        problem is neither related to rbi policy government scheme design nor
        buyer default but arises from bank-side processing handling or
        decision-making of loan applications. the category captures the
        following scenarios - msmes have submitted loan applications along with
        required documentation complied with bank procedures and followed up
        through branches or portals but the application remains pending without
        a formal decision. - banks keep applications under prolonged under
        process or pending for verification status without issuing deficiency
        letters timelines or written communication. - situations involving
        repeated or circular document demands that effectively stall credit
        access. - grievances where branch-level offices do not forward eligible
        loan applications to regional or head offices. - delays in internal
        approvals. - avoidance of issuing a clear sanction or rejection decision
        despite prolonged engagement. these cases reflect administrative
        stalling rather than informed credit rejection based on risk or
        eligibility. the category includes the following example issues - i
        applied for a term loan under the msme category and submitted all
        documents but the bank has kept the application under process for
        several months without any written update. - my loan application status
        has been showing pending for verification on the bank portal for over
        <NUM> days with no deficiency letter issued. - the bank is repeatedly
        asking for documents that were already submitted causing unnecessary
        delay in loan processing. - the branch is not forwarding
      - >-
        Technology, Quality and Institutions. Related to NSIC. this category
        encompasses grievances related to the support and facilitation services
        provided by the national small industries corporation nsic to micro
        small and medium enterprises msmes . the scope of this category includes
        issues arising from the areas of raw material assistance market access
        and risk mitigation through guarantees. specifically it covers
        situations where approved raw material assistance is not released on
        time supplier coordination fails after nsic approval material supplied
        through nsic is delayed or does not meet specifications or documentation
        and regional office processes stall procurement. the category also
        captures failures in marketing support including - delayed or missing
        inclusion in tenders gem or psu vendor listings - late communication of
        bid opportunities - problems in nsic-sponsored exhibitions or
        buyer-connect programs additionally it includes issues related to
        performance and emd guarantees such as - delayed issuance - incorrect
        formats - non-renewal despite payment - rejection by psus - lack of
        response when guarantees are invoked these grievances typically result
        in missed orders blocked working capital contract delays or loss of
        business credibility and arise from execution coordination or service
        delivery breakdowns rather than policy interpretation. the category is
        further divided into the following subcategories <NUM> . corporate
        communication single point registration scheme and exhibition consortia
        and tender marketing <NUM> . internal audit and law recovery <NUM> .
        human resource <NUM> . vigilance law recovery <NUM> . international
        cooperation <NUM> . bank guarantee monitoring <NUM> . finance accounts
        <NUM> . national sc st hub <NUM> . chief vigilance officer <NUM> .
        contract procurement grievance officer <NUM> . digital services
        facilitation and training <NUM> .space marketing cell event management
        cell <NUM> .raw material assistance bank guarantee bill discounting bank
        tieup csr administration <NUM> .technology liaison officer for sc st pwd
        cmr <NUM> .epf trust superannuation pension trust <NUM> .center public
        information officers cpio <NUM> .company secretary
      - >-
        Technology, Quality and Institutions. Software Related. software-related
        initiatives for msmes mainly center on the digital msme scheme under the
        national manufacturing competitiveness programme which promotes adoption
        of information and communication technologies through cloud-based erp
        crm and accounting software to digitalize day-to-day business
        operations. the scheme combines awareness workshops needs assessment and
        financial support in the form of subsidies covering about <NUM> <NUM> of
        eligible costs subject to a ceiling of <NUM> lakh over two years
        specifically targeting micro and small enterprises. these initiatives
        are reinforced by complementary efforts such as software-enabled
        facilities under technology centre programmes for electronics and esdm
        sectors digital quality and process parameters under zed certification
        and software-focused modules within entrepreneurship and skill
        development programmes. together these measures aim to standardize
        workflows automate inventory finance and customer management reduce
        operational inefficiencies and inventory holding support online sales
        and compliance and enhance overall competitiveness without requiring
        heavy upfront investment in hardware. examples of grievances include
        subsidy denial an msme implementing a cloud-based erp costing <NUM> .
        <NUM> lakh receives no reimbursement beyond the <NUM> lakh cap despite
        meeting all eligibility conditions. software ineligibility a cloud
        application selected after needs assessment is later rejected as
        non-standard or non-approved forcing the enterprise to abandon or
        restart implementation mid-way. inadequate training awareness workshops
        focus only on theoretical benefits of digitalization and fail to provide
        hands-on demonstrations or practical guidance on using erp or crm
        software. post-subsidy continuity issue after the two-year subsidized
        period ends steep renewal or subscription costs make the software
        unaffordable disrupting business operations. needs mismatch an msme
        assessed for crm requirements is instead provided accounting software
        limiting the usefulness of the digital intervention and affecting
        adoption outcomes.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
  - pearson_cosine
  - spearman_cosine
model-index:
  - name: SentenceTransformer based on BAAI/bge-base-en-v1.5
    results:
      - task:
          type: semantic-similarity
          name: Semantic Similarity
        dataset:
          name: Unknown
          type: unknown
        metrics:
          - type: pearson_cosine
            value: .nan
            name: Pearson Cosine
          - type: spearman_cosine
            value: .nan
            name: Spearman Cosine

SentenceTransformer based on BAAI/bge-base-en-v1.5

This is a sentence-transformers model finetuned from BAAI/bge-base-en-v1.5. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.

Model Details

Model Description

  • Model Type: Sentence Transformer
  • Base model: BAAI/bge-base-en-v1.5
  • Maximum Sequence Length: 256 tokens
  • Output Dimensionality: 768 dimensions
  • Similarity Function: Cosine Similarity

Model Sources

Full Model Architecture

SentenceTransformer(
  (0): Transformer({'max_seq_length': 256, 'do_lower_case': True, 'architecture': 'BertModel'})
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)

Usage

Direct Usage (Sentence Transformers)

First install the Sentence Transformers library:

pip install -U sentence-transformers

Then you can load this model and run inference.

from sentence_transformers import SentenceTransformer

# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
    'the msme portal software keeps crashing during udyam registration renewal and scheme applications with error messages and failed uploads every time i try. support team gives no help and i can t access my digital certificates or track status. this software glitch blocks my business from government benefits and loans. please fix the bugs improve server speed and add better error guides right away. issue software glitch in msme portal during udyam registration renewal and scheme applications context the user is reporting frequent crashes of the msme portal software during udyam registration renewal and scheme applications resulting in failed uploads error messages and inability to access digital certificates or track status which is hindering business access to government benefits and loans. details - software msme portal software issue frequent crashes during udyam registration renewal and scheme applications error messages failed uploads and error messages impact inability to access digital certificates track status and access government benefits and loans',
    'Technology, Quality and Institutions. Software Related. software-related initiatives for msmes mainly center on the digital msme scheme under the national manufacturing competitiveness programme which promotes adoption of information and communication technologies through cloud-based erp crm and accounting software to digitalize day-to-day business operations. the scheme combines awareness workshops needs assessment and financial support in the form of subsidies covering about <NUM> <NUM> of eligible costs subject to a ceiling of <NUM> lakh over two years specifically targeting micro and small enterprises. these initiatives are reinforced by complementary efforts such as software-enabled facilities under technology centre programmes for electronics and esdm sectors digital quality and process parameters under zed certification and software-focused modules within entrepreneurship and skill development programmes. together these measures aim to standardize workflows automate inventory finance and customer management reduce operational inefficiencies and inventory holding support online sales and compliance and enhance overall competitiveness without requiring heavy upfront investment in hardware. examples of grievances include subsidy denial an msme implementing a cloud-based erp costing <NUM> . <NUM> lakh receives no reimbursement beyond the <NUM> lakh cap despite meeting all eligibility conditions. software ineligibility a cloud application selected after needs assessment is later rejected as non-standard or non-approved forcing the enterprise to abandon or restart implementation mid-way. inadequate training awareness workshops focus only on theoretical benefits of digitalization and fail to provide hands-on demonstrations or practical guidance on using erp or crm software. post-subsidy continuity issue after the two-year subsidized period ends steep renewal or subscription costs make the software unaffordable disrupting business operations. needs mismatch an msme assessed for crm requirements is instead provided accounting software limiting the usefulness of the digital intervention and affecting adoption outcomes.',
    'Technology, Quality and Institutions. Related to NSIC. this category encompasses grievances related to the support and facilitation services provided by the national small industries corporation nsic to micro small and medium enterprises msmes . the scope of this category includes issues arising from the areas of raw material assistance market access and risk mitigation through guarantees. specifically it covers situations where approved raw material assistance is not released on time supplier coordination fails after nsic approval material supplied through nsic is delayed or does not meet specifications or documentation and regional office processes stall procurement. the category also captures failures in marketing support including - delayed or missing inclusion in tenders gem or psu vendor listings - late communication of bid opportunities - problems in nsic-sponsored exhibitions or buyer-connect programs additionally it includes issues related to performance and emd guarantees such as - delayed issuance - incorrect formats - non-renewal despite payment - rejection by psus - lack of response when guarantees are invoked these grievances typically result in missed orders blocked working capital contract delays or loss of business credibility and arise from execution coordination or service delivery breakdowns rather than policy interpretation. the category is further divided into the following subcategories <NUM> . corporate communication single point registration scheme and exhibition consortia and tender marketing <NUM> . internal audit and law recovery <NUM> . human resource <NUM> . vigilance law recovery <NUM> . international cooperation <NUM> . bank guarantee monitoring <NUM> . finance accounts <NUM> . national sc st hub <NUM> . chief vigilance officer <NUM> . contract procurement grievance officer <NUM> . digital services facilitation and training <NUM> .space marketing cell event management cell <NUM> .raw material assistance bank guarantee bill discounting bank tieup csr administration <NUM> .technology liaison officer for sc st pwd cmr <NUM> .epf trust superannuation pension trust <NUM> .center public information officers cpio <NUM> .company secretary',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]

# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities)
# tensor([[1.0000, 0.6265, 0.5981],
#         [0.6265, 1.0000, 0.7013],
#         [0.5981, 0.7013, 1.0000]])

Evaluation

Metrics

Semantic Similarity

Metric Value
pearson_cosine nan
spearman_cosine nan

Training Details

Training Dataset

Unnamed Dataset

  • Size: 88 training samples
  • Columns: sentence_0 and sentence_1
  • Approximate statistics based on the first 88 samples:
    sentence_0 sentence_1
    type string string
    details
    • min: 46 tokens
    • mean: 172.95 tokens
    • max: 256 tokens
    • min: 199 tokens
    • mean: 253.88 tokens
    • max: 256 tokens
  • Samples:
    sentence_0 sentence_1
    with due respect i md mafijul husen would like to intimate that when i trying to edit my existing udyam registration certificate certificate of my enterprise viz. md mafijul husen then i failed to enter otp as my earlier mobile number has been changed and the given gmail id is also inactive. hence it is my request to change my mobile number so that i can edit my existing udyam registration certificate. my pan no is and aadhaar number is . issue update of mobile number and gmail id for udyam registration certificate editing context the user is requesting an update of the mobile number and gmail id associated with the existing udyam registration certificate udyam-wb- - to facilitate editing of the certificate. details - udyam registration certificate no udyam-wb- - old mobile no old gmail id inactive pan no aetph0941n aadhar no UAM/Udyam Registration/Certificate related issues. Updation of Email ID/Mobile No. Linked to UDYAM Certificate. this category includes grievances related to updating or correcting the email id or mobile number associated with an existing udyam registration. contact details provided during registration are used for communication verification and authentication when accessing the enterprise profile on the portal. if these contact details become outdated incorrect or inaccessible the enterprise owner may face difficulty receiving otps accessing the portal or managing the registration information. common grievances under this category include requests to change the registered mobile number or email address because the original number is no longer active the sim card has been lost the email account is no longer accessible or the contact details were entered incorrectly during registration. some complaints arise when the registered contact details belong to an employee or consultant who is n...
    we had applied for msme registration under the application number m on 22nd march . after reviewing the status and considering our circumstances we kindly request that our case be transferred to the micro and small enterprises facilitation council msefc for further processing and resolution. we believe that the msefc councils intervention will help address any concerns or disputes that may have arisen regarding our application. we are hopeful that this request will be processed swiftly and in accordance with the necessary regulations. thank you for your attention to this matter. we look forward to your prompt assistance in facilitating this request. issue request for transfer of msme registration case to msefc context the user is requesting to transfer their msme registration case to the micro and small enterprises facilitation council msefc for further processing and resolution. details - application number udyam-dl- - m application date 22nd m... Technology, Quality and Institutions. Related to NI-MSME. this category encompasses grievances related to training capacity-building and certification programs administered by the national institute for micro small and medium enterprises ni-msme for micro small and medium enterprises msmes entrepreneurs and their employees. the scope of this category includes issues arising from the delivery of training programs such as repeatedly postponed schedules without prior notification inaccessible online training portals unclear eligibility criteria unavailable trainers insufficient mentoring outdated or non-practical course content additionally this category captures certification-related issues including delayed issuance of certificates certificates issued with incorrect details difficulty verifying certificates online failure to deliver certificates after course completion furthermore the category includes course enrollment and admission disputes such as unjustified rejection of enrollment ...
    insurancy company national insurance company limited branch name of insurance company branch if other khamgaon branch date of application - - policy number my claim is kept pending even after submitting all the documents after changing all the requirements as changed by various surveyors. issue delayed insurance claim under national insurance company limited context the user is reporting that the insurance claim submitted on - - with policy number is still pending despite submission of all required documents as per changes made by various surveyors. details - policy number claim submission date - - branch khamgaon Starter, Credit and Finance. Insurance Claim related issues. this category encompasses grievances related to insurance claims associated with various government-backed and private insurance products. the scope includes . esic employees state insurance corporation insurance benefits . epfo employees provident fund organisation -linked insurance benefits including edli employees deposit linked insurance . cgtmse credit guarantee fund trust for micro and small enterprises -linked insurance elements . private or general business insurance products where a government department psu public sector undertaking or bank acts as an intermediary or implementing authority the category covers a range of issues including opaque rejection decisions undocumented policy exclusions administrative closure without explanation shifting of risk and liability onto msmes micro small and medium enterprises or employees document and data mismatches across multiple systems such as aadhaar ...
  • Loss: CachedMultipleNegativesRankingLoss with these parameters:
    {
        "scale": 20.0,
        "similarity_fct": "cos_sim",
        "mini_batch_size": 32,
        "gather_across_devices": false
    }
    

Training Hyperparameters

Non-Default Hyperparameters

  • per_device_train_batch_size: 64
  • per_device_eval_batch_size: 64
  • num_train_epochs: 6
  • fp16: True
  • multi_dataset_batch_sampler: round_robin

All Hyperparameters

Click to expand
  • do_predict: False
  • eval_strategy: no
  • prediction_loss_only: True
  • per_device_train_batch_size: 64
  • per_device_eval_batch_size: 64
  • gradient_accumulation_steps: 1
  • eval_accumulation_steps: None
  • torch_empty_cache_steps: None
  • learning_rate: 5e-05
  • weight_decay: 0.0
  • adam_beta1: 0.9
  • adam_beta2: 0.999
  • adam_epsilon: 1e-08
  • max_grad_norm: 1
  • num_train_epochs: 6
  • max_steps: -1
  • lr_scheduler_type: linear
  • lr_scheduler_kwargs: None
  • warmup_ratio: None
  • warmup_steps: 0
  • log_level: passive
  • log_level_replica: warning
  • log_on_each_node: True
  • logging_nan_inf_filter: True
  • enable_jit_checkpoint: False
  • save_on_each_node: False
  • save_only_model: False
  • restore_callback_states_from_checkpoint: False
  • use_cpu: False
  • seed: 42
  • data_seed: None
  • bf16: False
  • fp16: True
  • bf16_full_eval: False
  • fp16_full_eval: False
  • tf32: None
  • local_rank: -1
  • ddp_backend: None
  • debug: []
  • dataloader_drop_last: False
  • dataloader_num_workers: 0
  • dataloader_prefetch_factor: None
  • disable_tqdm: False
  • remove_unused_columns: True
  • label_names: None
  • load_best_model_at_end: False
  • ignore_data_skip: False
  • fsdp: []
  • fsdp_config: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
  • accelerator_config: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
  • parallelism_config: None
  • deepspeed: None
  • label_smoothing_factor: 0.0
  • optim: adamw_torch_fused
  • optim_args: None
  • group_by_length: False
  • length_column_name: length
  • project: huggingface
  • trackio_space_id: trackio
  • ddp_find_unused_parameters: None
  • ddp_bucket_cap_mb: None
  • ddp_broadcast_buffers: False
  • dataloader_pin_memory: True
  • dataloader_persistent_workers: False
  • skip_memory_metrics: True
  • push_to_hub: False
  • resume_from_checkpoint: None
  • hub_model_id: None
  • hub_strategy: every_save
  • hub_private_repo: None
  • hub_always_push: False
  • hub_revision: None
  • gradient_checkpointing: False
  • gradient_checkpointing_kwargs: None
  • include_for_metrics: []
  • eval_do_concat_batches: True
  • auto_find_batch_size: False
  • full_determinism: False
  • ddp_timeout: 1800
  • torch_compile: False
  • torch_compile_backend: None
  • torch_compile_mode: None
  • include_num_input_tokens_seen: no
  • neftune_noise_alpha: None
  • optim_target_modules: None
  • batch_eval_metrics: False
  • eval_on_start: False
  • use_liger_kernel: False
  • liger_kernel_config: None
  • eval_use_gather_object: False
  • average_tokens_across_devices: True
  • use_cache: False
  • prompts: None
  • batch_sampler: batch_sampler
  • multi_dataset_batch_sampler: round_robin
  • router_mapping: {}
  • learning_rate_mapping: {}

Training Logs

Epoch Step spearman_cosine
1.0 2 nan
2.0 4 nan
3.0 6 nan
4.0 8 nan
5.0 10 nan
6.0 12 nan

Framework Versions

  • Python: 3.12.12
  • Sentence Transformers: 5.2.3
  • Transformers: 5.0.0
  • PyTorch: 2.10.0+cu128
  • Accelerate: 1.12.0
  • Datasets: 4.0.0
  • Tokenizers: 0.22.2

Citation

BibTeX

Sentence Transformers

@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}

CachedMultipleNegativesRankingLoss

@misc{gao2021scaling,
    title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup},
    author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan},
    year={2021},
    eprint={2101.06983},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}