Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

aisquared
/
bolt-embedding-small

Sentence Similarity
sentence-transformers
Safetensors
English
modernbert
feature-extraction
dense
Generated from Trainer
dataset_size:500000
loss:CachedMultipleNegativesRankingLoss
text-embeddings-inference
Model card Files Files and versions
xet
Community

Instructions to use aisquared/bolt-embedding-small with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use aisquared/bolt-embedding-small with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("aisquared/bolt-embedding-small")
    
    sentences = [
        "I'm trying to write a PHP script which reads SIP (session initiation protocol) signals from a hardware switch to gets specific details and then return some data back to the switch.\nBeing a complete newbie to this SIP thing I don't know how to interact with the switch sending SIP signal. Do we need to send some message to the switch to get response?\nI googled SIP but got only general info regarding what SIP is all about but nothing programmatic.\nCan any one provide any pointers to any tutorials which show how interact with a SIP signal programmatically?\nAre there any free online services that simulate SIP signals for testing purpose?\n",
        "Lake Okahumpka is a freshwater lake in Wildwood, Florida, United States. Lake Okahumpka Park is along part of its shoreline. In 1980, the United States Geological Survey reported on the hydrology of Lake Okahumpka and Lake Deaton area.\n\nThe lake is east of Wildwood on the south side of State Road 44. The lake has been treated for hydrilla. Ring neck ducks have been hunted from its shores.\n\nSee also\nOkahumpka, Florida\n\nReferences\n\nBodies of water of Sumter County, Florida\nOkahumpka",
        "Because of different regional setting on different machines. To have date time output in the same format you ahve to specify format string explciitly:\ndate.ToString(\"yyyy-MM-dd HH:mm:ss\");\n\nAlso as John recommeded in comments below if you want having date time output in the same format on different machines despite local regional settings you can use InvariantCulture format provider:\ndate.ToString(CultureInfo.InvariantCulture);\n\nMSDN:\n\nThe invariant culture is culture-insensitive; it is associated with\n  the English language but not with any country/region\n\nMSDN:\n\nStandard Date and Time Format Strings\nCustom Date and Time Format Strings\n\n",
        "The President of India plays a ceremonial role in foreign affairs, appointing ambassadors and ratifying treaties, but the day‑to‑day conduct of diplomacy is handled by the Ministry of External Affairs and the Prime Minister's Office."
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [4, 4]
  • Notebooks
  • Google Colab
  • Kaggle

You need to agree to share your contact information to access this model

This repository is publicly accessible, but you have to accept the conditions to access its files and content.

Log in or Sign Up to review the conditions and access this model content.

Gated model
You can list files but not access them

Preview of files found in this repository
  • 1_Pooling
    Add new SentenceTransformer model 3 months ago
  • .gitattributes
    1.52 kB
    initial commit 3 months ago
  • README.md
    17.2 kB
    Update README.md 13 days ago
  • config.json
    1.31 kB
    Add new SentenceTransformer model 3 months ago
  • config_sentence_transformers.json
    283 Bytes
    Add new SentenceTransformer model 3 months ago
  • model.safetensors
    191 MB
    xet
    Add new SentenceTransformer model 3 months ago
  • modules.json
    229 Bytes
    Add new SentenceTransformer model 3 months ago
  • sentence_bert_config.json
    58 Bytes
    Add new SentenceTransformer model 3 months ago
  • special_tokens_map.json
    694 Bytes
    Add new SentenceTransformer model 3 months ago
  • tokenizer.json
    3.58 MB
    Add new SentenceTransformer model 3 months ago
  • tokenizer_config.json
    21 kB
    Add new SentenceTransformer model 3 months ago