Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

deman539
/
nomic-embed-text-v1

Sentence Similarity
sentence-transformers
Safetensors
nomic_bert
feature-extraction
Generated from Trainer
dataset_size:2459
loss:MatryoshkaLoss
loss:MultipleNegativesRankingLoss
custom_code
Eval Results (legacy)
text-embeddings-inference
Model card Files Files and versions
xet
Community

Instructions to use deman539/nomic-embed-text-v1 with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use deman539/nomic-embed-text-v1 with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("deman539/nomic-embed-text-v1", trust_remote_code=True)
    
    sentences = [
        "What types of applications may require confidentiality during their launch?",
        "Taken together, the technical protections and practices laid out in the Blueprint for an AI Bill of Rights can help \nguard the American public against many of the potential and actual harms identified by researchers, technolo­\ngists, advocates, journalists, policymakers, and communities in the United States and around the world. This \ntechnical companion is intended to be used as a reference by people across many circumstances – anyone",
        "deactivate AI systems that demonstrate performance or outcomes inconsistent with intended use. \nAction ID \nSuggested Action \nGAI Risks \nMG-2.4-001 \nEstablish and maintain communication plans to inform AI stakeholders as part of \nthe deactivation or disengagement process of a specific GAI system (including for \nopen-source models) or context of use, including reasons, workarounds, user \naccess removal, alternative processes, contact information, etc. \nHuman-AI Configuration",
        "launch may need to be confidential. Government applications, particularly law enforcement applications or \napplications that raise national security considerations, may require confidential or limited engagement based \non system sensitivities and preexisting oversight laws and structures. Concerns raised in this consultation \nshould be documented, and the automated system developers were proposing to create, use, or deploy should \nbe reconsidered based on this feedback."
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [4, 4]
  • Notebooks
  • Google Colab
  • Kaggle
nomic-embed-text-v1
548 MB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 3 commits
deman539's picture
deman539
Update README.md
116050f verified over 1 year ago
  • 1_Pooling
    Add new SentenceTransformer model. over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    30.1 kB
    Update README.md over 1 year ago
  • config.json
    1.69 kB
    Add new SentenceTransformer model. over 1 year ago
  • config_sentence_transformers.json
    201 Bytes
    Add new SentenceTransformer model. over 1 year ago
  • model.safetensors
    547 MB
    xet
    Add new SentenceTransformer model. over 1 year ago
  • modules.json
    349 Bytes
    Add new SentenceTransformer model. over 1 year ago
  • sentence_bert_config.json
    54 Bytes
    Add new SentenceTransformer model. over 1 year ago
  • special_tokens_map.json
    695 Bytes
    Add new SentenceTransformer model. over 1 year ago
  • tokenizer.json
    712 kB
    Add new SentenceTransformer model. over 1 year ago
  • tokenizer_config.json
    1.19 kB
    Add new SentenceTransformer model. over 1 year ago
  • vocab.txt
    232 kB
    Add new SentenceTransformer model. over 1 year ago