Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Parallia
/
Fairly-Multilingual-ModernBERT-Embed-BE-EN

Sentence Similarity
sentence-transformers
Safetensors
English
modernbert
feature-extraction
Generated from Trainer
dataset_size:8066634
loss:MultipleNegativesRankingLoss
text-embeddings-inference
Model card Files Files and versions
xet
Community
1

Instructions to use Parallia/Fairly-Multilingual-ModernBERT-Embed-BE-EN with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use Parallia/Fairly-Multilingual-ModernBERT-Embed-BE-EN with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("Parallia/Fairly-Multilingual-ModernBERT-Embed-BE-EN")
    
    sentences = [
        "These three mysterious men came to our help.",
        "Three strange guys helped us then.",
        "These three black birds came in our garden.",
        "Some people are helpful.",
        "One, two, three... Who can guess the next digits?"
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [5, 5]
  • Notebooks
  • Google Colab
  • Kaggle
Fairly-Multilingual-ModernBERT-Embed-BE-EN
600 MB
Ctrl+K
Ctrl+K
  • 2 contributors
History: 10 commits
FremyCompany's picture
FremyCompany
Update tokenizer_config.json
f6c8a30 verified over 1 year ago
  • 1_Pooling
    Create 1_Pooling/config.json over 1 year ago
  • .gitattributes
    1.52 kB
    initial commit over 1 year ago
  • README.md
    12.7 kB
    Update README.md over 1 year ago
  • config.json
    1.41 kB
    Provide pad token in the config over 1 year ago
  • config_sentence_transformers.json
    210 Bytes
    Initial upload over 1 year ago
  • model.safetensors
    596 MB
    xet
    Initial upload over 1 year ago
  • modules.json
    229 Bytes
    Initial upload over 1 year ago
  • sentence_bert_config.json
    54 Bytes
    Initial upload over 1 year ago
  • special_tokens_map.json
    694 Bytes
    Initial upload over 1 year ago
  • tokenizer.json
    3.58 MB
    Initial upload over 1 year ago
  • tokenizer_config.json
    20.8 kB
    Update tokenizer_config.json over 1 year ago