Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

tomaarsen
/
inference-free-splade-bert-tiny-nq-3e-3-lambda-corpus

Feature Extraction
sentence-transformers
Safetensors
English
sparse-encoder
sparse
asymmetric
inference-free
splade
Generated from Trainer
dataset_size:99000
loss:SpladeLoss
Eval Results (legacy)
Model card Files Files and versions
xet
Community

Instructions to use tomaarsen/inference-free-splade-bert-tiny-nq-3e-3-lambda-corpus with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • sentence-transformers

    How to use tomaarsen/inference-free-splade-bert-tiny-nq-3e-3-lambda-corpus with sentence-transformers:

    from sentence_transformers import SentenceTransformer
    
    model = SentenceTransformer("tomaarsen/inference-free-splade-bert-tiny-nq-3e-3-lambda-corpus")
    
    sentences = [
        "where is the tiber river located in italy",
        "Sales taxes in British Columbia On 1 July 2010, the PST and GST were combined into the Harmonized Sales Tax (HST) levied according to the provisions of the GST. The conversion to HST was controversial. Popular opposition led to a referendum on the tax system, the first such referendum in the Commonwealth of Nations, resulting in the province reverting to the former PST/GST model on 1 April 2013.",
        "Tiber The Tiber (/ˈtaɪbər/, Latin: Tiberis,[1] Italian: Tevere [ˈteːvere])[2] is the third-longest river in Italy, rising in the Apennine Mountains in Emilia-Romagna and flowing 406 kilometres (252 mi) through Tuscany, Umbria and Lazio, where it is joined by the river Aniene, to the Tyrrhenian Sea, between Ostia and Fiumicino.[3] It drains a basin estimated at 17,375 square kilometres (6,709 sq mi). The river has achieved lasting fame as the main watercourse of the city of Rome, founded on its eastern banks.",
        "Water in California California's limited water supply comes from two main sources: surface water, or water that travels or gathers on the ground, like rivers, streams, and lakes; and groundwater, which is water that is pumped out from the ground. California has also begun producing a small amount of desalinated water, water that was once sea water, but has been purified."
    ]
    embeddings = model.encode(sentences)
    
    similarities = model.similarity(embeddings, embeddings)
    print(similarities.shape)
    # [4, 4]
  • Notebooks
  • Google Colab
  • Kaggle
inference-free-splade-bert-tiny-nq-3e-3-lambda-corpus
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
tomaarsen's picture
tomaarsen HF Staff
Add new SparseEncoder model
71467c8 verified about 1 year ago
  • corpus_0_MLMTransformer
    Add new SparseEncoder model about 1 year ago
  • corpus_1_SpladePooling
    Add new SparseEncoder model about 1 year ago
  • query_0_IDF
    Add new SparseEncoder model about 1 year ago
  • .gitattributes
    1.52 kB
    initial commit about 1 year ago
  • README.md
    48.6 kB
    Add new SparseEncoder model about 1 year ago
  • config.json
    492 Bytes
    Add new SparseEncoder model about 1 year ago
  • config_sentence_transformers.json
    250 Bytes
    Add new SparseEncoder model about 1 year ago
  • modules.json
    113 Bytes
    Add new SparseEncoder model about 1 year ago