rasyosef
/

natural_questions_108k_splade_index

Model card Files Files and versions

rasyosef commited on Sep 16

Commit

d2bf710

·

verified ·

1 Parent(s): 28210d5

Update README.md

Files changed (1) hide show

README.md +15 -18

README.md CHANGED Viewed

@@ -1,13 +1,15 @@
----
-language: en
-library_name: splade-index
-tags:
-- splade
-- splade-index
-- retrieval
-- search
-- sparse
----
 # Splade-Index
@@ -32,20 +34,16 @@ pip install huggingface_hub
 You can use the following code to load this SPLADE index from Hugging Face hub:
 ```python
-import os
 from sentence_transformers import SparseEncoder
 from splade_index import SPLADE
 # Download the SPLADE model that was used to create the index from the HuggingFace Hub
-model_id = "the-splade-model-id" # Enter the splade model id
 model = SparseEncoder(model_id)
-# Set your huggingface token if repo is private
-token = os.environ["HF_TOKEN"]
 repo_id = "rasyosef/natural_questions_108k_splade_index"
 # Load a SPLADE index from the Hugging Face model hub
-retriever = SPLADE.load_from_hub(repo_id, model=model, token=token)
 ```
 ## Stats
@@ -56,5 +54,4 @@ This dataset was created using the following data:
 | --- | --- |
 | Number of documents | 108593 |
 | Number of tokens | 20265694 |
-| Average tokens per document | 186.62 |

+---
+language: en
+library_name: splade-index
+tags:
+- splade
+- splade-index
+- retrieval
+- search
+- sparse
+datasets:
+- yosefw/natural-questions-108k
+---
 # Splade-Index
 You can use the following code to load this SPLADE index from Hugging Face hub:
 ```python
 from sentence_transformers import SparseEncoder
 from splade_index import SPLADE
 # Download the SPLADE model that was used to create the index from the HuggingFace Hub
+model_id = "naver/splade-v3-distilbert" # the SPLADE model id
 model = SparseEncoder(model_id)
 repo_id = "rasyosef/natural_questions_108k_splade_index"
 # Load a SPLADE index from the Hugging Face model hub
+retriever = SPLADE.load_from_hub(repo_id, model=model)
 ```
 ## Stats
 | --- | --- |
 | Number of documents | 108593 |
 | Number of tokens | 20265694 |
+| Average tokens per document | 186.62 |