rasyosef
/

splade-tiny

Feature Extraction

sentence-transformers

Generated from Trainer

dataset_size:1200000

loss:SpladeLoss

loss:SparseMarginMSELoss

Eval Results (legacy)

text-embeddings-inference

Model card Files Files and versions

rasyosef commited on Jul 21, 2025

Commit

34aef2d

·

verified ·

1 Parent(s): e03eb83

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -152,7 +152,7 @@ datasets:
 This is a SPLADE sparse retrieval model based on BERT-Tiny (4M) that was trained by distilling a Cross-Encoder on the MSMARCO dataset. The cross-encoder used was [ms-marco-MiniLM-L6-v2](https://huggingface.co/cross-encoder/ms-marco-MiniLM-L6-v2).
-This tiny SPLADE model is `15x` smaller than Naver's official `splade-v3-distilbert` while having `80%` of it's performance on the MSMARCO benchmark. This model is small enough to be used without a GPU on a dataset of a few thousand documents.
 - `Collection:` https://huggingface.co/collections/rasyosef/splade-tiny-msmarco-687c548c0691d95babf65b70
 - `Distillation Dataset:` https://huggingface.co/datasets/yosefw/msmarco-train-distil-v2

 This is a SPLADE sparse retrieval model based on BERT-Tiny (4M) that was trained by distilling a Cross-Encoder on the MSMARCO dataset. The cross-encoder used was [ms-marco-MiniLM-L6-v2](https://huggingface.co/cross-encoder/ms-marco-MiniLM-L6-v2).
+Theis Tiny SPLADE model beats `BM25` by `65.6%` on the MSMARCO benchmark. While this model is `15x` smaller than Naver's official `splade-v3-distilbert`, is posesses `80%` of it's performance on the MSMARCO benchmark. This model is small enough to be used without a GPU on a dataset of a few thousand documents.
 - `Collection:` https://huggingface.co/collections/rasyosef/splade-tiny-msmarco-687c548c0691d95babf65b70
 - `Distillation Dataset:` https://huggingface.co/datasets/yosefw/msmarco-train-distil-v2