joe32140/ColModernBERT-base-msmarco-en-bge

Sentence Similarity

sentence-transformers

feature-extraction

Generated from Trainer

dataset_size:808728

loss:Distillation

text-embeddings-inference

joe32140 committed on Dec 21, 2024

Commit 7c0a9e4 · verified · 1 Parent(s): 9b611fd

Update README.md

Files changed (1)

README.md +39 -0

@@ -6,6 +6,36 @@ language:
 - en
 library_name: PyLate
 pipeline_tag: sentence-similarity
+model-index:
+- name: ColBERT based on answerdotai/ModernBERT-base
+  results:
+  - dataset:
+      name: FiQA
+      split: test
+      type: beir/fiqa
+    metrics:
+    - type: ndcg_at_10
+      value: 39.86
+    task:
+      type: Retrieval
+  - dataset:
+      name: SciFact
+      split: test
+      type: beir/scifact
+    metrics:
+    - type: ndcg_at_10
+      value: 73.67
+    task:
+      type: Retrieval
+  - dataset:
+      name: nfcorpus
+      split: test
+      type: beir/nfcorpus
+    metrics:
+    - type: ndcg_at_10
+      value: 33.98
+    task:
+      type: Retrieval
 tags:
 - ColBERT
 - PyLate
@@ -212,6 +242,15 @@ You can finetune this model on your own dataset.
 *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
 -->
+## Evaluation
+NDCG@10
+|Dataset | Score|
+|:-------|------|
+|FiQA | 0.3986|
+|SciFact | 0.7367|
+|nfcorpus | 0.3398 |
 ## Training Details
 ### Training Dataset
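
Note on the two score scales in this change: the `model-index` metadata records NDCG@10 as percentages (39.86, 73.67, 33.98), while the Evaluation table records the same figures as fractions (0.3986, 0.7367, 0.3398). The sketch below is a minimal illustration of how NDCG@10 is commonly computed for one query and how the two scales relate. It assumes linear gain with a log2 rank discount; the commit does not say which evaluation tool produced these numbers, and the helper names (`dcg_at_k`, `ndcg_at_k`, `to_percent`) are hypothetical, not part of PyLate or any other library.

```python
import math


def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the top-k relevance labels (linear gain, assumed)."""
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))


def ndcg_at_k(ranked_relevances, all_relevances, k=10):
    """NDCG@k for one query.

    ranked_relevances: relevance labels of the retrieved documents, in ranked order.
    all_relevances: relevance labels of every judged document for the query,
    used to build the ideal ranking.
    """
    ideal = dcg_at_k(sorted(all_relevances, reverse=True), k)
    return dcg_at_k(ranked_relevances, k) / ideal if ideal > 0 else 0.0


def to_percent(score):
    """Convert a fractional score (table scale) to a percentage (model-index scale)."""
    return round(score * 100, 2)


# A perfect ranking scores 1.0; pushing the only relevant document to rank 2 lowers it.
perfect = ndcg_at_k([1, 0, 0], [1, 0, 0])
swapped = ndcg_at_k([0, 1, 0], [1, 0, 0])

# The table value for FiQA on the model-index scale.
fiqa_percent = to_percent(0.3986)
```

A dataset-level score such as the ones in this commit would be the mean of the per-query values over the test split.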