Update README.md
Browse files
README.md
CHANGED
|
@@ -6,6 +6,36 @@ language:
|
|
| 6 |
- en
|
| 7 |
library_name: PyLate
|
| 8 |
pipeline_tag: sentence-similarity
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
tags:
|
| 10 |
- ColBERT
|
| 11 |
- PyLate
|
|
@@ -212,6 +242,15 @@ You can finetune this model on your own dataset.
|
|
| 212 |
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
|
| 213 |
-->
|
| 214 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 215 |
## Training Details
|
| 216 |
|
| 217 |
### Training Dataset
|
|
|
|
| 6 |
- en
|
| 7 |
library_name: PyLate
|
| 8 |
pipeline_tag: sentence-similarity
|
| 9 |
+
model-index:
|
| 10 |
+
- name: ColBERT based on answerdotai/ModernBERT-base
|
| 11 |
+
results:
|
| 12 |
+
- dataset:
|
| 13 |
+
name: FiQA
|
| 14 |
+
split: test
|
| 15 |
+
type: beir/fiqa
|
| 16 |
+
metrics:
|
| 17 |
+
- type: ndcg_at_10
|
| 18 |
+
value: 39.86
|
| 19 |
+
task:
|
| 20 |
+
type: Retrieval
|
| 21 |
+
- dataset:
|
| 22 |
+
name: SciFact
|
| 23 |
+
split: test
|
| 24 |
+
type: beir/scifact
|
| 25 |
+
metrics:
|
| 26 |
+
- type: ndcg_at_10
|
| 27 |
+
value: 73.67
|
| 28 |
+
task:
|
| 29 |
+
type: Retrieval
|
| 30 |
+
- dataset:
|
| 31 |
+
name: nfcorpus
|
| 32 |
+
split: test
|
| 33 |
+
type: beir/nfcorpus
|
| 34 |
+
metrics:
|
| 35 |
+
- type: ndcg_at_10
|
| 36 |
+
value: 33.98
|
| 37 |
+
task:
|
| 38 |
+
type: Retrieval
|
| 39 |
tags:
|
| 40 |
- ColBERT
|
| 41 |
- PyLate
|
|
|
|
| 242 |
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
|
| 243 |
-->
|
| 244 |
|
| 245 |
+
## Evaluation
|
| 246 |
+
NDCG@10
|
| 247 |
+
|
| 248 |
+
|Dataset | Score|
|
| 249 |
+
|:-------|------|
|
| 250 |
+
|FiQA | 0.3986|
|
| 251 |
+
|SciFact | 0.7367|
|
| 252 |
+
|nfcorpus | 0.3398 |
|
| 253 |
+
|
| 254 |
## Training Details
|
| 255 |
|
| 256 |
### Training Dataset
|