deepset
/

gbert-base-germandpr-reranking

Text Classification

text-embeddings-inference

Model card Files Files and versions

julianrisch commited on Jun 4, 2021

Commit

04c190f

·

1 Parent(s): 3bda274

Update README.md

Files changed (1) hide show

README.md +11 -15

README.md CHANGED Viewed

@@ -26,29 +26,25 @@ lr_schedule = LinearWarmup
 embeds_dropout_prob = 0.1
 ```
 ## Performance
-We use the GermanDPR test dataset as ground truth labels and run two experiments to compare how a BM25 retriever performs with or without reranking with our model. The first experiment runs retrieval on the full German Wikipedia (>2million passages) and second experiment runs retrieval on the GermanDPR dataset only (<5000 passages). Both experiments use 1025 queries. Note that the second experiment is evaluating on a much simpler task because of the smaller dataset size, which explains strong BM25 retrieval performance.
-Full German Wikipedia:
 BM25 Retriever without Reranking
------------------
-recall@3: 0.4088 (419 / 1025)
-mean_reciprocal_rank@3: 0.3322
 BM25 Retriever with Reranking Top 10 Documents
------------------
-recall@3: 0.5200 (533 / 1025)
-mean_reciprocal_rank@3: 0.4800
-Germandpr only:
 BM25 Retriever without Reranking
------------------
-recall@3: 0.9102 (933 / 1025)
-mean_reciprocal_rank@3: 0.8528
 BM25 Retriever with Reranking Top 10 Documents
------------------
-recall@3: 0.9298 (953 / 1025)
-mean_reciprocal_rank@3: 0.8813

 embeds_dropout_prob = 0.1
 ```
 ## Performance
+We use the GermanDPR test dataset as ground truth labels and run two experiments to compare how a BM25 retriever performs with or without reranking with our model. The first experiment runs retrieval on the full German Wikipedia (more than 2 million passages) and second experiment runs retrieval on the GermanDPR dataset only (not more than 5000 passages). Both experiments use 1025 queries. Note that the second experiment is evaluating on a much simpler task because of the smaller dataset size, which explains strong BM25 retrieval performance.
+### Full German Wikipedia (more than 2 million passages):
 BM25 Retriever without Reranking
+- recall@3: 0.4088 (419 / 1025)
+- mean_reciprocal_rank@3: 0.3322
 BM25 Retriever with Reranking Top 10 Documents
+- recall@3: 0.5200 (533 / 1025)
+- mean_reciprocal_rank@3: 0.4800
+### GermanDPR Dataset only (not more than 5000 passages):
 BM25 Retriever without Reranking
+- recall@3: 0.9102 (933 / 1025)
+- mean_reciprocal_rank@3: 0.8528
 BM25 Retriever with Reranking Top 10 Documents
+- recall@3: 0.9298 (953 / 1025)
+- mean_reciprocal_rank@3: 0.8813