Add pipeline tag, library name, link to code and copy GitHub README
This PR tags the model correctly so that people can find it under the "feature-extraction" tag. It also adds the library name, which displays an inference widget in the top right corner, and copies the content of the GitHub README to the model card.
README.md
CHANGED
@@ -1,3 +1,32 @@
----
-license: apache-2.0
----
+---
+license: apache-2.0
+pipeline_tag: feature-extraction
+library_name: transformers
+---
+
+# Set-Encoder
+
+This repository contains the code for the paper: [`Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders`](https://arxiv.org/abs/2404.06912).
+
+We use [`lightning-ir`](https://github.com/webis-de/lightning-ir) to train and fine-tune models. Download and install the library to use the code in this repository.
+
+## Model Zoo
+
+We provide the following pre-trained models:
+
+| Model Name | TREC DL 19 (BM25) | TREC DL 20 (BM25) | TREC DL 19 (ColBERTv2) | TREC DL 20 (ColBERTv2) |
+| ------------------------------------------------------------------- | ----------------- | ----------------- | ---------------------- | ---------------------- |
+| [set-encoder-base](https://huggingface.co/webis/set-encoder-base) | 0.724 | 0.710 | 0.788 | 0.777 |
+| [set-encoder-large](https://huggingface.co/webis/set-encoder-large) | 0.727 | 0.735 | 0.789 | 0.790 |
+
+## Inference
+
+We recommend using the `lightning-ir` CLI to run inference. The following command runs inference with the `set-encoder-base` model on the TREC DL 19 and TREC DL 20 datasets:
+
+```bash
+lightning-ir re_rank --config configs/re-rank.yaml --config configs/set-encoder-finetuned.yaml --config configs/trec-dl.yaml
+```
+
+## Fine-Tuning
+
+WIP
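The `lightning-ir re_rank` command from the Inference section of the README passes one `--config` flag per YAML file. When the command needs to be launched from a script, it can be assembled programmatically. The following is a minimal sketch, assuming `lightning-ir` is installed and the three config files exist relative to the working directory; the helper names `build_re_rank_command` and `run_re_rank` and the `dry_run` flag are hypothetical and not part of Lightning IR.

```python
import shlex
import subprocess

# Config files copied from the command in the Inference section; the paths are
# assumed to be relative to a checkout of the Set-Encoder repository.
RE_RANK_CONFIGS = [
    "configs/re-rank.yaml",
    "configs/set-encoder-finetuned.yaml",
    "configs/trec-dl.yaml",
]


def build_re_rank_command(configs):
    """Return the argv list for `lightning-ir re_rank`, one --config flag per file."""
    command = ["lightning-ir", "re_rank"]
    for path in configs:
        command.extend(["--config", path])
    return command


def run_re_rank(configs, dry_run=True):
    """Return the shell-quoted command; execute it first unless dry_run is set.

    `dry_run` is a hypothetical convenience flag of this sketch, not a
    lightning-ir option.
    """
    command = build_re_rank_command(configs)
    if not dry_run:
        # Requires lightning-ir to be installed and the config files to exist.
        subprocess.run(command, check=True)
    return shlex.join(command)


if __name__ == "__main__":
    # Dry run: only print the command that would be executed.
    print(run_re_rank(RE_RANK_CONFIGS))
```

With the default `dry_run=True` the sketch only returns the command string, which makes it easy to inspect before launching a re-ranking run; passing `dry_run=False` hands the same argument list to `subprocess.run`.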