Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Paper • 1908.10084 • Published • 13
This model is a BERT-based bi-encoder[^1] model fine-tuned using Lightning IR.
See the Lightning IR Model Zoo for a comparison with other models.
To reproduce the model training, install Lightning IR and run the following command using the fine-tune.yaml configuration file:
lightning-ir fit --config fine-tune.yaml
To index MS~MARCO passages, use the following command and the index.yaml configuration file:
lightning-ir index --config index.yaml
After indexing, to evaluate the model on TREC Deep Learning 2019 and 2020, use the following command and the search.yaml configuration file:
lightning-ir search --config search.yaml
[^1]: Reimers and Gurevych, Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Base model
google-bert/bert-base-uncased
#install from https://github.com/webis-de/lightning-ir from lightning_ir import BiEncoderModule model = BiEncoderModule("webis/bert-bi-encoder") model.score("query", ["doc1", "doc2", "doc3"])