NbAiLab/nb-sbert-v2-large

vlhandfo committed on Apr 10

Commit 28752cb · 1 Parent(s): b0830e6

Update README.md

Files changed (1)

README.md +2 -2

@@ -68,7 +68,7 @@ model-index:
 # SentenceTransformer based on NbAiLab/nb-bert-large
-This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [NbAiLab/nb-bert-large](https://huggingface.co/NbAiLab/nb-bert-large).
+This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [NbAiLab/nb-bert-large](https://huggingface.co/NbAiLab/nb-bert-large). It builds on the previous work of the existing [NbAiLab/nb-sbert-base](https://huggingface.co/NbAiLab/nb-sbert-base) model, using a larger foundational model and providing a larger max sequence length for inputs.
 The model maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more. The easiest way is to simply measure the cosine distance between two sentences. Sentences that are close to each other in meaning, will have a small cosine distance and a similarity close to 1. The model is trained in such a way that similar sentences in different languages should also be close to each other. Ideally, an English-Norwegian sentence pair should have high similarity.
@@ -547,7 +547,7 @@ You can finetune this model on your own dataset.
   year      = {2021},
   address   = {Reykjavik, Iceland (Online)},
   publisher = {Linköping University Electronic Press, Sweden},
-  url       = {https://aclanthology.org/2021.nodalida-main.3},
+  url       = {https://huggingface.co/papers/2104.09617},
   pages     = {20--29},
   abstract  = {In this work, we show the process of building a large-scale training set from digital and digitized collections at a national library.
   The resulting Bidirectional Encoder Representations from Transformers (BERT)-based language model for Norwegian outperforms multilingual BERT (mBERT) models
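
The README text in this diff describes comparing two sentences by the cosine distance between their embeddings, with similar sentences having a small distance and a similarity close to 1. The sketch below illustrates only that relationship (distance = 1 - similarity) in plain Python. The three vectors and their names are hypothetical toy values, not output from this model; in practice the embeddings would come from the model itself, for example via the sentence-transformers library.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def cosine_distance(a, b):
    """Cosine distance, defined here as 1 minus cosine similarity."""
    return 1.0 - cosine_similarity(a, b)


# Hypothetical 3-dimensional stand-ins for sentence embeddings.
# Real embeddings from the model have far more dimensions.
emb_norwegian = [0.20, 0.90, 0.10]   # assumed: a Norwegian sentence
emb_english = [0.25, 0.85, 0.05]     # assumed: its English paraphrase
emb_unrelated = [0.90, -0.10, 0.40]  # assumed: an unrelated sentence

print("paraphrase similarity:", cosine_similarity(emb_norwegian, emb_english))
print("unrelated similarity: ", cosine_similarity(emb_norwegian, emb_unrelated))
print("paraphrase distance:  ", cosine_distance(emb_norwegian, emb_english))
```

With these toy values the paraphrase pair scores a higher similarity (and a lower distance) than the unrelated pair, which is the behaviour the README says a well-trained model should show for an English-Norwegian sentence pair.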