KocLab-Bilkent
/

BERTurk-Legal

@@ -13,6 +13,21 @@ We introduce BERTurk-Legal which is a transformer-based language model to retrie
 Test dataset can be accessed from the following link: https://github.com/koc-lab/yargitay_retrieval_dataset
 ## Citation
 If you use the model, please cite the following conference paper.
 ```

 Test dataset can be accessed from the following link: https://github.com/koc-lab/yargitay_retrieval_dataset
+The model can be loaded and used to create document embeddings as follows. Then, the document embeddings can be utilized for retrieval.
+```
+from transformers import AutoModelForSequenceClassification, AutoTokenizer
+bert_model = "KocLab-Bilkent/BERTurk-Legal"
+model = AutoModelForSequenceClassification.from_pretrained(bert_model, output_hidden_states=True)
+tokenizer = AutoTokenizer.from_pretrained(bert_model)
+tokens = tokenizer("Örnek metin") # a dummy text is provided as input
+output = model(tokens)
+docEmbeddings = output.hidden_states[-1]
+```
 ## Citation
 If you use the model, please cite the following conference paper.
 ```