Murhaf commited on
Commit
fc7bf8d
·
verified ·
1 Parent(s): 139facd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -45
README.md CHANGED
@@ -9,15 +9,17 @@ tags:
9
  - generated_from_trainer
10
  - dataset_size:556367
11
  - loss:CachedMultipleNegativesRankingLoss
12
- base_model: Murhaf/ltg-norbert4-base_ndla
 
13
  widget:
14
  - source_sentence: Inne i igloen gjør den unge mannen seg klar for sitt overnattingsopphold.
15
  sentences:
16
  - Folk danser i gaten.
17
  - Den unge mannen gjør seg klar for sitt overnattingsopphold.
18
  - Den unge mannen gjør seg klar til å dra.
19
- - source_sentence: En kvinne i rullestol snakker med vennen sin mens hun er omgitt
20
- av andre mennesker som går i parken.
 
21
  sentences:
22
  - Barna blir fotografert.
23
  - Kvinnen er utendørs.
@@ -27,8 +29,9 @@ widget:
27
  - En mann og en kvinne ser på frukt og grønnsaker.
28
  - En kvinne løper.
29
  - En kvinne sitter ved et piknikbord nær den steinete kysten.
30
- - source_sentence: To basketballspillere i svart og hvitt antrekk står på en basketballbane
31
- og snakker.
 
32
  sentences:
33
  - De to basketballspillerne snakker sammen.
34
  - Den unge gutten multitasker.
@@ -39,7 +42,8 @@ widget:
39
  - På fornøyelsesturen var det to jenter som smilte og lo
40
  - En kvinne ødelegger et sandmaleri.
41
  datasets:
42
- - Murhaf/all-nli-norwegian
 
43
  pipeline_tag: sentence-similarity
44
  library_name: sentence-transformers
45
  metrics:
@@ -57,6 +61,7 @@ model-index:
57
  - type: cosine_accuracy
58
  value: 0.9470000267028809
59
  name: Cosine Accuracy
 
60
  ---
61
 
62
  # SentenceTransformer based on Murhaf/ltg-norbert4-base_ndla
@@ -67,20 +72,15 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [M
67
 
68
  ### Model Description
69
  - **Model Type:** Sentence Transformer
70
- - **Base model:** [Murhaf/ltg-norbert4-base_ndla](https://huggingface.co/Murhaf/ltg-norbert4-base_ndla) <!-- at revision 762fb095e1c571e52d8690bf07ec8b65d3551026 -->
71
  - **Maximum Sequence Length:** 75 tokens
72
  - **Output Dimensionality:** 640 dimensions
73
  - **Similarity Function:** Cosine Similarity
74
  - **Training Dataset:**
75
- - [all-nli-norwegian](https://huggingface.co/datasets/Murhaf/all-nli-norwegian)
76
  - **Language:** no
77
  <!-- - **License:** Unknown -->
78
 
79
- ### Model Sources
80
-
81
- - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
82
- - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
83
- - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
84
 
85
  ### Full Model Architecture
86
 
@@ -234,6 +234,7 @@ You can finetune this model on your own dataset.
234
 
235
  ### Training Hyperparameters
236
  #### Non-Default Hyperparameters
 
237
 
238
  - `eval_strategy`: steps
239
  - `per_device_train_batch_size`: 512
@@ -241,6 +242,7 @@ You can finetune this model on your own dataset.
241
  - `num_train_epochs`: 1
242
  - `warmup_ratio`: 0.1
243
  - `batch_sampler`: no_duplicates
 
244
 
245
  #### All Hyperparameters
246
  <details><summary>Click to expand</summary>
@@ -366,14 +368,12 @@ You can finetune this model on your own dataset.
366
 
367
  </details>
368
 
369
- ### Training Logs
370
- | Epoch | Step | Training Loss | Validation Loss | nob_all_nli_test_cosine_accuracy |
371
- |:------:|:----:|:-------------:|:---------------:|:--------------------------------:|
372
- | 0.3690 | 100 | 1.8282 | 0.6138 | 0.9420 |
373
- | 0.7380 | 200 | 1.1887 | 0.5645 | 0.9470 |
374
 
375
 
376
  ### Framework Versions
 
 
377
  - Python: 3.12.11
378
  - Sentence Transformers: 5.1.1
379
  - Transformers: 4.56.2
@@ -382,34 +382,8 @@ You can finetune this model on your own dataset.
382
  - Datasets: 4.1.1
383
  - Tokenizers: 0.22.1
384
 
385
- ## Citation
386
-
387
- ### BibTeX
388
-
389
- #### Sentence Transformers
390
- ```bibtex
391
- @inproceedings{reimers-2019-sentence-bert,
392
- title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
393
- author = "Reimers, Nils and Gurevych, Iryna",
394
- booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
395
- month = "11",
396
- year = "2019",
397
- publisher = "Association for Computational Linguistics",
398
- url = "https://arxiv.org/abs/1908.10084",
399
- }
400
- ```
401
 
402
- #### CachedMultipleNegativesRankingLoss
403
- ```bibtex
404
- @misc{gao2021scaling,
405
- title={Scaling Deep Contrastive Learning Batch Size under Memory Limited Setup},
406
- author={Luyu Gao and Yunyi Zhang and Jiawei Han and Jamie Callan},
407
- year={2021},
408
- eprint={2101.06983},
409
- archivePrefix={arXiv},
410
- primaryClass={cs.LG}
411
- }
412
- ```
413
 
414
  <!--
415
  ## Glossary
 
9
  - generated_from_trainer
10
  - dataset_size:556367
11
  - loss:CachedMultipleNegativesRankingLoss
12
+ base_model:
13
+ - ltg/norbert4-base
14
  widget:
15
  - source_sentence: Inne i igloen gjør den unge mannen seg klar for sitt overnattingsopphold.
16
  sentences:
17
  - Folk danser i gaten.
18
  - Den unge mannen gjør seg klar for sitt overnattingsopphold.
19
  - Den unge mannen gjør seg klar til å dra.
20
+ - source_sentence: >-
21
+ En kvinne i rullestol snakker med vennen sin mens hun er omgitt av andre
22
+ mennesker som går i parken.
23
  sentences:
24
  - Barna blir fotografert.
25
  - Kvinnen er utendørs.
 
29
  - En mann og en kvinne ser på frukt og grønnsaker.
30
  - En kvinne løper.
31
  - En kvinne sitter ved et piknikbord nær den steinete kysten.
32
+ - source_sentence: >-
33
+ To basketballspillere i svart og hvitt antrekk står på en basketballbane og
34
+ snakker.
35
  sentences:
36
  - De to basketballspillerne snakker sammen.
37
  - Den unge gutten multitasker.
 
42
  - På fornøyelsesturen var det to jenter som smilte og lo
43
  - En kvinne ødelegger et sandmaleri.
44
  datasets:
45
+ - Fremtind/all-nli-norwegian
46
+ - NbAiLab/ndla_parallel_paragraphs
47
  pipeline_tag: sentence-similarity
48
  library_name: sentence-transformers
49
  metrics:
 
61
  - type: cosine_accuracy
62
  value: 0.9470000267028809
63
  name: Cosine Accuracy
64
+ license: apache-2.0
65
  ---
66
 
67
  # SentenceTransformer based on Murhaf/ltg-norbert4-base_ndla
 
72
 
73
  ### Model Description
74
  - **Model Type:** Sentence Transformer
75
+ - **Base model:** [ltg/norbert4-base](https://huggingface.co/ltg/norbert4-base) <!-- at revision 762fb095e1c571e52d8690bf07ec8b65d3551026 -->
76
  - **Maximum Sequence Length:** 75 tokens
77
  - **Output Dimensionality:** 640 dimensions
78
  - **Similarity Function:** Cosine Similarity
79
  - **Training Dataset:**
80
+ - [all-nli-norwegian](https://huggingface.co/datasets/Fremtind/all-nli-norwegian)
81
  - **Language:** no
82
  <!-- - **License:** Unknown -->
83
 
 
 
 
 
 
84
 
85
  ### Full Model Architecture
86
 
 
234
 
235
  ### Training Hyperparameters
236
  #### Non-Default Hyperparameters
237
+ <details><summary>Click to expand</summary>
238
 
239
  - `eval_strategy`: steps
240
  - `per_device_train_batch_size`: 512
 
242
  - `num_train_epochs`: 1
243
  - `warmup_ratio`: 0.1
244
  - `batch_sampler`: no_duplicates
245
+ </details>
246
 
247
  #### All Hyperparameters
248
  <details><summary>Click to expand</summary>
 
368
 
369
  </details>
370
 
371
+
 
 
 
 
372
 
373
 
374
  ### Framework Versions
375
+ <details><summary>Click to expand</summary>
376
+
377
  - Python: 3.12.11
378
  - Sentence Transformers: 5.1.1
379
  - Transformers: 4.56.2
 
382
  - Datasets: 4.1.1
383
  - Tokenizers: 0.22.1
384
 
385
+ </details>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
386
 
 
 
 
 
 
 
 
 
 
 
 
387
 
388
  <!--
389
  ## Glossary