Update README.md
Browse files
README.md
CHANGED
|
@@ -14,6 +14,10 @@ license: cc-by-4.0
|
|
| 14 |
|
| 15 |
# NorBERT 3 base
|
| 16 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 17 |
|
| 18 |
## Other sizes:
|
| 19 |
- [NorBERT 3 xs (15M)](https://huggingface.co/ltg/norbert3-xs)
|
|
@@ -21,6 +25,12 @@ license: cc-by-4.0
|
|
| 21 |
- [NorBERT 3 base (123M)](https://huggingface.co/ltg/norbert3-base)
|
| 22 |
- [NorBERT 3 large (323M)](https://huggingface.co/ltg/norbert3-large)
|
| 23 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 24 |
|
| 25 |
## Example usage
|
| 26 |
|
|
@@ -43,4 +53,17 @@ output_text = torch.where(input_text.input_ids == mask_id, output_p.logits.argma
|
|
| 43 |
print(tokenizer.decode(output_text[0].tolist()))
|
| 44 |
```
|
| 45 |
|
| 46 |
-
The following classes are currently implemented: `NorbertForMaskedLM`, `NorbertForSequenceClassification`, `NorbertForTokenClassification`, `NorbertForQuestionAnswering` and `NorbertForMultipleChoice`.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 14 |
|
| 15 |
# NorBERT 3 base
|
| 16 |
|
| 17 |
+
<img src="https://huggingface.co/ltg/norbert3-base/resolve/main/norbert.png" width=12.5%>
|
| 18 |
+
|
| 19 |
+
The official release of a new generation of NorBERT language models described in paper [**NorBench — A Benchmark for Norwegian Language Models**](https://openreview.net/forum?id=WgxNONkAbz). Plese read the paper to learn more details about the model.
|
| 20 |
+
|
| 21 |
|
| 22 |
## Other sizes:
|
| 23 |
- [NorBERT 3 xs (15M)](https://huggingface.co/ltg/norbert3-xs)
|
|
|
|
| 25 |
- [NorBERT 3 base (123M)](https://huggingface.co/ltg/norbert3-base)
|
| 26 |
- [NorBERT 3 large (323M)](https://huggingface.co/ltg/norbert3-large)
|
| 27 |
|
| 28 |
+
## Generative NorT5 siblings:
|
| 29 |
+
- [NorT5 xs (15M)](https://huggingface.co/ltg/nort5-xs)
|
| 30 |
+
- [NorT5 small (40M)](https://huggingface.co/ltg/nort5-small)
|
| 31 |
+
- [NorT5 base (123M)](https://huggingface.co/ltg/nort5-base)
|
| 32 |
+
- [NorT5 large (323M)](https://huggingface.co/ltg/nort5-large)
|
| 33 |
+
|
| 34 |
|
| 35 |
## Example usage
|
| 36 |
|
|
|
|
| 53 |
print(tokenizer.decode(output_text[0].tolist()))
|
| 54 |
```
|
| 55 |
|
| 56 |
+
The following classes are currently implemented: `NorbertForMaskedLM`, `NorbertForSequenceClassification`, `NorbertForTokenClassification`, `NorbertForQuestionAnswering` and `NorbertForMultipleChoice`.
|
| 57 |
+
|
| 58 |
+
## Cite us
|
| 59 |
+
|
| 60 |
+
```bibtex
|
| 61 |
+
@inproceedings{
|
| 62 |
+
samuel2023norbench,
|
| 63 |
+
title={NorBench -- A Benchmark for Norwegian Language Models},
|
| 64 |
+
author={David Samuel and Andrey Kutuzov and Samia Touileb and Erik Velldal and Lilja {\O}vrelid and Egil R{\o}nningstad and Elina Sigdel and Anna Sergeevna Palatkina},
|
| 65 |
+
booktitle={The 24rd Nordic Conference on Computational Linguistics},
|
| 66 |
+
year={2023},
|
| 67 |
+
url={https://openreview.net/forum?id=WgxNONkAbz}
|
| 68 |
+
}
|
| 69 |
+
```
|