Upload folder using huggingface_hub

README.md CHANGED

@@ -24,12 +24,15 @@ All the HPLT GPT-BERT models use the same hyper-parameters:
- hidden size: 640
- attention heads: 10
- layers: 24
- vocabulary size:

Every model uses its own tokenizer trained on language-specific HPLT data (a loading sketch follows below).

[The training code](https://github.com/ltgoslo/NorBERT/tree/main/norbert4) is available on GitHub.

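As a quick sanity check, the hyper-parameters above can be read back from the published configuration. This is a minimal sketch, not from the README itself: the repo id is a hypothetical placeholder, and the attribute names assume the custom config follows the usual `transformers` conventions.

```python
# A minimal sketch, assuming a hypothetical placeholder repo id -- substitute
# the actual HPLT GPT-BERT checkpoint for your language.
from transformers import AutoConfig, AutoTokenizer

repo_id = "HPLT/gpt-bert-base-eng"  # hypothetical placeholder, not from this README

# The custom architecture ships its own config class, hence trust_remote_code=True.
config = AutoConfig.from_pretrained(repo_id, trust_remote_code=True)
print(config.hidden_size)          # expected: 640
print(config.num_attention_heads)  # expected: 10
print(config.num_hidden_layers)    # expected: 24

# Each model ships its own language-specific tokenizer.
tokenizer = AutoTokenizer.from_pretrained(repo_id)
print(tokenizer.vocab_size)
```
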
+```
+pip install transformers==4.57.6
+```
## Example usage (bidirectional encoding)

This model currently needs a custom wrapper from `modeling_gptbert.py`; you should therefore load the model with `trust_remote_code=True`.
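
A minimal sketch under the stated assumptions: the repo id is the same hypothetical placeholder as above, and the custom wrapper loaded via `trust_remote_code=True` is assumed to expose the standard `last_hidden_state` output.

```python
# A minimal sketch of bidirectional encoding with the transformers version
# pinned above. The repo id is a hypothetical placeholder, not from this README.
import torch
from transformers import AutoModel, AutoTokenizer

repo_id = "HPLT/gpt-bert-base-eng"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
# trust_remote_code=True loads the custom wrapper from modeling_gptbert.py.
model = AutoModel.from_pretrained(repo_id, trust_remote_code=True)
model.eval()

inputs = tokenizer("HPLT models cover many languages.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# One contextual vector per input token; the hidden size should match 640 above.
print(outputs.last_hidden_state.shape)
```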

@@ -157,4 +160,8 @@ print([b.name for b in out.branches])
url={https://arxiv.org/abs/2511.01066},
}
```
-[.
+[arXiv:2410.24159](https://arxiv.org/abs/2410.24159)
+
+[arXiv:2511.01066](https://arxiv.org/abs/2511.01066)
+
+This project has received funding from the European Union’s Horizon Europe research and innovation programme under grant agreement No 101070350 and from UK Research and Innovation (UKRI) under the UK government’s Horizon Europe funding guarantee [grant number 10052546].