How to use from the
Use from the
Transformers library
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="sdadas/polish-roberta-base-v2")
# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("sdadas/polish-roberta-base-v2")
model = AutoModelForMaskedLM.from_pretrained("sdadas/polish-roberta-base-v2")
Quick Links

polish-roberta-base-v2

An encoder model based on the RoBERTa architecture, pre-trained on a large corpus of Polish texts. More information can be found in our GitHub repository and in the publication Pre-training polish transformer-based language models at scale.

Citation

@inproceedings{dadas2020pre,
  title={Pre-training polish transformer-based language models at scale},
  author={Dadas, S{\l}awomir and Pere{\l}kiewicz, Micha{\l} and Po{\'s}wiata, Rafa{\l}},
  booktitle={International Conference on Artificial Intelligence and Soft Computing},
  pages={301--314},
  year={2020},
  organization={Springer}
}
Downloads last month
982
Safetensors
Model size
0.1B params
Tensor type
I64
·
F32
·
Inference Providers NEW
Mask token: <mask>

Model tree for sdadas/polish-roberta-base-v2

Finetunes
4 models

Space using sdadas/polish-roberta-base-v2 1

Collection including sdadas/polish-roberta-base-v2

Paper for sdadas/polish-roberta-base-v2