How to use from the Transformers library
```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="Andrija/SRoBERTa-base")
```
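As a rough illustration of how the fill-mask pipeline might be queried, the sketch below wraps the call in two small helpers. The `{mask}` placeholder, the helper names, and the example sentence are hypothetical and not part of the model card. The mask token is read from the tokenizer rather than hard-coded, and `transformers` is imported lazily inside `demo()` so the helpers stay usable on their own.

```python
def build_masked_sentence(template: str, mask_token: str) -> str:
    """Replace the (hypothetical) `{mask}` placeholder with the model's mask token."""
    return template.replace("{mask}", mask_token)


def top_predictions(fill_mask, text: str, k: int = 5):
    """Return (token, score) pairs for the k most likely fillers.

    `fill_mask` is any callable that behaves like a fill-mask pipeline:
    for a single masked sentence it returns a list of dicts containing
    the keys "token_str" and "score".
    """
    results = fill_mask(text, top_k=k)
    return [(r["token_str"].strip(), round(r["score"], 4)) for r in results]


def demo():
    # Imported here so the helpers above do not require transformers.
    from transformers import pipeline

    # Downloads the checkpoint from the Hub on first use.
    pipe = pipeline("fill-mask", model="Andrija/SRoBERTa-base")

    # Illustrative sentence: "Zagreb is the capital of <mask>."
    text = build_masked_sentence(
        "Zagreb je glavni grad {mask}.", pipe.tokenizer.mask_token
    )
    for token, score in top_predictions(pipe, text):
        print(f"{token}\t{score}")
```

Calling `demo()` prints the five highest-scoring candidates for the masked position; the actual tokens and scores depend on the checkpoint and are not shown here.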
```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("Andrija/SRoBERTa-base")
model = AutoModelForMaskedLM.from_pretrained("Andrija/SRoBERTa-base")
```
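When the tokenizer and model are loaded directly, the masked position has to be located and decoded by hand. The following is a minimal sketch of one way to do that, assuming PyTorch is installed; the function names are hypothetical, and the input text is expected to already contain the tokenizer's mask token (check `tokenizer.mask_token` rather than assuming its spelling).

```python
def mask_positions(input_ids, mask_token_id):
    """Return the indices at which the mask token occurs in a flat list of ids."""
    return [i for i, tok in enumerate(input_ids) if tok == mask_token_id]


def predict_masked_tokens(text: str, k: int = 5,
                          model_id: str = "Andrija/SRoBERTa-base"):
    """Return {position: [(token, probability), ...]} for each mask in `text`."""
    # Heavy dependencies are imported lazily, only when a prediction is requested.
    import torch
    from transformers import AutoTokenizer, AutoModelForMaskedLM

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForMaskedLM.from_pretrained(model_id)
    model.eval()

    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (1, seq_len, vocab_size)

    ids = inputs["input_ids"][0].tolist()
    predictions = {}
    for pos in mask_positions(ids, tokenizer.mask_token_id):
        # Turn the logits at the masked position into probabilities.
        probs = torch.softmax(logits[0, pos], dim=-1)
        top = torch.topk(probs, k)
        predictions[pos] = [
            (tokenizer.decode([idx]).strip(), float(p))
            for p, idx in zip(top.values.tolist(), top.indices.tolist())
        ]
    return predictions
```

This reloads the model on every call, which keeps the sketch short; in practice the tokenizer and model would more likely be loaded once and reused.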

Transformer language model for Croatian and Serbian

Trained for two epochs on 3 GB of Croatian and Serbian text drawn from the Leipzig Corpus and OSCAR datasets.

Dataset information

| Model | #params | Arch. | Training data |
|---|---|---|---|
| Andrija/SRoBERTa-base | 80M | Second | Leipzig Corpus and OSCAR (3 GB of text) |
