legacy-datasets/wikipedia
Updated • 121k • 629
How to use SzegedAI/hubertusz-small-wiki with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForPreTraining
tokenizer = AutoTokenizer.from_pretrained("SzegedAI/hubertusz-small-wiki")
model = AutoModelForPreTraining.from_pretrained("SzegedAI/hubertusz-small-wiki")This model was trained from scratch on the Wikipedia subset of Hungarian Webcorpus 2.0 with MLM and SOP tasks.
First phase:
Second phase: