Model Card for domce20/GPT2-Lithuanian

A GPT-2-based language model trained for Lithuanian.

Model Description

The model architecture is copied from the ai-forever/mGPT model, but the model is trained from scratch on a modified version of the Lithuanian partition of the mC4 dataset.

Training was carried out on the Vilnius University supercomputer.
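
Because the architecture follows mGPT, a GPT-2-style causal language model, the checkpoint can presumably be loaded with the generic `transformers` auto classes. The following is a minimal sketch, not an official usage recipe: it assumes the weights and tokenizer are published under the repository name `domce20/GPT2-Lithuanian` and that `transformers` and PyTorch are installed. The helper name `generate_text`, the prompt, and the sampling settings are illustrative choices, not values taken from the training setup.

```python
# Repository name as shown on the model page (assumed to host both weights and tokenizer).
MODEL_ID = "domce20/GPT2-Lithuanian"


def generate_text(prompt, max_new_tokens=40, model_id=MODEL_ID):
    """Return `prompt` followed by a sampled continuation from the model."""
    # Imported lazily so the function can be defined without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)

    # Tokenize the prompt into PyTorch tensors.
    inputs = tokenizer(prompt, return_tensors="pt")

    # Illustrative sampling settings (assumptions, not tuned for this model).
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        do_sample=True,
        top_p=0.95,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("Vilnius yra")` downloads the checkpoint on first use and returns the prompt with a generated continuation. Since sampling is enabled, the output differs between runs.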

Model size: 1B params (Safetensors, tensor type F32)

Datasets used to train domce20/GPT2-Lithuanian