clarin-pl/nkjp-pos
Viewer • Updated • 85.7k • 174 • 2
How to use nie3e/pos-polish-gpt2-small with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("token-classification", model="nie3e/pos-polish-gpt2-small") # Load model directly
from transformers import AutoTokenizer, AutoModelForTokenClassification
tokenizer = AutoTokenizer.from_pretrained("nie3e/pos-polish-gpt2-small")
model = AutoModelForTokenClassification.from_pretrained("nie3e/pos-polish-gpt2-small")This model was trained from polish-gpt2-small on clarin-pl/nkjp-pos dataset. It achieves the following results on the evaluation set:
Trained from polish-gpt2-small
Part-of-speech tagging for Polish language. Tags description at the bottom of http://nkjp.pl/poliqarp/help/plse2.html
Dataset: clarin-pl/nkjp-pos
Datacollator:
from transformers import DataCollatorForTokenClassification
data_collator = DataCollatorForTokenClassification(tokenizer=tokenizer)
GPU: RTX 3090
Training time: 00:50:24
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
|---|---|---|---|---|---|---|---|
| 0.0 | 0 | 3.6116 | 0.0464 | 0.0524 | 0.0492 | 0.0676 | |
| 0.2303 | 1.0 | 1222 | 0.2159 | 0.8737 | 0.9225 | 0.8974 | 0.9347 |
| 0.1776 | 2.0 | 2444 | 0.2124 | 0.8799 | 0.9254 | 0.9021 | 0.9381 |
| 0.1467 | 3.0 | 3666 | 0.2205 | 0.8759 | 0.9241 | 0.8994 | 0.9368 |
| 0.1254 | 4.0 | 4889 | 0.2304 | 0.8792 | 0.9256 | 0.9018 | 0.9377 |
| 0.1091 | 5.0 | 6111 | 0.2480 | 0.8787 | 0.9251 | 0.9013 | 0.9375 |
| 0.0949 | 6.0 | 7333 | 0.2651 | 0.8794 | 0.9250 | 0.9016 | 0.9373 |
| 0.0857 | 7.0 | 8555 | 0.2794 | 0.8791 | 0.9251 | 0.9015 | 0.9372 |
| 0.079 | 8.0 | 9778 | 0.2922 | 0.8789 | 0.9247 | 0.9012 | 0.9366 |
| 0.0736 | 9.0 | 11000 | 0.3037 | 0.8807 | 0.9256 | 0.9026 | 0.9375 |
| 0.0691 | 10.0 | 12220 | 0.3109 | 0.8793 | 0.9255 | 0.9018 | 0.9371 |