Transformers
Safetensors
Polish
t5
text2text-generation
seq2seq
text-to-text
scientific-language-models
cross-lingual-transfer
wechsel
global-mmlu
text-generation-inference
How to use rausch/pl-t5-sci-transfer-init-spm32k with Transformers:
    # Load model directly
    from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

    tokenizer = AutoTokenizer.from_pretrained("rausch/pl-t5-sci-transfer-init-spm32k")
    model = AutoModelForSeq2SeqLM.from_pretrained("rausch/pl-t5-sci-transfer-init-spm32k")
{
  "timestamp_utc": "2026-02-16T20:50:18.532387+00:00",
  "language": "pol_Latn",
  "track": "paper_spm32k",
  "training_data": "/netscratch/nrauscher/projects/BA-hydra/cross_lingual_transfer_multilingual/data/languages/pol_Latn/splits/sub/sub_charcap43gb_seed42/train/docs.parquet",
  "training_data_rows": 121766,
  "subsplit": "sub_charcap43gb_seed42",
  "sentencepiece": {
    "model_type": "bpe",
    "vocab_size": 32000,
    "character_coverage": 1.0,
    "byte_fallback": true,
    "hard_vocab_limit": false,
    "input_sentence_size": 0,
    "shuffle_input_sentence": true,
    "num_threads": 32,
    "pad_id": 0,
    "eos_id": 1,
    "unk_id": 2,
    "bos_id": -1
  },
  "t5": {
    "extra_ids": 100,
    "tokenizer_length": 32100,
    "fast_tokenizer_export_ok": false
  },
  "duration_sec": 63.32048845291138
}
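
The metadata above can be read as a recipe: the `sentencepiece` block lists training options, and the `t5` block records that 100 sentinel tokens were added on top of the 32000-piece vocabulary. The following is a minimal sketch, not the author's actual training script, of how those fields could be turned into keyword arguments for `sentencepiece.SentencePieceTrainer.train` and how `tokenizer_length` follows from `vocab_size + extra_ids`. The helper names, `corpus.txt`, and `pl_spm32k` are hypothetical; the real training data is a Parquet file, so some text extraction step (not shown in the metadata) is assumed.

```python
import json

# Excerpt of the metadata above; only the fields used in this sketch.
META = json.loads("""
{
  "sentencepiece": {
    "model_type": "bpe",
    "vocab_size": 32000,
    "character_coverage": 1.0,
    "byte_fallback": true,
    "hard_vocab_limit": false,
    "input_sentence_size": 0,
    "shuffle_input_sentence": true,
    "num_threads": 32,
    "pad_id": 0,
    "eos_id": 1,
    "unk_id": 2,
    "bos_id": -1
  },
  "t5": {"extra_ids": 100, "tokenizer_length": 32100}
}
""")


def spm_train_kwargs(meta, input_path, model_prefix):
    """Build keyword arguments for SentencePieceTrainer.train (hypothetical helper)."""
    kwargs = dict(meta["sentencepiece"])   # copy the recorded training options
    kwargs["input"] = input_path           # plain-text corpus, one sentence per line
    kwargs["model_prefix"] = model_prefix  # output files: <prefix>.model, <prefix>.vocab
    return kwargs


def expected_tokenizer_length(meta):
    """SentencePiece vocabulary plus the T5 sentinel tokens (<extra_id_N>)."""
    return meta["sentencepiece"]["vocab_size"] + meta["t5"]["extra_ids"]


# Assumed file names, for illustration only.
kwargs = spm_train_kwargs(META, "corpus.txt", "pl_spm32k")

# The recorded tokenizer length should equal vocab_size + extra_ids.
assert expected_tokenizer_length(META) == META["t5"]["tokenizer_length"]

# Training itself needs the sentencepiece package and a corpus, so it is left commented:
# import sentencepiece as spm
# spm.SentencePieceTrainer.train(**kwargs)
```

Note that `bos_id: -1` disables the beginning-of-sequence token, which matches the T5 convention of using only pad, EOS, and unknown as special ids.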