Instructions to use Finnish-NLP/t5-small-nl24-casing-punctuation-correction with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Finnish-NLP/t5-small-nl24-casing-punctuation-correction with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("Finnish-NLP/t5-small-nl24-casing-punctuation-correction") model = AutoModelForSeq2SeqLM.from_pretrained("Finnish-NLP/t5-small-nl24-casing-punctuation-correction") - Notebooks
- Google Colab
- Kaggle
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
Based on Finnish pretrained T5 model version small-nl24
Train data: Around 300k samples from from following datasets
- wikipedia
- Yle Finnish News Archive 2011-2018
- Yle Finnish News Archive 2019-2020
- Finnish News Agency Archive (STT)
- The Suomi24 Sentences Corpus
Tested with 1000 samples from the previous datasets Median CER 1.1% MEAN CER 4.2% More detailed info coming later...
- Downloads last month
- 280
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐ Ask for provider support