FinArcNER

Introduction

FinArcNER is a named entity recognition (NER) model trained mostly on archival data. This repository was produced as part of the FIN-CLARIAH infrastructure project. The model was trained and the accompanying paper written in cooperation with the National Archives of Finland.

Using the model

from transformers import pipeline

model_checkpoint = "jyu-digihum/finarcner"
token_classifier = pipeline(
    "token-classification", model=model_checkpoint, aggregation_strategy="simple"
)
# Example sentence (Finnish): "Helsinki became the capital of the Grand Duchy of Finland in 1812."
predictions = token_classifier("Helsingistä tuli Suomen suuriruhtinaskunnan pääkaupunki vuonna 1812.")
print(predictions)
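
With aggregation_strategy="simple", the pipeline returns a list of dictionaries with the keys entity_group, score, word, start and end. The sketch below shows one way to group such predictions by label and drop low-confidence spans. The helper group_entities, the min_score threshold, and the sample labels and scores are illustrative assumptions, not actual output of FinArcNER; check the model's label set before relying on specific label names.

```python
from collections import defaultdict


def group_entities(predictions, min_score=0.5):
    """Group aggregated token-classification predictions by entity label.

    Hypothetical helper: keeps only spans whose score is at least min_score.
    """
    grouped = defaultdict(list)
    for pred in predictions:
        if pred["score"] >= min_score:
            grouped[pred["entity_group"]].append(pred["word"])
    return dict(grouped)


# Made-up sample in the shape the pipeline returns with
# aggregation_strategy="simple". Labels and scores are assumptions.
sample_predictions = [
    {"entity_group": "LOC", "score": 0.99, "word": "Helsingistä", "start": 0, "end": 11},
    {"entity_group": "LOC", "score": 0.97, "word": "Suomen suuriruhtinaskunnan", "start": 17, "end": 43},
    {"entity_group": "DATE", "score": 0.42, "word": "1812", "start": 63, "end": 67},
]

print(group_entities(sample_predictions))
```

In practice, the same helper can be called on the predictions list produced by token_classifier above; lowering min_score keeps more spans at the cost of more false positives.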

Annotation guidelines

In addition to the model and the full paper, we publish the annotation guidelines in JYX, the digital repository of the University of Jyväskylä. To cite the guidelines, use the reference below:

APA:

Poso, V., Välisalo, T., Toivanen, I., Lipsanen, M., Kukkohovi, L., Kytöaho, R., Palander, S., Pohjola, M., Laitinen, V., Föhr, A., Abdelamir, A. & Niemi, J. (2025). NER annotation guidelines for archival data. University of Jyväskylä. URN: https://urn.fi/URN:NBN:fi:jyu-202501291584

BibTeX:

@misc{poso2025ner,
title={NER annotation guidelines for archival data},
author={Poso, Venla and Välisalo, Tanja and Toivanen, Ida and Lipsanen, Mikko and Kukkohovi, Laura and Kytöaho, Roosa and Palander, Satu and Pohjola, Maiju and Laitinen, Vesa and Föhr, Atte and Abdelamir, Amir and Niemi, Joonas},
journal={JYX Digital Repository},
year={2025},
publisher={University of Jyväskylä},
url={https://urn.fi/URN:NBN:fi:jyu-202501291584}
}

How to cite the model

APA:

Toivanen, I., Poso, V., Lipsanen, M., & Välisalo, T. (2025). Developing named-entity recognition for state authority archives. In O. Holownia, & E. S. Sigurðarson (Eds.), DHNB2024 Conference Post-Proceedings (7). University of Oslo Library. Digital Humanities in the Nordic and Baltic Countries Publications. https://doi.org/10.5617/dhnbpub.12262 

BibTeX:

@article{toivanen2025developing,
title={Developing named-entity recognition for state authority archives},
author={Toivanen, Ida and Poso, Venla and Lipsanen, Mikko and Välisalo, Tanja},
journal={Digital Humanities in the Nordic and Baltic Countries Publications},
number={3},
year={2025},
publisher={University of Oslo Library},
doi={10.5617/dhnbpub.12262}
}