Kansallisarkisto
/

multicentury-htr-model

@@ -6,6 +6,8 @@ language:
 metrics:
 - cer
 pipeline_tag: image-to-text
 ---
 # Model description
@@ -21,7 +23,7 @@ pipeline_tag: image-to-text
 **License:** Apache 2.0
-This model is a fine-tuned version of the microsoft/trocr-large-handwritten model, specialized for recognizing handwritten text. It has been trained on various dataset from 17th to 20th centuries and can be used for applications such as document digitization, form recognition, or any task involving handwritten text extraction.
 # Model Architecture
@@ -39,15 +41,15 @@ This model is designed for handwritten text recognition and is intended for use
 # Training data
-The training datasetincludes more than 760 000 samples of handwritten text rows, covering a wide variety of handwriting styles and text samples.
 # Evaluation
 The model was evaluated on test dataset. Below are key metrics:
-**Character Error Rate (CER):** 3.2
-**Test Dataset Description:** size ~94 900 text rows
 # Used Hyperparameters
@@ -55,11 +57,9 @@ The model was evaluated on test dataset. Below are key metrics:
 **Train batch size per device:** 16
-**Learning rate:** 1e-5
-**Scheduler:** linear
-**Warmup steps:** 500
 **Optimizer:** AdamW
@@ -69,6 +69,8 @@ The model was evaluated on test dataset. Below are key metrics:
 **Half precision backend:** cuda_amp
 # How to Use the Model
@@ -110,13 +112,13 @@ Potential improvements for this model include:
 If you use this model in your work, please cite it as:
-@misc{multicentury_htr_model_2024,
   author = {Kansallisarkisto},
   title = {Multicentury HTR Model: Handwritten Text Recognition},
-  year = {2024},
   publisher = {Hugging Face},
@@ -127,4 +129,4 @@ If you use this model in your work, please cite it as:
 ## Model Card Authors
 Author: Kansallisarkisto
-Contact Information: riikka.marttila@kansallisarkisto.fi, ilkka.jokipii@kansallisarkisto.fi

 metrics:
 - cer
 pipeline_tag: image-to-text
+base_model:
+- microsoft/trocr-large-handwritten
 ---
 # Model description
 **License:** Apache 2.0
+This model is a fine-tuned version of the microsoft/trocr-large-handwritten model, specialized for recognizing handwritten text. It has been trained on various dataset from 16th to 20th centuries and can be used for applications such as document digitization, form recognition, or any task involving handwritten text extraction.
 # Model Architecture
 # Training data
+The training dataset includes more than 913 000 samples of handwritten and typewritten text rows, covering a wide variety of handwriting styles and text samples.
 # Evaluation
 The model was evaluated on test dataset. Below are key metrics:
+**Character Error Rate (CER):** 2.8
+**Test Dataset Description:** size ~111 800 text rows
 # Used Hyperparameters
 **Train batch size per device:** 16
+**Learning rate:** 12.2e-5
+**Scheduler:** polynomial
 **Optimizer:** AdamW
 **Half precision backend:** cuda_amp
+**Input image size:** 192 x 1024
 # How to Use the Model
 If you use this model in your work, please cite it as:
+@misc{multicentury_htr_model_202509,
   author = {Kansallisarkisto},
   title = {Multicentury HTR Model: Handwritten Text Recognition},
+  year = {2025},
   publisher = {Hugging Face},
 ## Model Card Authors
 Author: Kansallisarkisto
+Contact Information: mikko.lipsanen@kansallisarkisto.fi, ilkka.jokipii@kansallisarkisto.fi