⚠️Warning

Donut German Receipts

Fine-tuned Donut model for German receipts.

Model Details

  • Base model: naver-clova-ix/donut-base
  • Dataset:
    • German receipts from a small private dataset (473 images)
  • Training:
    • Training set: 435 images
    • Test/validation set: 38 images
      • Accuracy on supermarket receipts is good (~70% by exact string comparison)
      • The dataset contains few lesser-known stores, so result quality varies for them
    • Image size used: 960x1280
    • Epochs: 3
    • Learning rate: 2e-5
    • Batch size: 2
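The accuracy figure above is described as an exact string comparison. A minimal sketch of such a metric follows. It assumes predictions and references are dictionaries with the four fields from the output schema; the function name `exact_match` and the choice to report both per-field and whole-receipt scores are illustrative and not taken from the original evaluation code.

```python
FIELDS = ("store", "date", "address", "total")


def exact_match(predictions, references, fields=FIELDS):
    """Return per-field and whole-receipt exact-match rates.

    A field counts as correct only if the predicted string equals the
    reference string character for character (no normalization).
    """
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have equal length")
    if not references:
        raise ValueError("need at least one receipt to score")

    per_field = {field: 0 for field in fields}
    all_correct = 0
    for pred, ref in zip(predictions, references):
        hits = [pred.get(field) == ref.get(field) for field in fields]
        for field, hit in zip(fields, hits):
            per_field[field] += hit
        all_correct += all(hits)

    n = len(references)
    scores = {field: count / n for field, count in per_field.items()}
    scores["all_fields"] = all_correct / n
    return scores
```

Because the comparison is strict, a prediction such as "39,9" against "39,99" counts as a miss, which makes this metric a conservative estimate of usable output.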

Input Example

bon144
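A receipt image like the one above could be passed through the model roughly as follows. This is a sketch using the standard Donut classes from Transformers, not the author's own inference script. It assumes the repository ID `Philipp92/donut-base-german-receipts`, and it assumes that the checkpoint's config already stores the start token the decoder was fine-tuned with; if that is not the case, the (unknown) task prompt has to be supplied through the hypothetical `task_prompt` argument. The helper names `clean_sequence` and `extract_receipt` are illustrative.

```python
import re
from typing import Optional

MODEL_ID = "Philipp92/donut-base-german-receipts"  # assumed repository ID


def clean_sequence(sequence: str, eos_token: str, pad_token: str) -> str:
    """Strip end/padding tokens and the first tag (the task start token)."""
    sequence = sequence.replace(eos_token, "").replace(pad_token, "")
    return re.sub(r"<.*?>", "", sequence, count=1).strip()


def extract_receipt(image_path: str, task_prompt: Optional[str] = None) -> dict:
    # Heavy dependencies are imported lazily so clean_sequence works without them.
    import torch
    from PIL import Image
    from transformers import DonutProcessor, VisionEncoderDecoderModel

    processor = DonutProcessor.from_pretrained(MODEL_ID)
    model = VisionEncoderDecoderModel.from_pretrained(MODEL_ID)
    model.eval()

    image = Image.open(image_path).convert("RGB")
    pixel_values = processor(image, return_tensors="pt").pixel_values

    generate_kwargs = dict(
        max_length=model.decoder.config.max_position_embeddings,
        pad_token_id=processor.tokenizer.pad_token_id,
        eos_token_id=processor.tokenizer.eos_token_id,
        bad_words_ids=[[processor.tokenizer.unk_token_id]],
        use_cache=True,
    )
    if task_prompt is not None:
        # ASSUMPTION: only needed if the config lacks the fine-tuning start token.
        generate_kwargs["decoder_input_ids"] = processor.tokenizer(
            task_prompt, add_special_tokens=False, return_tensors="pt"
        ).input_ids

    with torch.no_grad():
        output_ids = model.generate(pixel_values, **generate_kwargs)

    sequence = processor.batch_decode(output_ids)[0]
    sequence = clean_sequence(
        sequence, processor.tokenizer.eos_token, processor.tokenizer.pad_token
    )
    # token2json turns Donut's tag sequence into a nested dict.
    return processor.token2json(sequence)
```

Calling `extract_receipt("receipt.jpg")` (the file name is a placeholder) should return a dictionary with the fields listed under Output Schema, provided the assumptions above hold for this checkpoint.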

Output Schema

{
  "store": "C & A Geretsried",
  "date": "26.07.2025",
  "address": "Karl-Lederer-Platz 15",
  "total": "39,99"
}
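All values in the output are strings in German notation: the date uses day.month.year and the total uses a decimal comma. For downstream use they usually need converting. The sketch below shows one way to do that with the standard library; the function name `parse_receipt` is illustrative, and treating "." in the total as a thousands separator is an assumption, since the example contains none.

```python
from datetime import date, datetime
from decimal import Decimal


def parse_receipt(raw: dict) -> dict:
    """Convert the string fields of a model output into typed values."""
    parsed = dict(raw)
    # "26.07.2025" -> date(2025, 7, 26)
    parsed["date"] = datetime.strptime(raw["date"], "%d.%m.%Y").date()
    # ASSUMPTION: "." groups thousands and "," marks decimals ("1.234,56").
    normalized = raw["total"].replace(".", "").replace(",", ".")
    parsed["total"] = Decimal(normalized)
    return parsed


example = {
    "store": "C & A Geretsried",
    "date": "26.07.2025",
    "address": "Karl-Lederer-Platz 15",
    "total": "39,99",
}
receipt = parse_receipt(example)
```

`Decimal` is used instead of `float` so that monetary amounts are not subject to binary rounding. Both `strptime` and `Decimal` raise an exception on malformed input, which is a convenient way to flag unreliable predictions.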

Framework versions

  • Transformers 4.27.4
  • PyTorch 2.7.1
  • Datasets 4.0.0
  • Tokenizers 0.13.3