Sharka/CIVQA_DVQA_LayoutLMv3

	---
	license: mit
	language:
	- cs
	tags:
	- document question answering
	---

	# LayoutLMv3 Model Fine-tuned on the CIVQA (Tesseract) and DVQA Datasets

	This is a fine-tuned version of the [LayoutLMv3 model](https://huggingface.co/microsoft/layoutlmv3-base). It was trained on the Czech Invoice Visual Question Answering (CIVQA) dataset, which contains invoices in the Czech language, and on the Data Visualizations via Question Answering ([DVQA](https://paperswithcode.com/dataset/dvqa)) dataset.

	The model performs Document Visual Question Answering on Czech invoices, using the existing DVQA dataset as additional training data.

	For the Czech invoices, we focused on 10 entities that are crucial for invoice processing:
	- Variable symbol
	- Specific symbol
	- Constant symbol
	- Bank code
	- Account number
	- Total amount
	- Invoice date
	- Name of supplier
	- DIČ (tax identification number)
	- QR code
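
The entities above could be queried one question at a time. The following is a minimal sketch, not the authors' documented usage. It assumes that the checkpoint loads through the `transformers` `document-question-answering` pipeline (this card does not confirm it), and that `transformers`, `torch`, `Pillow`, and `pytesseract` are installed. The question wording in `ENTITY_QUESTIONS` and the helper `extract_entities` are hypothetical; the questions in CIVQA may be phrased differently.

```python
from typing import Dict

MODEL_ID = "Sharka/CIVQA_DVQA_LayoutLMv3"  # repository name shown on this page

# Hypothetical question wording, one entry per entity listed above.
ENTITY_QUESTIONS: Dict[str, str] = {
    "variable_symbol": "What is the variable symbol?",
    "specific_symbol": "What is the specific symbol?",
    "constant_symbol": "What is the constant symbol?",
    "bank_code": "What is the bank code?",
    "account_number": "What is the account number?",
    "total_amount": "What is the total amount?",
    "invoice_date": "What is the invoice date?",
    "supplier_name": "What is the name of the supplier?",
    "dic": "What is the tax identification number (DIC)?",
    "qr_code": "What is the content of the QR code?",
}


def extract_entities(image_path: str, model_id: str = MODEL_ID) -> Dict[str, str]:
    """Ask one question per entity and keep the highest-scoring answer."""
    # Imported lazily so the question table is usable without transformers.
    from transformers import pipeline

    # Assumption: the repository ships the config files the pipeline needs.
    qa = pipeline("document-question-answering", model=model_id)

    answers: Dict[str, str] = {}
    for entity, question in ENTITY_QUESTIONS.items():
        result = qa(image=image_path, question=question)
        # Depending on the library version, the result is a dict or a list of dicts.
        top = result[0] if isinstance(result, list) and result else result
        answers[entity] = top.get("answer", "") if isinstance(top, dict) else ""
    return answers
```

Calling `extract_entities("invoice.png")` (a hypothetical file path) would return a dictionary mapping each entity key to the model's answer string. Answer quality depends on the OCR output, since the pipeline runs Tesseract on the image when no words and boxes are supplied.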

	You can find more information about this model in this [paper](https://nlp.fi.muni.cz/raslan/raslan23.pdf#page=31).