callmeeric5
/

Qwen3B-Invoice-Receipt

Model card Files Files and versions

Qwen3B-Invoice-Receipt / README.md

callmeeric5's picture

Update README.md

246f852 verified 8 months ago

|

history blame contribute delete

1.02 kB

	---
	license: apache-2.0
	datasets:
	- mychen76/invoices-and-receipts_ocr_v1
	language:
	- en
	---


	# Intro

	The model* is fine-tuned on Qwen2.5-3B-VL using a dataset of invoices and receipts. It can be used to extract text from the input and return the output in a specified JSon format.

	*It is already merged with the LoRA layer and the original model. Be mindful of the input size to avoid a CUDA out-of-memory error.

	Here is an example notebook of [inference](infer/Inference.ipynb)

	For the LoRA params only, go to [this repo](https://huggingface.co/callmeeric5/Qwen-3B-Invoice-Receipt-LoRa/tree/main)


	# Usage:
	```python
	from transformers import AutoModelForVision2Seq, AutoProcessor, AutoTokenizer

	model = AutoModelForVision2Seq.from_pretrained(
	"callmeeric5/Qwen3B-Invoice-Receipt",
	device_map="cuda", #auto
	torch_dtype="auto"
	)
	tokenizer = AutoTokenizer.from_pretrained("callmeeric5/Qwen-3B-Invoice-Receipt")
	processor = AutoProcessor.from_pretrained("callmeeric5/Qwen-3B-Invoice-Receipt")
	```