| | --- |
| | license: apache-2.0 |
| | datasets: |
| | - mychen76/invoices-and-receipts_ocr_v1 |
| | language: |
| | - en |
| | --- |
| | |
| |
|
| | # Intro |
| |
|
| | The model* is fine-tuned on Qwen2.5-3B-VL using a dataset of invoices and receipts. It can be used to extract text from the input and return the output in a specified JSon format. |
| |
|
| | *It is already merged with the LoRA layer and the original model. Be mindful of the input size to avoid a CUDA out-of-memory error. |
| | |
| | Here is an example notebook of [inference](infer/Inference.ipynb) |
| | |
| | For the LoRA params only, go to [this repo](https://huggingface.co/callmeeric5/Qwen-3B-Invoice-Receipt-LoRa/tree/main) |
| | |
| | |
| | # Usage: |
| | ```python |
| | from transformers import AutoModelForVision2Seq, AutoProcessor, AutoTokenizer |
| | |
| | model = AutoModelForVision2Seq.from_pretrained( |
| | "callmeeric5/Qwen3B-Invoice-Receipt", |
| | device_map="cuda", #auto |
| | torch_dtype="auto" |
| | ) |
| | tokenizer = AutoTokenizer.from_pretrained("callmeeric5/Qwen-3B-Invoice-Receipt") |
| | processor = AutoProcessor.from_pretrained("callmeeric5/Qwen-3B-Invoice-Receipt") |
| | ``` |