File size: 1,785 Bytes
5dc582a
d3d3916
 
 
5dc582a
d3d3916
5dc582a
d3d3916
 
 
5dc582a
d3d3916
 
 
5dc582a
 
d3d3916
5dc582a
d3d3916
5dc582a
d3d3916
5dc582a
d3d3916
 
 
 
 
 
5dc582a
d3d3916
5dc582a
d3d3916
 
 
 
 
 
 
 
 
 
5dc582a
d3d3916
5dc582a
d3d3916
 
 
 
 
 
 
 
5dc582a
d3d3916
5dc582a
 
 
d3d3916
5dc582a
 
 
d3d3916
5dc582a
d3d3916
5dc582a
d3d3916
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
---
language:
- es
license: apache-2.0
library_name: peft
base_model: Qwen/Qwen3-VL-4B-Instruct
tags:
- invoice-extraction
- ocr
- spanish
- lora
- vision
- finance
pipeline_tag: image-to-text
---

# diffu-0.2 — Spanish Invoice Data Extractor (Vision)

**diffu-0.2** is a fine-tuned vision-language model for structured data extraction from Spanish invoice images. Built by [V10 Labs](https://v10labs.com), it extracts supplier details, tax IDs, amounts, and dates from invoice photographs and scans.

## Performance

| Model | Accuracy | Type |
|-------|----------|------|
| **diffu-0.2 (this model)** | **93.39%** | Fine-tuned, vision |
| diffu-0.1 (V10 Labs) | 92.82% | Fine-tuned, text-only |
| Claude Sonnet 4.6 | 61.6% | Generalist, zero-shot |
| Qwen3-VL-4B (base) | 54.4% | Generalist, zero-shot |

### Per-Field Accuracy

| Field | Accuracy |
|-------|----------|
| supplier | 92.06% |
| supplier_cif | 94.12% |
| invoice_number | 91.35% |
| date | 95.33% |
| subtotal | 92.06% |
| tax_total | 89.25% |
| total | 92.99% |
| doc_type | 100.00% |

## Model Details

- **Base model**: Qwen/Qwen3-VL-4B-Instruct
- **Method**: LoRA (r=64, alpha=128)
- **Target modules**: q_proj, v_proj, k_proj, o_proj, gate_proj, up_proj, down_proj
- **Training**: 2 epochs, LR=1e-4, effective batch size 16
- **Image resolution**: 256-1280 × 28 × 28 pixels
- **Adapter size**: 504 MB
- **Peak VRAM**: 22.57 GB (training), ~10 GB (inference)
- **Parse failures**: 0%

## Output Format



## Usage



## About V10 Labs

V10 Labs builds AI-powered financial intelligence for SMBs in Spain. We train purpose-built models that outperform general-purpose LLMs on domain-specific tasks like invoice processing, accounting classification, and financial analysis.

[v10labs.com](https://v10labs.com)