Personal Experience

#13
by fluxnad - opened

Personal thought. PaddleOCR-VL does a great job on text recognition. What I noticed in complex tables is a cell detection issue. When the table relies on alignment and spacing instead of clear cell borders, the model sometimes merges cells or assigns values to the wrong column. In my example, subtotal rows like “S/Total” lose the correct column alignment, and at times a whole column region gets treated as one cell when the structure is not clearly labeled and it droped the values of 2020

this a part of a table
Screenshot 2026-02-15 at 2.04.28 PM

and this is the output of the ocr

Screenshot 2026-02-15 at 2.13.06 PM

heres another exemple
Screenshot 2026-02-15 at 2.38.04 PM

and heres the output
Screenshot 2026-02-15 at 2.37.29 PM

and another one:
Screenshot 2026-02-15 at 2.41.50 PM

output:
Screenshot 2026-02-15 at 2.40.54 PM

by the way i set it the use_doc_unwarping=False to refer to the original image

Sign up or log in to comment