invoice-processor-ml / src /pipeline.py

Commit History

perf: fully async DB - results in 5-7s, background check+save
8f86a3c

GSoumyajit2005 commited on

perf: async DB save with duplicate check for faster extraction
4bdd01c

GSoumyajit2005 commited on

feat: PDF preview, database integration, and improved error handling
2a944a5

GSoumyajit2005 commited on

Refactor: Replace Tesseract with DocTR and integrate LayoutLMv3-DocTR model
ec0b507

GSoumyajit2005 commited on

feat: add AI detection overlay visualization with bounding boxes on extracted entities
343b0c3

GSoumyajit2005 commited on

feat: Update Dockerfile and requirements for PDF processing, add new dependencies, and refactor API structure
faa3050

GSoumyajit2005 commited on

feat: Enhance pipeline with smart PDF handling, Pydantic validation, and semantic hashing, and refactor API to src.
f74e17e

GSoumyajit2005 commited on

feat: Add Phase 3 generalization scripts and clean up legacy files
d79b7f7

GSoumyajit2005 commited on

feat: LayoutLMv3 integration, Streamlit UI toggle, README refresh, .gitignore
42e1c04

GSoumyajit2005 commited on

Complete Version 0.5 with Streamlit UI and full pipeline
566dc81

GSoumyajit2005 commited on