Commit History

perf: fully async DB - results in 5-7s, background check+save
8f86a3c

GSoumyajit2005 committed on

perf: async DB save with duplicate check for faster extraction
4bdd01c

GSoumyajit2005 committed on

feat: PDF preview, database integration, and improved error handling
2a944a5

GSoumyajit2005 committed on

refactor: remove obsolete OCR test file and enhance address extraction logic
097a95c

GSoumyajit2005 committed on

Refactor: Replace Tesseract with DocTR and integrate LayoutLMv3-DocTR model
ec0b507

GSoumyajit2005 committed on

feat: add AI detection overlay visualization with bounding boxes on extracted entities
343b0c3

GSoumyajit2005 committed on

chore: Add scaffolded database layer (not yet implemented)
e8efc4f

GSoumyajit2005 committed on

Restore full history with LFS images and all fixes
7630bcd

GSoumyajit2005 committed on

feat: Update Dockerfile and requirements for PDF processing, add new dependencies, and refactor API structure
faa3050

GSoumyajit2005 committed on

feat: Enhance pipeline with smart PDF handling, Pydantic validation, and semantic hashing, and refactor API to src.
f74e17e

GSoumyajit2005 committed on

refactor: Reorganize project structure
4768ab6

GSoumyajit2005 committed on

feat: Add .dockerignore, enhance UI to display receipt number and robustly handle bill-to, and update README with an additional dataset.
aa4f954

GSoumyajit2005 committed on

Updated Extraction Logic for more Robustness
b99270c

GSoumyajit2005 committed on

feat: Implement robust OCR, and cross-platform support
5d04abb

GSoumyajit2005 committed on

feat: Add Phase 3 generalization scripts and clean up legacy files
d79b7f7

GSoumyajit2005 committed on

feat: LayoutLMv3 integration, Streamlit UI toggle, README refresh, .gitignore
42e1c04

GSoumyajit2005 committed on

Complete Version 0.5 with Streamlit UI and full pipeline
566dc81

GSoumyajit2005 committed on