perf: fully async DB - results in 5-7s, background check+save 8f86a3c GSoumyajit2005 commited on 10 days ago
perf: async DB save with duplicate check for faster extraction 4bdd01c GSoumyajit2005 commited on 10 days ago
feat: PDF preview, database integration, and improved error handling 2a944a5 GSoumyajit2005 commited on 10 days ago
refactor: remove obsolete OCR test file and enhance address extraction logic 097a95c GSoumyajit2005 commited on 11 days ago
Refactor: Replace Tesseract with DocTR and integrate LayoutLMv3-DocTR model ec0b507 GSoumyajit2005 commited on 12 days ago
feat: add AI detection overlay visualization with bounding boxes on extracted entities 343b0c3 GSoumyajit2005 commited on 13 days ago
chore: Add scaffolded database layer (not yet implemented) e8efc4f GSoumyajit2005 commited on 15 days ago
feat: Update Dockerfile and requirements for PDF processing, add new dependencies, and refactor API structure faa3050 GSoumyajit2005 commited on 15 days ago
feat: Enhance pipeline with smart PDF handling, Pydantic validation, and semantic hashing, and refactor API to src. f74e17e GSoumyajit2005 commited on Dec 20, 2025
feat: Add .dockerignore, enhance UI to display receipt number and robustly handle bill-to, and update README with an additional dataset. aa4f954 GSoumyajit2005 commited on Dec 13, 2025
feat: Implement robust OCR, and cross-platform support 5d04abb GSoumyajit2005 commited on Dec 1, 2025
feat: Add Phase 3 generalization scripts and clean up legacy files d79b7f7 GSoumyajit2005 commited on Dec 1, 2025
feat: LayoutLMv3 integration, Streamlit UI toggle, README refresh, .gitignore 42e1c04 GSoumyajit2005 commited on Nov 5, 2025
Complete Version 0.5 with Streamlit UI and full pipeline 566dc81 GSoumyajit2005 commited on Nov 2, 2025