Commit History
feat: switch architecture.svg and upload pdfi-uml.svg 787f64f verified
Delete assets/architecture.svg 52a4815 verified
Upload pdfi-uml.svg f98b6b9 verified
Update README.md 1d83dbb verified
Update README.md d631fff verified
fix(ui): correct invalid parameter page_favicon to page_icon in app.py 5bfb507 verified
chore(assets): remove deprecated PNG favicon ebb9472 verified
fix(ui): change page_icon reference from svg_path to favicon_path a561115 verified
docs: add static architecture diagram (architekture.svg) for fallback rendering 35534da verified
docs: replace failing mermaid block in README with html image tag f5d900e verified
docs: README.md aktualisiert, Mermaid-Code durch stabiles HTML-Bild-Tag ersetzt 3489bdc verified
docs: add static architecture diagram (architecture.svg) for fallback rendering 67cdb9f verified
refactor(ui): update page_config to reference new favicon-pdfi.svg file d27b4cb verified
replace blurry PNG favicon with crisp SVG vector graphic for optimal scaling 8ba2248 verified
docs(readme): add mermaid architecture graph, local setup instructions and UI feature list c8d224d verified
refactor/app.py) update model name from llama 3.2 3B instruct to llama 3.1 8B 81a899a verified
style: add custom SVG branding and page configuration to Streamlit UI 79e03f7 verified
refactor: clean up LlamaCpp initialization parameters to meet Python syntax standards 1f76485 verified
fix(rag_core): cast n_ctx to int to resolve pydantic validation error 3a5a9db verified
perf(rag_core): scale context window to 16k and optimize retrieval depth (k=25) for high-fidelity summarization ced20e5 verified
feat(rag_core): add page-level metadata and fix truncated responses 4163ff8 verified
Update src/rag_core.py 6a172c2 verified
fix(rag_core): expand context window to 8k to accommodate retrieval depth 8dab6db verified
chore(rag_core): upgrade model to Llama 3.1 8B for improved analytical precision 1dbc42d verified
perf(rag_core): increase retriever k and context window for better PDF analysis 3910884 verified
fix(ui): resolve NameError by removing st_callback and implementing st.write_stream fa0e96c verified
fix(rag_core): add missing PromtTemplate import e91650a verified
fix(ui): remove deprecated StreamlitCallbackHandler and use str.write_stream 12168c5 verified
fix(rag_core): apply native Llama 3 prompt formatting and add stop tokens to prevent hallucinations 1cb5775 verified
fix(docker): replace CPU wheel with official CUDA wheel for GPU inference 3cbdf5b verified
fix: restore previous requirements state 12b2195 verified
ix(docker): switch to official CUDA wheel for llama-cpp-python 2bc7531 verified
feat(rag_core): enable full CUDA GPU offloading for LLM inference 8ae13dd verified
fix: limit CPU threads to 6 to prevent UI starvation b4fd72c verified
feat: implement live streaming callback and update UI for Llama 3.2 0d73716 verified
feat: switch model to Llama 3.2 and establish streaming foundation 7c3e817 verified
fix(docker): install prebuilt llama-cpp-python wheel directly to prevent build timeout 5dc774a verified
fix: relocate llama-cpp-python to Dockerfile to prevent build timeout 2333e4f verified
feat: re-add compilers to natively build llama-cpp-python from source f2c9b64
Lars Patzenbein commited on
chore: optimize Dockerfile comments and configure persistent HF cache 742a1b1
Lars Patzenbein commited on
fix: Implement pre-built manylinux wheels f9c0e47
Lars Patzenbein commited on
feat: Switch local LLM to Phi-3 Mini for better resource management b5a9ca5
Lars Patzenbein commited on
fix: Correct variable names for model download 3f54c7f
Lars Patzenbein commited on
feat: Downgrade to Llama 3.2 3B for stability 7558898
Lars Patzenbein commited on
Fix: Force compile from source with --no-binary for llama-cpp-python dac0332
Lars Patzenbein commited on
Fix cache bug: Introduce cache buster to force fresh cmake build 1d273a6
Lars Patzenbein commited on
Fix libc.musl error: Build llama-cpp natively with cmake for Debian 46459b1
Lars Patzenbein commited on
Fix: Force pre-compiled binary for llama-cpp-python 2ab41a9
Lars Patzenbein commited on
Fix: Downgrade Base-Image to Python 3.10 in Dockerfile 5b960e0
Lars Patzenbein commited on