Commit History
fix(rag_core): cast n_ctx to int to resolve pydantic validation error 3a5a9db verified
perf(rag_core): scale context window to 16k and optimize retrieval depth (k=25) for high-fidelity summarization ced20e5 verified
feat(rag_core): add page-level metadata and fix truncated responses 4163ff8 verified
Update src/rag_core.py 6a172c2 verified
fix(rag_core): expand context window to 8k to accommodate retrieval depth 8dab6db verified
chore(rag_core): upgrade model to Llama 3.1 8B for improved analytical precision 1dbc42d verified
perf(rag_core): increase retriever k and context window for better PDF analysis 3910884 verified
fix(rag_core): add missing PromptTemplate import e91650a verified
fix(rag_core): apply native Llama 3 prompt formatting and add stop tokens to prevent hallucinations 1cb5775 verified
feat(rag_core): enable full CUDA GPU offloading for LLM inference 8ae13dd verified
fix: limit CPU threads to 6 to prevent UI starvation b4fd72c verified
feat: switch model to Llama 3.2 and establish streaming foundation 7c3e817 verified
feat: Switch local LLM to Phi-3 Mini for better resource management b5a9ca5
Lars Patzenbein committed on
fix: Correct variable names for model download 3f54c7f
Lars Patzenbein committed on
feat: Downgrade to Llama 3.2 3B for stability 7558898
Lars Patzenbein committed on
Engine Swap: Llama 3.1 8B GGUF integrated 5000880
Lars Patzenbein committed on
Update src/rag_core.py system prompt LLM answer section cf414dc verified
Fix: Groq import corrected, model switched 3af5aa1
Lars Patzenbein committed on
Fix: LangChain import paths corrected aeb2af4
Lars Patzenbein committed on
feat: add rag_core.py to the src folder b4c3ffe
Lars Patzenbein committed on