Hybrid RAG: BM25+Dense (sqlite-vec/BGE-M3) + cross-encoder reranker (bge-reranker-v2-m3) ec0742c verified scvcoder commited on 30 days ago
Hybrid RAG: BM25+Dense (sqlite-vec/BGE-M3) + cross-encoder reranker (bge-reranker-v2-m3) ce557af verified scvcoder commited on 30 days ago
Hybrid RAG: BM25+Dense (sqlite-vec/BGE-M3) + cross-encoder reranker (bge-reranker-v2-m3) 49b0a74 verified scvcoder commited on 30 days ago
Hybrid RAG: BM25+Dense (sqlite-vec/BGE-M3) + cross-encoder reranker (bge-reranker-v2-m3) f64a4c2 verified scvcoder commited on 30 days ago
Hybrid RAG: BM25+Dense (sqlite-vec/BGE-M3) + cross-encoder reranker (bge-reranker-v2-m3) 0e6bb26 verified scvcoder commited on 30 days ago
Hybrid RAG: BM25+Dense (sqlite-vec/BGE-M3) + cross-encoder reranker (bge-reranker-v2-m3) b3650c2 verified scvcoder commited on 30 days ago
Hybrid RAG: BM25+Dense (sqlite-vec/BGE-M3) + cross-encoder reranker (bge-reranker-v2-m3) cd6cb3c verified scvcoder commited on 30 days ago
Hybrid RAG: BM25+Dense (sqlite-vec/BGE-M3) + cross-encoder reranker (bge-reranker-v2-m3) ca5c4c2 verified scvcoder commited on 30 days ago
Retriever: RRF로 키워드+원본 질문 결합 — LLM이 핵심 주제어 누락(e.g. '처방전 보관기간' → ['보관기간']) 해도 원본 질문이 안전망. cases·guides 동일 적용 8863f87 verified scvcoder commited on 30 days ago
LLM router prompt: 핵심 주제어 보존 규칙 추가 — '처방전 보관기간' 같은 짧은 질문에서 주제어(처방전) 누락 시 CCTV 등 다른 도메인이 hit되는 문제 수정 ca27852 verified scvcoder commited on 30 days ago
feat: switch to Unsloth UD Q2/Q3/Q4 lineup; drop Qwen presets b8a2767 verified scvcoder commited on May 2
feat: drop bartowski Q4_K_M Gemma preset (duplicate of unsloth on ZeroGPU) e183c0c verified scvcoder commited on May 2
feat: swap bartowski Q4_0 → unsloth UD-Q4_K_XL, set as default d8d25fa verified scvcoder commited on May 2
ui: sort cards with explicit (근거N) chip to the top of references panel a6771b6 verified scvcoder commited on May 2
ui: simplify references panel — drop 'adopted' tier, keep 'LLM 전달' + (근거N) chip 18ad465 verified scvcoder commited on May 2
ui: drop model selector from split-view references panel (redundant w/ Open WebUI dropdown) 3630faa verified scvcoder commited on May 2
feat: add Gemma 4 E2B Q4_0 preset (Metal-friendly 4-bit variant) 2b7fc6c verified scvcoder commited on May 2
Listen for kpaa-route postMessage from Open WebUI iframe -> auto clear refs f0077b3 verified scvcoder commited on May 2
Auto-clear refs on new chat (no history + non-meta first user message) ffc58e0 verified scvcoder commited on May 2
Use generator pattern (cross-process yield via spaces res_queue) — ZeroGPU forks process so streamer queue isn't shared 4f8a16e verified scvcoder commited on May 2
Force streamer.end() in finally — transformers 5.x doesn't always auto-call it f160777 verified scvcoder commited on May 2
Initial backend code: src/kpaa, runtime data, requirements 94f1300 verified scvcoder commited on May 2