feat: drop bartowski Q4_K_M Gemma preset (duplicate of unsloth on ZeroGPU) e183c0c verified scvcoder commited on May 2
feat: swap bartowski Q4_0 → unsloth UD-Q4_K_XL, set as default d8d25fa verified scvcoder commited on May 2
ui: sort cards with explicit (근거N) chip to the top of references panel a6771b6 verified scvcoder commited on May 2
ui: simplify references panel — drop 'adopted' tier, keep 'LLM 전달' + (근거N) chip 18ad465 verified scvcoder commited on May 2
ui: drop model selector from split-view references panel (redundant w/ Open WebUI dropdown) 3630faa verified scvcoder commited on May 2
feat: add Gemma 4 E2B Q4_0 preset (Metal-friendly 4-bit variant) 2b7fc6c verified scvcoder commited on May 2
Listen for kpaa-route postMessage from Open WebUI iframe -> auto clear refs f0077b3 verified scvcoder commited on May 2
Auto-clear _last_refs on / and /split page load (page reload starts fresh) 0b5cf16 verified scvcoder commited on May 2
Auto-clear refs on new chat (no history + non-meta first user message) ffc58e0 verified scvcoder commited on May 2
Serve split HTML directly at / (no redirect) — overrides Gradio's / 5c07da5 verified scvcoder commited on May 2
Redirect / -> /split (insert route at index 0 to override Gradio's /) 3264ba1 verified scvcoder commited on May 2
Add /split route — Open WebUI iframe + 참고자료 polling layout 00d343f verified scvcoder commited on May 2
Use generator pattern (cross-process yield via spaces res_queue) — ZeroGPU forks process so streamer queue isn't shared 4f8a16e verified scvcoder commited on May 2
Force streamer.end() in finally — transformers 5.x doesn't always auto-call it f160777 verified scvcoder commited on May 2
Hybrid: demo.launch() (ZeroGPU works) + post-launch attach KPAA /v1 routes to demo.app 617c061 verified scvcoder commited on May 2
MINIMAL TEST: Gradio + @spaces.GPU + demo.launch (HF ZeroGPU canonical pattern) 1a36a19 verified scvcoder commited on May 2
Fix diag import path (spaces.config not spaces.zero.config); SPACES_ZERO_GPU=1 confirmed f22d865 verified scvcoder commited on May 2
Diagnostic: print SPACES_ZERO_GPU env + canary marking status 25e54de verified scvcoder commited on May 2
Use mount_gradio_app(ssr_mode=False) — explicit param overrides HF env var to skip Node port bind f96e574 verified scvcoder commited on May 2
Manual Gradio sub-app mount (skip startup hook that binds 7861) 6d12eeb verified scvcoder commited on May 2
Re-add Gradio Blocks + mount, with GRADIO_SSR_MODE=False to avoid port conflict c6b8960 verified scvcoder commited on May 2
Drop Gradio mount entirely; pure FastAPI on uvicorn (avoids SSR port conflict) 264396f verified scvcoder commited on May 2
Hide demo inside main() to avoid HF auto-launch port conflict cce074d verified scvcoder commited on May 2
Initial backend code: src/kpaa, runtime data, requirements 94f1300 verified scvcoder commited on May 2