Speed up: eager attn + KV cache; drop chat retries to 1; remove MiniCPM-o voice UI artifact 3326f32 verified unity4ar commited on 17 days ago
Log + surface RuntimeErrors from witness chat too (still 503 for those) a5ce744 verified unity4ar commited on 17 days ago
Surface witness chat failures with traceback + error class in 500 detail 5b4e454 verified unity4ar commited on 17 days ago
Surface zerogpu backend load_error / load detail in setup status 47c194f verified unity4ar commited on 17 days ago
Eagerly import zerogpu_backend on Spaces so @spaces.GPU is registered before startup scan 50761af verified unity4ar commited on 17 days ago
Force demo.launch to bind 0.0.0.0:$PORT on HF Spaces (CLI hot-reload ignores GRADIO_SERVER_NAME) 4f670ea verified unity4ar commited on 17 days ago
Expose `demo` at module scope so Gradio SDK runner can launch the gr.Server app 5cb944e verified unity4ar commited on 17 days ago
Gate setup/llama subprocess paths behind provider check; allow zerogpu_transformers 50a467b verified unity4ar commited on 17 days ago