debug: improve vllm-direct endpoint to test context overflow daf3545 verified msradam committed 2 days ago
fix(warmup): fire vLLM warmup before planner so RunPod loads during planner+stones 602bc83 verified msradam committed 2 days ago
fix(sse): send keepalive comments to prevent proxy idle timeout f9c55e4 verified msradam committed 3 days ago