docs: update README — 4-tier routing, Groq LLM, integrated Prometheus, correct cost saving 96% 704951f ninditya commited on 17 days ago
fix: add missing httpx import to inference.py (Grafana push NameError) 149688b ninditya commited on 17 days ago
fix: add conversational_handler to frontend routing config and badge CSS 9e1c7f1 ninditya commited on 17 days ago
fix: add REGISTRY to prometheus_client imports (Grafana push NameError) 2c066b2 ninditya commited on 17 days ago
feat: add conversational pre-filter (tier 0) for greetings and farewells 844f7f7 ninditya commited on 17 days ago
fix: skip intent context hint for very low confidence LLM calls c257aaa ninditya commited on 17 days ago
feat: add Groq provider to LLM client (free, no billing required) 2f33b1d ninditya commited on 17 days ago
fix: replace prometheus-remote-write (requires py3.11) with manual protobuf+snappy bb8fb46 ninditya commited on 17 days ago
feat: add /metrics endpoint for production monitoring on HF Spaces 471fac2 ninditya commited on 17 days ago
fix: correct Banking77 intent names mapping and add RAG for template_handler 26727cc ninditya commited on 17 days ago