Improve RAG: query expansion, confidence scoring, graceful 3-tier fallback, better prompting 865c43f verified Rofati commited on 26 days ago
Fix: use OpenAI client with HF router URL (no provider param needed, works with any version)" 602129d verified Rofati commited on May 27
Switch to HF Inference API: Qwen3-32B via InferenceClient (same architecture, faster)" 681d277 verified Rofati commited on May 27