Spaces:

RayMelius
/

StockEx

Sleeping

RayMelius Claude Sonnet 4.6 commited on Feb 27

Commit

0b7d7ac

1 Parent(s): ff275e2

Fix HF inference URL routing for org/model format models

The condition '/ in m.split(\"/\")[0]' was always False (split removes
the slash), so only RayMelius/ models used the direct inference API —
all other org/model names (Qwen/, meta-llama/, etc.) hit the router
and got 400 'not supported by any provider'.

Fix: use direct inference API for any model containing a slash.
Also: replace Qwen/Qwen2.5-7B-Instruct-1M with Qwen/Qwen2.5-7B-Instruct
in the HF model list (-1M variant not available on inference API).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Files changed (1) hide show

dashboard/dashboard.py +3 -3

dashboard/dashboard.py CHANGED Viewed

@@ -46,7 +46,7 @@ GROQ_MODELS = [
 ]
 HF_MODELS = [
     "RayMelius/stockex-analyst",
-    "Qwen/Qwen2.5-7B-Instruct-1M",
     "meta-llama/Llama-3.1-8B-Instruct",
     "mistralai/Mistral-7B-Instruct-v0.3",
 ]
@@ -144,8 +144,8 @@ def _call_llm(prompt, force_provider=None, force_model=None):
         if not HF_TOKEN:
             return None, "HuggingFace not configured (HF_TOKEN not set)"
         m = model or HF_MODEL
-        # Use direct inference API for custom models, router for known public models
-        if m.startswith("RayMelius/") or "/" in m.split("/")[0]:
             url = f"https://api-inference.huggingface.co/models/{m}/v1/chat/completions"
         else:
             url = HF_URL

 ]
 HF_MODELS = [
     "RayMelius/stockex-analyst",
+    "Qwen/Qwen2.5-7B-Instruct",
     "meta-llama/Llama-3.1-8B-Instruct",
     "mistralai/Mistral-7B-Instruct-v0.3",
 ]
         if not HF_TOKEN:
             return None, "HuggingFace not configured (HF_TOKEN not set)"
         m = model or HF_MODEL
+        # Use direct inference API for any org/model format; router for bare model names
+        if "/" in m:
             url = f"https://api-inference.huggingface.co/models/{m}/v1/chat/completions"
         else:
             url = HF_URL