Hermes Bot commited on
Commit
780d3c3
·
unverified ·
1 Parent(s): 46f4d3a

fix: revert default model to Qwen/Qwen2.5-1.5B-Instruct

Browse files

Qwen 2.5 7B requires paid Inference Providers on this account.
Revert to 1.5B free-tier model; users can override via config modal.

Files changed (1) hide show
  1. shared/inference_client.py +1 -1
shared/inference_client.py CHANGED
@@ -31,7 +31,7 @@ log = logging.getLogger("inference")
31
  # The HF model id used for text generation (VibeThinker 1.5B, Gemma 4 12B, etc.)
32
  INFERENCE_MODEL = os.environ.get(
33
  "INFERENCE_MODEL",
34
- "Qwen/Qwen2.5-7B-Instruct", # 7B, strong storytelling, HF Inference compatible
35
  )
36
 
37
  # Provider: "hf-inference" (free serverless), "together", "fal-ai", "replicate"
 
31
  # The HF model id used for text generation (VibeThinker 1.5B, Gemma 4 12B, etc.)
32
  INFERENCE_MODEL = os.environ.get(
33
  "INFERENCE_MODEL",
34
+ "Qwen/Qwen2.5-1.5B-Instruct", # 1.5B, fast, free-tier friendly
35
  )
36
 
37
  # Provider: "hf-inference" (free serverless), "together", "fal-ai", "replicate"