Spaces:
Sleeping
Sleeping
Hermes Bot commited on
fix: revert default model to Qwen/Qwen2.5-1.5B-Instruct
Browse filesQwen 2.5 7B requires paid Inference Providers on this account.
Revert to 1.5B free-tier model; users can override via config modal.
shared/inference_client.py
CHANGED
|
@@ -31,7 +31,7 @@ log = logging.getLogger("inference")
|
|
| 31 |
# The HF model id used for text generation (VibeThinker 1.5B, Gemma 4 12B, etc.)
|
| 32 |
INFERENCE_MODEL = os.environ.get(
|
| 33 |
"INFERENCE_MODEL",
|
| 34 |
-
"Qwen/Qwen2.5-
|
| 35 |
)
|
| 36 |
|
| 37 |
# Provider: "hf-inference" (free serverless), "together", "fal-ai", "replicate"
|
|
|
|
| 31 |
# The HF model id used for text generation (VibeThinker 1.5B, Gemma 4 12B, etc.)
|
| 32 |
INFERENCE_MODEL = os.environ.get(
|
| 33 |
"INFERENCE_MODEL",
|
| 34 |
+
"Qwen/Qwen2.5-1.5B-Instruct", # 1.5B, fast, free-tier friendly
|
| 35 |
)
|
| 36 |
|
| 37 |
# Provider: "hf-inference" (free serverless), "together", "fal-ai", "replicate"
|