Revert active model to Qwen3-8B (MiniCPM incompatible with this env)
MiniCPM4.1-8B's bundled code targets transformers 4.56, but Gradio 6.18 forces
huggingface-hub>=1.0 and thus transformers 5.x, which refactored the internals
that 4.56-era remote code relies on (the is_torch_fx_available shim only covers
the first of many breakages). Qwen3-8B is natively supported in transformers 5.x
with no remote code, so it loads cleanly in this Space. The shim is kept because
it is harmless.
Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>
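
The dependency chain described in the message (Gradio 6.18, then huggingface-hub>=1.0, then transformers 5.x) could in principle be turned into an explicit guard. The following is only a sketch and is not part of this commit: `major_version` and `choose_model_id` are hypothetical helpers, and treating every 4.x release as MiniCPM-compatible is an assumption (the message only names 4.56).

```python
import re


def major_version(version: str) -> int:
    """Return the leading major number of a version string such as '5.0.0'."""
    match = re.match(r"\s*v?(\d+)", version)
    if match is None:
        raise ValueError(f"cannot parse version: {version!r}")
    return int(match.group(1))


def choose_model_id(transformers_version: str) -> str:
    """Hypothetical helper: pick a model id for the given transformers release."""
    if major_version(transformers_version) >= 5:
        # Natively supported in transformers 5.x, so no remote code is needed.
        return "Qwen/Qwen3-8B"
    # Assumption: the 4.56-era remote code still loads on a 4.x install.
    return "openbmb/MiniCPM4.1-8B"
```

In a running Space the version string could come from `importlib.metadata.version("transformers")`; the commit itself simply hard-codes `MODEL_ID` instead.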
app.py CHANGED
@@ -43,13 +43,17 @@ from shards import SHARD_ORDER, SHARDS
 
 # ---------------------------------------------------------------------------- #
 # Model — keep the id in one easy-to-swap constant.
-# Active:
-#
+# Active: Qwen/Qwen3-8B (natively supported by transformers 5.x; no
+# remote code; works with this Gradio 6.18 env)
 # Step up: Qwen/Qwen3-14B (if replies feel thin and compute allows)
-#
-#
+# Note: openbmb/MiniCPM4.1-8B (the OpenBMB sponsor-prize model) is NOT usable
+# here — its bundled modeling code targets transformers 4.56, but Gradio 6.18
+# forces huggingface-hub>=1.0 and therefore transformers 5.x, which refactored
+# the internals that 4.56-era remote code depends on. Running it would require
+# downgrading Gradio. The is_torch_fx_available shim above is kept harmlessly in
+# case a future, 5.x-compatible MiniCPM revision is published.
 # ---------------------------------------------------------------------------- #
-MODEL_ID = "
+MODEL_ID = "Qwen/Qwen3-8B"
 
 MODEL = None
 TOKENIZER = None
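
The `is_torch_fx_available` shim referenced in the comment is not shown in this hunk. As a rough illustration of how such a compatibility shim is commonly written, the sketch below restores a missing attribute on a module without overwriting an existing one. `ensure_attr` and `fake_import_utils` are hypothetical names, and which module the real shim patches is not shown in this diff.

```python
from types import SimpleNamespace


def ensure_attr(module, name, fallback):
    """Attach `fallback` as `module.<name>` only if the attribute is missing."""
    if not hasattr(module, name):
        setattr(module, name, fallback)
    return getattr(module, name)


# Stand-in for a library module that dropped a helper in a newer release.
fake_import_utils = SimpleNamespace()

# Older remote code may still call this helper; return a conservative False.
ensure_attr(fake_import_utils, "is_torch_fx_available", lambda: False)
```

A shim like this only papers over one missing name, which is consistent with the commit message's point that it covers just the first of many breakages.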