Maxluria Claude Opus 4.8 commited on
Commit
52f8645
·
1 Parent(s): 7b3f3c2

Revert active model to Qwen3-8B (MiniCPM incompatible with this env)

Browse files

MiniCPM4.1-8B's bundled code targets transformers 4.56, but Gradio 6.18 forces
huggingface-hub>=1.0 and thus transformers 5.x, which refactored the internals
that 4.56-era remote code relies on (the is_torch_fx_available shim only covers
the first of many breakages). Qwen3-8B is natively supported in transformers 5.x
with no remote code, so it loads cleanly in this Space. Shim kept harmlessly.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

Files changed (1) hide show
  1. app.py +9 -5
app.py CHANGED
@@ -43,13 +43,17 @@ from shards import SHARD_ORDER, SHARDS
43
 
44
  # ---------------------------------------------------------------------------- #
45
  # Model — keep the id in one easy-to-swap constant.
46
- # Active: openbmb/MiniCPM4.1-8B (<=32B; qualifies for the OpenBMB sponsor prize)
47
- # Swap-in: Qwen/Qwen3-8B (strong dialogue, ZeroGPU-friendly, LoRA-ready)
48
  # Step up: Qwen/Qwen3-14B (if replies feel thin and compute allows)
49
- # Both MiniCPM4.1 and Qwen3 are hybrid reasoning models that honor
50
- # apply_chat_template(enable_thinking=False) for snappy, non-thinking replies.
 
 
 
 
51
  # ---------------------------------------------------------------------------- #
52
- MODEL_ID = "openbmb/MiniCPM4.1-8B"
53
 
54
  MODEL = None
55
  TOKENIZER = None
 
43
 
44
  # ---------------------------------------------------------------------------- #
45
  # Model — keep the id in one easy-to-swap constant.
46
+ # Active: Qwen/Qwen3-8B (natively supported by transformers 5.x; no
47
+ # remote code; works with this Gradio 6.18 env)
48
  # Step up: Qwen/Qwen3-14B (if replies feel thin and compute allows)
49
+ # Note: openbmb/MiniCPM4.1-8B (the OpenBMB sponsor-prize model) is NOT usable
50
+ # here its bundled modeling code targets transformers 4.56, but Gradio 6.18
51
+ # forces huggingface-hub>=1.0 and therefore transformers 5.x, which refactored
52
+ # the internals that 4.56-era remote code depends on. Running it would require
53
+ # downgrading Gradio. The is_torch_fx_available shim above is kept harmlessly in
54
+ # case a future, 5.x-compatible MiniCPM revision is published.
55
  # ---------------------------------------------------------------------------- #
56
+ MODEL_ID = "Qwen/Qwen3-8B"
57
 
58
  MODEL = None
59
  TOKENIZER = None