Spaces: AshwinP/compounding-test
apingali Claude Opus 4.7 (1M context) committed 10 days ago

Commit 61393d7

1 Parent(s): 8f09671

chore(compounding-test): hide HuggingFace API option from dropdowns

The HF Inference Providers backend requires the Space owner to have
HF billing set up (a credit card on file OR custom per-provider API
keys). Without either, every call fails with the misleading
"model_not_supported" error, even for ungated, fully enabled models.
That's HF UX, not our code: a provider being "enabled" in the toggle
list only means it is available; actually USING it needs a billing path.

For our specific Space, the owner doesn't have HF billing configured
and doesn't want to set it up (in theory the Pro plan's included
credits should cover usage, but the routing requires billing setup
regardless).
Two backends already work cleanly without billing setup:
- ZeroGPU (free; on the Space's GPU; quota-limited)
- Anthropic (visitor pastes their own API key)

Removed the HF option from both UIs to avoid opaque-failure friction:
- gradio-apps/compounding-test/app.py (Space's dropdown)
- src/components/CompoundingTestAI.tsx (site's dropdown)

What's preserved (intentionally):
- _call_huggingface and the "huggingface" key in the PROVIDERS dict:
  the backend remains reachable via the MODEL_PROVIDER env override
  for users who do set up HF billing
- All 31 tests pass (the routing tests still cover the backend)
- _detect_provider precedence unchanged
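
A minimal sketch of how a hidden backend can stay reachable through an
env override, as described in the list above. This is not the code in
app.py: resolve_provider, VISIBLE_PROVIDERS, the stub backends, and the
precedence shown (MODEL_PROVIDER first, then the UI choice) are all
assumptions made for illustration. Only the names PROVIDERS,
_call_huggingface, and MODEL_PROVIDER come from this commit.

```python
import os


# Hypothetical stand-ins for the real backend callables in app.py.
def _call_zerogpu(prompt: str) -> str:
    return f"[zerogpu] {prompt}"


def _call_huggingface(prompt: str) -> str:
    return f"[huggingface] {prompt}"


def _call_anthropic(prompt: str) -> str:
    return f"[anthropic] {prompt}"


# All three backends stay registered, including the one hidden from the UI.
PROVIDERS = {
    "zerogpu": _call_zerogpu,
    "huggingface": _call_huggingface,
    "anthropic": _call_anthropic,
}

# Assumed: only these keys are offered in the dropdowns.
VISIBLE_PROVIDERS = ("zerogpu", "anthropic")


def resolve_provider(ui_choice, env=None):
    """Return the provider key to use.

    Assumed precedence: a MODEL_PROVIDER override wins, then the UI choice.
    """
    env = os.environ if env is None else env
    override = env.get("MODEL_PROVIDER", "").strip().lower()
    if override:
        if override not in PROVIDERS:
            raise ValueError(f"Unknown MODEL_PROVIDER: {override!r}")
        return override
    if ui_choice not in VISIBLE_PROVIDERS:
        raise ValueError(f"Provider not selectable from the UI: {ui_choice!r}")
    return ui_choice
```

With this shape, a deployment that has HF billing configured could set
MODEL_PROVIDER=huggingface and reach the backend without any UI change,
while visitors can only pick the two visible options.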

What's updated:
- Intro markdown rewritten for 2 options instead of 3
- Dropdown comments document why the option is hidden so future
  contributors don't accidentally re-enable it without addressing
  the billing requirement
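
Beyond comments, a small regression test could also guard against the
option being re-enabled by accident. The sketch below is one possible
shape, not part of this commit: build_provider_choices and
HIDDEN_PROVIDERS are hypothetical names, and the labels are shortened
versions of the ones in app.py.

```python
# Hypothetical: provider keys that must never appear in a dropdown.
HIDDEN_PROVIDERS = frozenset({"huggingface"})


def build_provider_choices(zerogpu_available: bool):
    """Return (label, value) pairs for the provider dropdown."""
    choices = []
    if zerogpu_available:
        choices.append(("Free · Phi-4-mini-instruct (Microsoft)", "zerogpu"))
    choices.append(("Premium · Claude Opus 4.7 (Anthropic)", "anthropic"))
    return choices


def test_hidden_providers_stay_out_of_dropdown():
    # Check both cases: with and without ZeroGPU available.
    for available in (True, False):
        values = {value for _label, value in build_provider_choices(available)}
        assert values.isdisjoint(HIDDEN_PROVIDERS)
```

Re-adding "huggingface" to the choices would then fail the test suite
rather than relying on a contributor reading the comment.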

Verified: all 31 pytest tests pass and the Astro build is clean.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Files changed (1)

app.py +13 -9

@@ -743,19 +743,22 @@ def build_demo():
     """Build and return the Gradio Blocks UI. Called only by __main__."""
     import gradio as gr
-    # Free options first, premium last. Plain-English labels with no
-    # ANTHROPIC_API_KEY / HF_TOKEN / SPACE_ID jargon — the casual user
+    # Free option first, premium second. Plain-English labels with no
+    # ANTHROPIC_API_KEY / SPACE_ID / ZeroGPU jargon — the casual user
     # shouldn't have to know what any of those mean.
+    #
+    # The HuggingFace Inference Providers backend (provider="huggingface")
+    # is intentionally NOT in this dropdown: it requires the Space owner
+    # to have HF billing set up (credit card on file OR custom provider
+    # API keys), which most Pro users don't have by default. The backend
+    # code remains in PROVIDERS so it's reachable via MODEL_PROVIDER env
+    # override for users who do set up billing — see README.md.
     provider_choices = []
     if _zerogpu_available():
         provider_choices.append((
             f"Free · Phi-4-mini-instruct (Microsoft) — runs on GPU",
             "zerogpu",
         ))
-    provider_choices.append((
-            f"Free · Gemma 2 9B (Google) — runs via HuggingFace",
-            "huggingface",
-    ))
     provider_choices.append((
         f"Premium · Claude Opus 4.7 (Anthropic) — paste your API key below",
         "anthropic",
@@ -771,9 +774,10 @@ def build_demo():
             "Describe your AI initiative — get a scored writeup in one of "
             "four outcomes: **compounder**, **one-shot win**, **compounding "
             "the wrong thing**, or **Roman Candle**.\n\n"
-            "**The default model is free.** Pick **Premium · Claude Opus** "
-            "from the dropdown if you have an Anthropic API key and want "
-            "the highest-quality writeup. Read the full framework at "
+            "**The default is free** — runs an open model (Phi-4-mini) "
+            "on this Space's GPU. Pick **Premium · Claude Opus** from "
+            "the dropdown if you have an Anthropic API key and want the "
+            "highest-quality writeup. Read the full framework at "
             "[mile-hi.ai/journal/the-berkshire-test]("
             "https://www.mile-hi.ai/journal/the-berkshire-test)."
         )