Spaces:

pollen-robotics
/

Reachy_Mini

Running

App Files Files Community

specify-compute-module

#11

by FabienDanieau - opened Mar 18

base: refs/heads/main

←

from: refs/pr/11

Discussion Files changed

+125

-2071

Files changed (12) hide show

.env.example +0 -35
.gitignore +0 -2
docs/APP_ICON_CONVENTION.md +0 -113
scripts/evaluate-prompt-v2.py +0 -445
server/categories.js +0 -189
server/categorize.js +0 -426
server/categoryCache.js +0 -290
server/index.js +2 -350
src/pages/Buy.jsx +6 -6
src/pages/Download.jsx +79 -122
src/pages/GettingStarted.jsx +36 -91
src/pages/Home.jsx +2 -2

.env.example DELETED Viewed

@@ -1,35 +0,0 @@
-# Reachy Mini Website server env vars
-#
-# Copy this file to `.env` and fill in the values for local dev.
-# In production (HF Space), set these from the Space's "Settings →
-# Variables and secrets" panel, NOT from a committed `.env`.
-# (`.env` is gitignored.)
-# -----------------------------------------------------------------------------
-# Server
-# -----------------------------------------------------------------------------
-# Port the Express server listens on. Defaults to 7860 (HF Space convention).
-# PORT=7860
-# -----------------------------------------------------------------------------
-# OAuth (used by /api/oauth-config and the in-iframe sign-in flow)
-# -----------------------------------------------------------------------------
-# Set in the Space when `hf_oauth: true` is in README.md.
-# OAUTH_CLIENT_ID=
-# OAUTH_SCOPES=openid profile
-# -----------------------------------------------------------------------------
-# HF Inference Providers (used by /api/js-apps category inference)
-# -----------------------------------------------------------------------------
-# Required for category inference. A standard READ token is enough -
-# Inference Providers access is on by default for FREE/PRO tokens.
-# Without this, /api/js-apps still works but every entry will have
-# `categories: null` (the route logs a warning at startup).
-HF_TOKEN=
-# Dataset where the inferred-categories cache is persisted.
-# Defaults to `tfrere/reachy-mini-app-categories` (per-user namespace,
-# auto-created on first commit). Override to e.g.
-# `pollen-robotics/reachy-mini-app-categories` once the org dataset
-# exists and the HF_TOKEN has write access to it.
-# HF_CATEGORIES_DATASET=tfrere/reachy-mini-app-categories

.gitignore CHANGED Viewed

@@ -22,5 +22,3 @@ dist-ssr
 *.njsproj
 *.sln
 *.sw?
-.env

 *.njsproj
 *.sln
 *.sw?

docs/APP_ICON_CONVENTION.md DELETED Viewed

@@ -1,113 +0,0 @@
-# App icon convention
-> Status: convention v1
-> Audience: authors shipping a Reachy Mini app to the Hugging Face Hub
-> Implemented by: `reachy-mini-website` catalog server (this repo) +
-> `reachy_mini_mobile_app`, `reachy_mini_desktop_app`
-> Source of truth: `server/index.js` → `findIconUrl()`
-This document specifies how a Reachy Mini app declares a custom icon.
-Apps that don't follow it keep working - the surface falls back to the
-front-matter `emoji:` glyph, which is the existing behaviour.
----
-## 1. The convention in three lines
-To ship a custom icon for your Reachy Mini app:
-1. Commit `icon.svg` (preferred) **or** `icon.png` at the root of your
-   Hugging Face Space repository.
-2. That's it. Within ~5 minutes (the catalog cache TTL) the mobile
-   shell, the desktop app and the website surface your icon
-   automatically, replacing the README front-matter emoji.
-3. If both files are present, `icon.svg` wins.
-No README change required. No tag to add. No PR to file against this
-repo. The catalog server scans the file list once per refresh and
-publishes a resolved URL on the app entry; every client consumes it.
----
-## 2. Why a file convention and not `cardData.thumbnail`
-HF Spaces support a `thumbnail:` field in README front-matter, but:
-- `thumbnail` is full-bleed marketing artwork (typically 1200x630),
-  not a square avatar. Scaling it to a 22 px or 44 px tile produces
-  muddy thumbnails.
-- We want app authors to ship a dedicated, optimised glyph they
-  control without learning the HF metadata schema.
-- SVG support means the icon scales cleanly across every mount point
-  (rail tile, pinned grid, iframe header) from a single asset.
-`thumbnail:` keeps its existing role (banner artwork on the Space's
-HF page) and is not consulted by this resolution path.
----
-## 3. Format & dimension recommendations
-| Property | Recommended | Hard requirement |
-|----------|-------------|------------------|
-| Format | `icon.svg` (vector) | `icon.svg` or `icon.png` |
-| Aspect ratio | 1:1 (square) | Renderers crop with `object-fit: contain`, but non-square icons render with letterboxing - prefer a true square |
-| Min PNG size | 256x256 | None enforced. PNGs below 64x64 will look soft on the pinned grid (44 px on retina ≈ 88 effective px) |
-| Background | Transparent OR solid colour | None - your call. Renderers don't add their own plate, so an icon with no background renders directly on the tile colour |
-| Padding | Bake ~10% inner padding into the asset | None - but icons that bleed edge-to-edge will touch the tile's rounded corners |
-| Light/dark variants | Single asset that works on both | None - if you must, ship two SVGs and use `prefers-color-scheme` inside the SVG via CSS |
-### Style notes
-- **Iconic, not photographic.** A solid filled silhouette reads at
-  22 px; a screenshot doesn't.
-- **High contrast against `background.paper`.** The mobile app paints
-  the tile background with the surface colour (very light grey on
-  light, near-black on dark). A pure white icon disappears on light.
-- **No drop shadow** baked into the asset. The renderer doesn't add
-  one either, and a baked shadow won't scale across sizes.
----
-## 4. How resolution works (for the curious)
-1. The catalog server calls
-   `https://huggingface.co/api/spaces?filter=reachy_mini&full=true`.
-   With `full=true`, the HF Hub returns `siblings: [{ rfilename: ... }]`
-   for every Space - the complete file list.
-2. For each app, `findIconUrl()` (in `server/index.js`) scans the
-   list for root-level filenames matching `ICON_CANDIDATES` in order
-   (`icon.svg` → `icon.png`).
-3. The first match becomes:
-   ```
-   https://huggingface.co/spaces/<author>/<repo>/resolve/main/<filename>
-   ```
-   `resolve/main/` (not `raw/main/`) so LFS pointers follow through
-   transparently and the `Content-Type` is set from the extension,
-   which `<img>` needs.
-4. The URL is published on the app entry as a top-level `iconUrl`
-   field. `null` when neither candidate exists.
-5. Clients (`reachy_mini_mobile_app`, `reachy_mini_desktop_app`) read
-   `iconUrl` and render an `<img>` when present, falling back to the
-   front-matter emoji otherwise. A runtime image load failure
-   re-falls-back to the emoji without a refresh.
-The whole resolution path is server-side, behind the 5-minute catalog
-cache. Adding 100 more apps adds zero per-client probes.
----
-## 5. Adding new icon formats
-If you need to support a new format (say, `icon.webp`), edit
-`ICON_CANDIDATES` in `server/index.js`:
-```js
-const ICON_CANDIDATES = ['icon.svg', 'icon.png', 'icon.webp'];
-```
-Order matters - the first hit wins, so put the preferred format first.
-Bumping the catalog cache (POST `/api/js-apps/refresh-categories` or
-just wait 5 minutes) picks up the new resolution rule.

scripts/evaluate-prompt-v2.py DELETED Viewed

@@ -1,445 +0,0 @@
-#!/usr/bin/env python3
-"""
-Prompt-v2 evaluation harness.
-Re-runs the LLM categorization on every JS app currently served by
-/api/js-apps with a tightened prompt, and prints a side-by-side
-diff against the live (v1) classifications.
-This file lives outside the server runtime - it never gets pushed
-to the Space. It's only meant to be hand-iterated until the diff
-looks right, then the chosen prompt is ported into server/categorize.js
-and server/categories.js.
-Run:
-    python3 scripts/evaluate-prompt-v2.py
-"""
-from __future__ import annotations
-import json
-import os
-import re
-import ssl
-import sys
-import time
-import urllib.error
-import urllib.request
-from pathlib import Path
-from typing import Any
-# Python 3.14 on macOS ships without the system CA bundle wired into
-# urllib by default - HF endpoints fail with CERTIFICATE_VERIFY_FAILED.
-# This script is dev-local only and only talks to huggingface.co, so
-# bypassing verification here is acceptable (would NEVER do this in
-# the server runtime).
-_SSL_CTX = ssl._create_unverified_context()  # noqa: S323
-HF_INFERENCE_URL = "https://router.huggingface.co/v1/chat/completions"
-MODEL = "meta-llama/Llama-3.1-8B-Instruct"
-TEMPERATURE = 0
-MAX_TOKENS = 120
-README_MAX_CHARS = 3000
-MAX_CATEGORIES_PER_APP = 3
-JS_APPS_URL = "https://pollen-robotics-reachy-mini.hf.space/api/js-apps"
-# ──────────────────────────────────────────────────────────────────────
-# Taxonomy v2 - 9 slugs (added "games")
-# ──────────────────────────────────────────────────────────────────────
-CATEGORIES_V2: list[tuple[str, str]] = [
-    (
-        "music",
-        "Music creation, playback, beats, songs, DJ mixing, instruments, "
-        "blind-test music games. Requires actual music (rhythm/melody/song). "
-        "NOT arbitrary audio (Morse code, alarms, TTS, sound effects).",
-    ),
-    (
-        "dance",
-        "Dance choreographies, motion replay, kinetic shows, "
-        "recording/replaying robot movements, dance parties.",
-    ),
-    (
-        "voice",
-        "Reachy talks, listens, or holds a real-time voice conversation: "
-        "TTS players, LLM-driven chat (OpenAI Realtime, Claude, Perplexity), "
-        "wake-word demos, daily reports/news/weather read aloud.",
-    ),
-    (
-        "storytelling",
-        "Narrative stories WITH plot and characters: interactive fiction, "
-        "bedtime tales, audio adventures, choose-your-own-adventure. "
-        "NOT for daily reports, news, weather, or Q&A (use `voice`).",
-    ),
-    (
-        "kids",
-        "Apps that EXPLICITLY target children: the words kids / children / "
-        "'for curious minds' / bedtime / 'learning for kids' must appear in "
-        "the name or description, OR the app must be obviously kid-targeted. "
-        "Combines with `storytelling`, `voice`, or `games`. Lifestyle, "
-        "sports, weather, general conversation are NOT kids.",
-    ),
-    (
-        "games",
-        "Apps with a play loop: scores, rounds, win/lose conditions, "
-        "quizzes, puzzles, sports simulations, dice/oracles (magic 8-ball), "
-        "arcade-style mini-games.",
-    ),
-    (
-        "vision",
-        "Apps where Reachy's camera DRIVES behaviour: face/hand/pose "
-        "tracking, image classification, gesture detection, visual mimicry. "
-        "NOT for apps that merely stream or display the camera feed.",
-    ),
-    (
-        "companion",
-        "Apps with an EXPLICIT emotional/personality/buddy framing in the "
-        "name or description (words like companion, buddy, mood, emotional, "
-        "personality, pet, Tamagotchi). Being friendly is not enough.",
-    ),
-    (
-        "dev-tools",
-        "RESERVED slug — see DECISION ALGORITHM step 1 below. Use ONLY "
-        "for pure technical artefacts (debug utilities, SDK probes, "
-        "minimal protocol demos, dev-only test spaces) with no end-user "
-        "experience. When used, it is the SOLE category — never combined.",
-    ),
-]
-ALLOWED = {slug for slug, _ in CATEGORIES_V2}
-# ──────────────────────────────────────────────────────────────────────
-# Few-shot examples - cover the main pitfalls of v1
-# ──────────────────────────────────────────────────────────────────────
-FEW_SHOT = [
-    (
-        "Reachy Morse",
-        "Send Morse code through Reachy's speaker.",
-        ["dev-tools"],
-        "(STEP 1 veto: pure technical artefact. NOT music.)",
-    ),
-    (
-        "WebRTC Demo",
-        "Minimal WebRTC connection between Reachy and the browser.",
-        ["dev-tools"],
-        "(STEP 1 veto: protocol demo. NOT vision.)",
-    ),
-    (
-        "TTS Reachy Mini",
-        "Browser TTS that plays out of Reachy Mini's speaker.",
-        ["voice"],
-        "(USER-FACING speech output is voice, NOT dev-tools.)",
-    ),
-    (
-        "Reachy Mochi - Emotional Companion",
-        "Your pocket buddy that develops a mood and personality over time.",
-        ["companion"],
-        "(explicit emotional/companion framing)",
-    ),
-    (
-        "Reachy Alive",
-        "(README empty; name suggests autonomy and life-like presence)",
-        ["companion"],
-        "(USE THE NAME when the README is empty; 'alive' = companion-like)",
-    ),
-    (
-        "Daily Surf Report",
-        "Reachy reads today's surf report out loud.",
-        ["voice"],
-        "(NOT storytelling — a report has no narrative arc. "
-        "NOT kids — surfing/sports are not kid-targeted.)",
-    ),
-    (
-        "Music Quiz",
-        "Play a blind test music game with a dancing Reachy.",
-        ["music", "games", "dance"],
-        "(multi-label: three slugs truly co-apply, ordered by relevance)",
-    ),
-    (
-        "Mime Bot",
-        "Reachy mimics your face live from your webcam.",
-        ["vision"],
-        "(NOT companion — mimicry is visual, no emotional framing.)",
-    ),
-]
-def build_system_prompt() -> str:
-    taxonomy = "\n".join(f"- {slug}: {desc}" for slug, desc in CATEGORIES_V2)
-    examples = "\n".join(
-        f"  - {name!r}: {desc!r}\n"
-        f"    → {{\"categories\": {json.dumps(cats)}}}   {hint}"
-        for name, desc, cats, hint in FEW_SHOT
-    )
-    return f"""You classify a Reachy Mini robot app into a CLOSED list of categories.
-OUTPUT FORMAT
-Return ONLY a single JSON object: {{"categories": ["slug1", "slug2"]}}.
-Pick 1 to {MAX_CATEGORIES_PER_APP} slugs, ordered from most to least relevant.
-Use the EXACT slug. No prose, no code fences, no commentary outside the JSON.
-DECISION ALGORITHM (apply in order)
-STEP 1 — `dev-tools` veto
-Is this app a PURE technical artefact with no user-facing experience
-beyond "here is how the SDK / API works"?
-Examples that pass the veto: WebRTC demo, SDK probe, debug utility,
-raw remote-control interface, dev-only test space.
-Examples that DO NOT pass the veto (they are user-facing apps):
-TTS players, voice chat, music apps, storytelling, companions —
-even when the README is dev-heavy.
-  ─ YES → return {{"categories": ["dev-tools"]}} and STOP. Never combine.
-  ─ NO  → continue to STEP 2.
-STEP 2 — Pick 1 to {MAX_CATEGORIES_PER_APP} user-facing slugs from the
-list below. Choose the MOST SPECIFIC categories. Order from most to
-least relevant. Multi-label is encouraged when two categories truly
-co-apply (e.g. music-and-dance, kids storytelling, vision game).
-If the README is empty or very sparse, USE THE NAME AND DESCRIPTION
-as the primary signal — do not bail to an empty list just because the
-README is thin.
-STEP 3 — Strict slug rules (each must hold, or DO NOT use the slug)
-- `companion`: requires EXPLICIT emotional / personality / buddy framing
-  (companion, buddy, friend, mood, emotional, personality, pet,
-  Tamagotchi-like, "alive", "life companion"). Being friendly is not
-  enough.
-- `music`: requires actual music — rhythm, melody, songs, beats, DJ
-  sets, instruments, music quizzes. Arbitrary audio (Morse, alarms,
-  TTS, sound effects) is NOT music.
-- `vision`: requires the camera to DRIVE behaviour (tracking,
-  classification, mimicry). Merely streaming or displaying the camera
-  (WebRTC demos, remote-control viewers) is NOT vision.
-- `storytelling`: requires a narrative ARC — plot, characters, scenes.
-  Daily reports, news, weather, Q&A are NOT storytelling (they are
-  `voice`).
-- `games`: requires a play loop — score, rounds, win/lose, puzzles,
-  quizzes, dice/oracles, sports simulations.
-- `kids`: requires kid-targeted framing (kids/children/curious minds/
-  bedtime/learning for kids) in the name or description. Lifestyle,
-  sports, weather, general conversation are NOT kids.
-AVAILABLE CATEGORIES
-{taxonomy}
-REFERENCE EXAMPLES
-{examples}
-Do not include any text outside the JSON object."""
-def build_user_prompt(name: str, description: str, readme: str) -> str:
-    return (
-        f"App name: {name or '(unknown)'}\n"
-        f"Short description: {description or '(none)'}\n\n"
-        f"README excerpt:\n{readme or '(no README available)'}\n\n"
-        f"Return the JSON now."
-    )
-# ──────────────────────────────────────────────────────────────────────
-# README fetch + clean (mirrors server/categorize.js)
-# ─────────────────────────────────────────────────────────────��────────
-def fetch_readme(space_id: str) -> str:
-    url = f"https://huggingface.co/spaces/{space_id}/raw/main/README.md"
-    try:
-        with urllib.request.urlopen(url, timeout=10, context=_SSL_CTX) as r:
-            return r.read().decode("utf-8", errors="replace")
-    except (urllib.error.URLError, urllib.error.HTTPError, TimeoutError):
-        return ""
-def clean_readme(raw: str) -> str:
-    if not raw:
-        return ""
-    txt = raw
-    txt = re.sub(r"^---\n[\s\S]*?\n---\n?", "", txt)
-    txt = re.sub(r"!\[[^\]]*\]\([^)]+\)", "", txt)
-    txt = re.sub(r"<img\b[^>]*>", "", txt, flags=re.IGNORECASE)
-    txt = re.sub(r"\[!\[[^\]]*\]\([^)]+\)\]\([^)]+\)", "", txt)
-    txt = re.sub(r"</?[a-zA-Z][^>]*>", "", txt)
-    txt = re.sub(r"\n{3,}", "\n\n", txt)
-    if len(txt) > README_MAX_CHARS:
-        cut = txt.rfind("\n\n", 0, README_MAX_CHARS)
-        if cut > README_MAX_CHARS // 2:
-            txt = txt[:cut]
-        else:
-            txt = txt[:README_MAX_CHARS]
-    return txt.strip()
-# ──────────────────────────────────────────────────────────────────────
-# LLM call
-# ──────────────────────────────────────────────────────────────────────
-def call_llm(hf_token: str, system: str, user: str) -> str | None:
-    body = json.dumps(
-        {
-            "model": MODEL,
-            "messages": [
-                {"role": "system", "content": system},
-                {"role": "user", "content": user},
-            ],
-            "temperature": TEMPERATURE,
-            "max_tokens": MAX_TOKENS,
-            "response_format": {"type": "json_object"},
-        }
-    ).encode("utf-8")
-    req = urllib.request.Request(
-        HF_INFERENCE_URL,
-        data=body,
-        headers={
-            "Authorization": f"Bearer {hf_token}",
-            "Content-Type": "application/json",
-            # Cloudflare in front of the router 403s the default
-            # "Python-urllib/x.y" UA. Any reasonable UA passes.
-            "User-Agent": "reachy-mini-prompt-eval/1.0",
-        },
-        method="POST",
-    )
-    try:
-        with urllib.request.urlopen(req, timeout=30, context=_SSL_CTX) as r:
-            data = json.loads(r.read().decode("utf-8"))
-            return data.get("choices", [{}])[0].get("message", {}).get("content")
-    except urllib.error.HTTPError as e:
-        detail = e.read().decode("utf-8", errors="replace")[:200]
-        print(f"  ✗ LLM HTTP {e.code}: {detail}", file=sys.stderr)
-        return None
-    except Exception as e:  # noqa: BLE001
-        print(f"  ✗ LLM error: {e}", file=sys.stderr)
-        return None
-def extract_json_obj(text: str) -> dict[str, Any] | None:
-    if not text:
-        return None
-    start = text.find("{")
-    if start == -1:
-        return None
-    depth = 0
-    for i in range(start, len(text)):
-        c = text[i]
-        if c == "{":
-            depth += 1
-        elif c == "}":
-            depth -= 1
-            if depth == 0:
-                try:
-                    return json.loads(text[start : i + 1])
-                except json.JSONDecodeError:
-                    return None
-    return None
-def sanitize(raw: Any) -> list[str]:
-    if not isinstance(raw, list):
-        return []
-    out: list[str] = []
-    seen: set[str] = set()
-    for v in raw:
-        if not isinstance(v, str):
-            continue
-        slug = v.strip().lower()
-        if not slug or slug in seen or slug not in ALLOWED:
-            continue
-        seen.add(slug)
-        out.append(slug)
-        if len(out) >= MAX_CATEGORIES_PER_APP:
-            break
-    return out
-# ──────────────────────────────────────────────────────────────────────
-# Main
-# ──────────────────────────────────────────────────────────────────────
-def read_hf_token() -> str:
-    if os.environ.get("HF_TOKEN"):
-        return os.environ["HF_TOKEN"]
-    env_file = Path(__file__).resolve().parent.parent / ".env"
-    if env_file.exists():
-        for line in env_file.read_text().splitlines():
-            m = re.match(r"^\s*HF_TOKEN\s*=\s*(.*?)\s*$", line)
-            if m:
-                v = m.group(1).strip().strip('"').strip("'")
-                if v:
-                    return v
-    raise SystemExit("HF_TOKEN not found in env or .env")
-def fetch_live_classifications() -> list[dict[str, Any]]:
-    with urllib.request.urlopen(JS_APPS_URL, timeout=30, context=_SSL_CTX) as r:
-        return json.load(r)["apps"]
-def main() -> int:
-    hf_token = read_hf_token()
-    apps = fetch_live_classifications()
-    print(f"Loaded {len(apps)} JS apps from prod.\n")
-    system = build_system_prompt()
-    print(f"System prompt: {len(system)} chars, {system.count(chr(10))} lines.\n")
-    results: list[dict[str, Any]] = []
-    for i, app in enumerate(apps, 1):
-        sid = app["id"]
-        name = app.get("name") or sid.split("/")[-1]
-        desc = (
-            app.get("description")
-            or (app.get("extra") or {}).get("cardData", {}).get("short_description")
-            or ""
-        )
-        old_cats = app.get("categories") or []
-        raw_readme = fetch_readme(sid)
-        readme = clean_readme(raw_readme)
-        user = build_user_prompt(name, desc, readme)
-        reply = call_llm(hf_token, system, user)
-        new_cats = sanitize((extract_json_obj(reply) or {}).get("categories"))
-        changed = set(old_cats) != set(new_cats)
-        marker = "Δ" if changed else " "
-        print(
-            f"  {marker} ({i:>2}/{len(apps)}) {name[:36]:<37}  "
-            f"old=[{', '.join(old_cats)}]"
-            + (f"  →  new=[{', '.join(new_cats)}]" if changed else "")
-        )
-        results.append(
-            {
-                "id": sid,
-                "name": name,
-                "old": old_cats,
-                "new": new_cats,
-                "changed": changed,
-            }
-        )
-        time.sleep(0.25)
-    print()
-    print("─" * 80)
-    print("DIFF (only changed entries)")
-    print("─" * 80)
-    for r in results:
-        if not r["changed"]:
-            continue
-        print(
-            f"  {r['name'][:38]:<40}  "
-            f"[{', '.join(r['old']) or '∅'}]  →  [{', '.join(r['new']) or '∅'}]"
-        )
-    changed_count = sum(1 for r in results if r["changed"])
-    print()
-    print(f"{changed_count}/{len(results)} entries changed.")
-    return 0
-if __name__ == "__main__":
-    sys.exit(main())

server/categories.js DELETED Viewed

@@ -1,189 +0,0 @@
-/**
- * Predefined taxonomy for JS Reachy Mini apps.
- *
- * These slugs are the ONLY valid output values for the LLM
- * inference step (anything else is dropped at parse time) and
- * the values consumers (mobile shell, website) filter on.
- *
- * Why a closed list instead of free-form tags
- * ──────────────────────────────────────────
- * The HF Spaces catalog has no usable categorization for the
- * reachy_mini_js_app subset (only platform/SDK tags). We bridge
- * the gap by inferring categories with an LLM, but we have to
- * constrain the model's output: a closed list keeps category
- * pages stable, lets us pre-pick emojis/labels, and avoids the
- * "30 near-duplicate slugs" problem you'd get with free-form.
- *
- * Bumping the taxonomy
- * ────────────────────
- * Adding, removing or renaming a slug changes the meaning of
- * cached entries. Bump TAXONOMY_VERSION when you do that: the
- * cache layer compares each entry's `taxonomyVersion` against
- * the live one and recomputes stale ones on the next pass.
- */
-/**
- * Bump this when the slug list OR the descriptions change in a way
- * that affects the LLM output. The cache layer invalidates entries
- * whose taxonomyVersion is older than this and reclassifies them on
- * the next pass. We don't bump it for cosmetic edits (label / emoji)
- * since those don't reach the LLM.
- *
- * History:
- *   - v1: initial 8-slug taxonomy.
- *   - v2: added `games`, tightened `kids` + `dev-tools` descriptions,
- *         switched the prompt to a DECISION ALGORITHM with few-shot.
- *   - v3: switched from multi-label (up to 3 slugs) to single-label
- *         (exactly 1 slug). Each app surfaces in exactly one category
- *         section on the mobile shell - no duplicates across swipers.
- *   - v4: renamed `dance` to `motion` (broader: marionette, replay,
- *         choreography without music). Music-driven dance parties
- *         now belong to `music` since music is what drives them.
- */
-export const TAXONOMY_VERSION = 4;
-/**
- * Canonical category list. Keep slugs short, kebab-case, and
- * memorable: they end up in URLs (e.g. `?cat=music`) and in
- * filter chips on mobile.
- *
- * The `description` field is the SOLE source of truth the LLM
- * sees - keep them factual, scope-bounded, and example-led so
- * the model has signal for both inclusion and exclusion.
- */
-export const CATEGORIES = [
-  {
-    slug: 'music',
-    label: 'Music & Beats',
-    emoji: '🎵',
-    description:
-      'Music creation, playback, beats, songs, DJ mixing, instruments, ' +
-      'blind-test music games, AND music-driven dance parties (Reachy ' +
-      'dances to a song). Requires actual music (rhythm / melody / song). ' +
-      'Arbitrary audio (Morse code, alarms, TTS, sound effects) is NOT ' +
-      'music. Pure choreography without music belongs to `motion`.',
-  },
-  {
-    slug: 'motion',
-    label: 'Motion & Movement',
-    emoji: '🦾',
-    description:
-      "Apps that drive Reachy's physical movement on its own: motion " +
-      'replay, marionette-style remote control of the body, kinetic ' +
-      'shows, choreographies WITHOUT music, expressive body language. ' +
-      'If the movement is synced to music, use `music` instead.',
-  },
-  {
-    slug: 'voice',
-    label: 'Voice & Conversation',
-    emoji: '🗣️',
-    description:
-      'Reachy talks, listens, or holds a real-time voice ' +
-      'conversation: TTS players, LLM-driven chat (OpenAI Realtime, ' +
-      'Claude, Perplexity), wake-word demos, daily reports / news / ' +
-      'weather read aloud.',
-  },
-  {
-    slug: 'storytelling',
-    label: 'Stories',
-    emoji: '📖',
-    description:
-      'Narrative stories WITH plot and characters: interactive ' +
-      'fiction, bedtime tales, audio adventures, choose-your-own-' +
-      'adventure. NOT for daily reports, news, weather, or Q&A ' +
-      '(those are `voice`).',
-  },
-  {
-    slug: 'kids',
-    label: 'For Kids',
-    emoji: '🧒',
-    description:
-      'Apps that EXPLICITLY target children: the words kids / ' +
-      "children / 'for curious minds' / bedtime / 'learning for kids' " +
-      'must appear in the name or description, OR the app must be ' +
-      'obviously kid-targeted. Combines with `storytelling`, `voice`, ' +
-      'or `games`. Lifestyle, sports, weather, generic personality / ' +
-      'narration / fun framings are NOT kids.',
-  },
-  {
-    slug: 'games',
-    label: 'Games & Play',
-    emoji: '🎮',
-    description:
-      'Apps with a play loop: scores, rounds, win/lose conditions, ' +
-      'quizzes, puzzles, sports simulations, dice/oracles (magic ' +
-      '8-ball), arcade-style mini-games.',
-  },
-  {
-    slug: 'vision',
-    label: 'Vision & Camera',
-    emoji: '👁️',
-    description:
-      "Apps where Reachy's camera DRIVES behaviour: face/hand/pose " +
-      'tracking, image classification, gesture detection, visual ' +
-      'mimicry. Merely streaming or displaying the camera feed ' +
-      '(WebRTC demos, remote-control viewers) is NOT vision.',
-  },
-  {
-    slug: 'companion',
-    label: 'Companion',
-    emoji: '🤝',
-    description:
-      'Apps with an EXPLICIT emotional / personality / buddy framing ' +
-      'in the name or description (companion, buddy, friend, mood, ' +
-      'emotional, personality, pet, Tamagotchi-like, "alive", ' +
-      '"life companion"). Being friendly is not enough.',
-  },
-  {
-    slug: 'dev-tools',
-    label: 'Dev & Demos',
-    emoji: '🛠️',
-    description:
-      'RESERVED slug - see DECISION ALGORITHM step 1 in the prompt. ' +
-      'Use ONLY for pure technical artefacts (debug utilities, SDK ' +
-      'probes, minimal protocol demos, dev-only test spaces) with no ' +
-      'end-user experience. When used, it is the SOLE category - ' +
-      'never combined with another slug.',
-  },
-];
-export const ALLOWED_SLUGS = new Set(CATEGORIES.map((c) => c.slug));
-export function isValidSlug(slug) {
-  return ALLOWED_SLUGS.has(slug);
-}
-/**
- * Render the taxonomy as a bulleted list for the LLM prompt.
- * Format mirrors what the model is asked to output (slug first)
- * to nudge it towards copying the exact string back.
- */
-export function buildLlmCategoryList() {
-  return CATEGORIES.map((c) => `- ${c.slug}: ${c.description}`).join('\n');
-}
-/**
- * Sanitize a raw LLM-returned list of slugs:
- * - drop non-strings
- * - lowercase + trim
- * - drop unknown slugs (hallucinations)
- * - dedupe while preserving order (the model orders by relevance)
- * - cap to MAX_CATEGORIES
- *
- * Returns a fresh array; never mutates input.
- */
-export function sanitizeSlugs(raw, maxCategories = 3) {
-  if (!Array.isArray(raw)) return [];
-  const seen = new Set();
-  const out = [];
-  for (const v of raw) {
-    if (typeof v !== 'string') continue;
-    const slug = v.trim().toLowerCase();
-    if (!slug || seen.has(slug)) continue;
-    if (!ALLOWED_SLUGS.has(slug)) continue;
-    seen.add(slug);
-    out.push(slug);
-    if (out.length >= maxCategories) break;
-  }
-  return out;
-}

server/categorize.js DELETED Viewed

@@ -1,426 +0,0 @@
-/**
- * LLM-based category inference for JS Reachy Mini apps.
- *
- * Pipeline (`categorizeApp`)
- * ──────────────────────────
- *   1. Fetch the Space's README from HF Hub (raw)
- *   2. Strip frontmatter, images, badges, raw HTML, then truncate
- *   3. Call a chat LLM via HF Inference Providers (OpenAI-compatible)
- *      with the predefined taxonomy + the app's name/description
- *   4. Parse JSON, validate against ALLOWED_SLUGS, keep up to 3
- *
- * Robustness contract
- * ───────────────────
- * `categorizeApp` NEVER throws on transient failure (network,
- * 429, malformed JSON). It returns `null`, which the cache layer
- * interprets as "not yet categorized; retry on the next pass".
- * Hard errors (HF_TOKEN missing) are signalled by a thrown
- * `HfTokenMissingError` so the caller can short-circuit the
- * whole batch.
- */
-import {
-  buildLlmCategoryList,
-  sanitizeSlugs,
-} from './categories.js';
-// HF Inference Providers - OpenAI-compatible router. Auto-routes
-// the request to whichever provider currently serves the model
-// (Together, Nebius, Fireworks, Sambanova...). The token must
-// have `Inference Providers` access (default for all PRO and
-// most FREE tokens since 2025).
-const HF_INFERENCE_URL = 'https://router.huggingface.co/v1/chat/completions';
-// 8B model: cheap, fast (~1 s per call), more than enough for a
-// closed-list multi-label classification with good descriptions.
-// If quality drifts we can swap to 70B without touching anything
-// else - the prompt is generic.
-const DEFAULT_MODEL = 'meta-llama/Llama-3.1-8B-Instruct';
-// README budget
-const README_MAX_CHARS = 3000;
-// Single-label classification: each app gets EXACTLY ONE slug -
-// the dominant one. The shape stays `string[]` for forward
-// compatibility (if we ever revert to multi-label, no API break),
-// but the array always contains 0 or 1 entry. Mobile chips and
-// "swipers per category" thus surface each app once and only once.
-const MAX_CATEGORIES_PER_APP = 1;
-// LLM call budget
-const LLM_TIMEOUT_MS = 30_000;
-const LLM_MAX_TOKENS = 120;
-const LLM_TEMPERATURE = 0;
-export class HfTokenMissingError extends Error {
-  constructor() {
-    super('HF_TOKEN env var is not set; cannot call HF Inference Providers.');
-    this.name = 'HfTokenMissingError';
-  }
-}
-/**
- * Fetch a Space's README from HF Hub. Returns the raw markdown
- * string, or `null` if the request fails (404, network, etc.) -
- * the caller falls back to "name + description only" in that case,
- * which is still enough signal for the LLM on most apps.
- */
-export async function fetchSpaceReadme(spaceId, { signal } = {}) {
-  if (!spaceId || typeof spaceId !== 'string') return null;
-  // The README of a HF Space lives at /spaces/<id>/raw/main/README.md.
-  // The `raw` endpoint returns the file as-is (no Hub UI wrapping)
-  // and is anonymous-friendly, so no auth is needed here.
-  const url = `https://huggingface.co/spaces/${spaceId}/raw/main/README.md`;
-  try {
-    const res = await fetch(url, { signal });
-    if (!res.ok) return null;
-    return await res.text();
-  } catch {
-    return null;
-  }
-}
-/**
- * Lightly clean a raw README so the LLM doesn't burn tokens on
- * boilerplate (HF frontmatter, badges, images) and so the actual
- * prose surfaces above the truncation budget.
- *
- * We keep transformations conservative: we never edit the
- * surrounding prose, we just delete decorative tokens. Anything
- * cosmetic-only that clearly isn't signal for classification
- * (badges, images, raw HTML).
- */
-export function cleanReadme(raw) {
-  if (!raw || typeof raw !== 'string') return '';
-  let txt = raw;
-  // 1. Strip the YAML frontmatter at the very top (HF Spaces
-  //    ship a mandatory `---\n...metadata...\n---` block whose
-  //    fields are already exposed to us via the catalog payload,
-  //    so feeding them to the LLM is pure noise).
-  txt = txt.replace(/^---\n[\s\S]*?\n---\n?/, '');
-  // 2. Drop image markdown (`![alt](url)`) and HTML <img> tags.
-  //    Vision apps tend to load up READMEs with screenshots and
-  //    GIFs; the alt text is sometimes useful but more often it's
-  //    "demo.gif" - low signal/noise ratio.
-  txt = txt.replace(/!\[[^\]]*\]\([^)]+\)/g, '');
-  txt = txt.replace(/<img\b[^>]*>/gi, '');
-  // 3. Strip shields.io / GitHub badges (markdown links that
-  //    wrap an image). They survive (2) only when nested.
-  txt = txt.replace(/\[!\[[^\]]*\]\([^)]+\)\]\([^)]+\)/g, '');
-  // 4. Generic HTML stripping. Most READMEs are pure markdown,
-  //    but some authors embed `<details>`, `<sub>`, `<center>`
-  //    blocks. Keep the inner text, drop the tags.
-  txt = txt.replace(/<\/?[a-zA-Z][^>]*>/g, '');
-  // 5. Collapse runs of blank lines so trimming doesn't waste
-  //    tokens on the gap.
-  txt = txt.replace(/\n{3,}/g, '\n\n');
-  // 6. Truncate. We slice at the paragraph boundary closest to
-  //    the budget so we don't end mid-sentence.
-  if (txt.length > README_MAX_CHARS) {
-    const cut = txt.lastIndexOf('\n\n', README_MAX_CHARS);
-    txt = txt.slice(0, cut > README_MAX_CHARS / 2 ? cut : README_MAX_CHARS);
-  }
-  return txt.trim();
-}
-/**
- * Few-shot examples woven into the system prompt.
- *
- * Each entry encodes a pitfall the v1 prompt fell into during the
- * 24-app eval (see `scripts/evaluate-prompt-v2.py`). Keep this list
- * tight - past ~10 examples the model starts pattern-matching
- * literally on the example names rather than applying the rules.
- *
- * Format: [name, description, expected_slugs, brief_justification]
- */
-const FEW_SHOT_EXAMPLES = [
-  [
-    'Reachy Morse',
-    "Send Morse code through Reachy's speaker.",
-    ['dev-tools'],
-    '(STEP 1 veto: pure technical artefact. NOT music.)',
-  ],
-  [
-    'WebRTC Demo',
-    'Minimal WebRTC connection between Reachy and the browser.',
-    ['dev-tools'],
-    '(STEP 1 veto: protocol demo. NOT vision.)',
-  ],
-  [
-    'TTS Reachy Mini',
-    "Browser TTS that plays out of Reachy Mini's speaker.",
-    ['voice'],
-    '(USER-FACING speech output is voice, NOT dev-tools.)',
-  ],
-  [
-    'Reachy Mochi - Emotional Companion',
-    'Your pocket buddy that develops a mood and personality over time.',
-    ['companion'],
-    '(explicit emotional/companion framing)',
-  ],
-  [
-    'Reachy Alive',
-    '(README empty; name suggests autonomy and life-like presence)',
-    ['companion'],
-    "(USE THE NAME when the README is empty; 'alive' = companion-like)",
-  ],
-  [
-    'Daily Surf Report',
-    "Reachy reads today's surf report out loud.",
-    ['voice'],
-    '(NOT storytelling - a report has no narrative arc. ' +
-      'NOT kids - surfing/sports are not kid-targeted.)',
-  ],
-  [
-    'Music Quiz',
-    'Play a blind test music game with a dancing Reachy.',
-    ['music'],
-    '(single dominant slug - music wins over games because the app ' +
-      "is primarily a music blind-test; the dancing is a side effect " +
-      'of the music and is captured by `music` too)',
-  ],
-  [
-    'Mime Bot',
-    'Reachy mimics your face live from your webcam.',
-    ['vision'],
-    '(NOT companion - mimicry is visual, no emotional framing.)',
-  ],
-];
-function renderFewShot() {
-  return FEW_SHOT_EXAMPLES.map(([name, desc, slugs, hint]) => {
-    const slugsJson = JSON.stringify(slugs);
-    return (
-      `  - ${JSON.stringify(name)}: ${JSON.stringify(desc)}\n` +
-      `    → {"categories": ${slugsJson}}   ${hint}`
-    );
-  }).join('\n');
-}
-/**
- * Build the chat messages handed to the LLM.
- *
- * The system prompt is structured as a 3-step DECISION ALGORITHM
- * rather than a flat list of rules, because the 8B-class model we
- * use (Llama-3.1-8B-Instruct) follows imperative procedures more
- * reliably than soft constraints. The `dev-tools` veto in STEP 1
- * is what stops the model from silently combining it with other
- * slugs on user-facing apps.
- *
- * The few-shot examples below the rules cover the v1 pitfalls
- * (companion hallucinations, music-on-audio, kids-on-personas,
- * storytelling-on-reports). Six is the sweet spot - more starts
- * over-fitting on example wording.
- */
-function buildMessages({ name, description, readme }) {
-  const taxonomy = buildLlmCategoryList();
-  const examples = renderFewShot();
-  const system = `You classify a Reachy Mini robot app into a CLOSED list of categories.
-OUTPUT FORMAT
-Return ONLY a single JSON object: {"categories": ["slug"]}.
-Pick EXACTLY ONE slug - the single dominant category that best
-captures the app's primary identity. Use the EXACT slug. The list
-always contains 0 or 1 entry.
-No prose, no code fences, no commentary outside the JSON.
-DECISION ALGORITHM (apply in order)
-STEP 1 - \`dev-tools\` veto
-Is this app a PURE technical artefact with no user-facing experience
-beyond "here is how the SDK / API works"?
-Examples that pass the veto: WebRTC demo, SDK probe, debug utility,
-raw remote-control interface, dev-only test space.
-Examples that DO NOT pass the veto (they are user-facing apps):
-TTS players, voice chat, music apps, storytelling, companions -
-even when the README is dev-heavy.
-  - YES -> return {"categories": ["dev-tools"]} and STOP.
-  - NO  -> continue to STEP 2.
-STEP 2 - Pick the SINGLE most dominant user-facing slug from the list
-below. Choose the slug that captures the app's primary identity, not
-every aspect it touches. When two slugs feel equally fitting, pick the
-one that a user would name FIRST when describing the app in one word.
-Examples of tie-breaks:
-  - music-driven dance party (Reachy dances to a song) -> \`music\`.
-    The music is what drives the experience.
-  - pure choreography / marionette / motion replay without music ->
-    \`motion\`. The movement is the experience.
-  - storytelling + kids app -> prefer \`kids\` if it explicitly targets
-    children, \`storytelling\` otherwise.
-  - vision + games app -> prefer \`games\` if there is a play loop,
-    \`vision\` if it is mostly a perception demo.
-If the README is empty or very sparse, USE THE NAME AND DESCRIPTION
-as the primary signal - do not bail to an empty list just because the
-README is thin.
-STEP 3 - Strict slug rules (each must hold, or DO NOT use the slug)
-- \`companion\`: requires EXPLICIT emotional / personality / buddy
-  framing (companion, buddy, friend, mood, emotional, personality,
-  pet, Tamagotchi-like, "alive", "life companion"). Being friendly is
-  not enough.
-- \`music\`: requires actual music - rhythm, melody, songs, beats, DJ
-  sets, instruments, music quizzes. Arbitrary audio (Morse, alarms,
-  TTS, sound effects) is NOT music.
-- \`vision\`: requires the camera to DRIVE behaviour (tracking,
-  classification, mimicry). Merely streaming or displaying the camera
-  (WebRTC demos, remote-control viewers) is NOT vision.
-- \`storytelling\`: requires a narrative ARC - plot, characters, scenes.
-  Daily reports, news, weather, Q&A are NOT storytelling (they are
-  \`voice\`).
-- \`games\`: requires a play loop - score, rounds, win/lose, puzzles,
-  quizzes, dice/oracles, sports simulations.
-- \`kids\`: requires kid-targeted framing (kids/children/curious minds/
-  bedtime/learning for kids) in the name or description. Lifestyle,
-  sports, weather, general conversation are NOT kids.
-AVAILABLE CATEGORIES
-${taxonomy}
-REFERENCE EXAMPLES
-${examples}
-Do not include any text outside the JSON object.`;
-  const user =
-    `App name: ${name || '(unknown)'}\n` +
-    `Short description: ${description || '(none)'}\n\n` +
-    `README excerpt:\n${readme || '(no README available)'}\n\n` +
-    'Return the JSON now.';
-  return [
-    { role: 'system', content: system },
-    { role: 'user', content: user },
-  ];
-}
-/**
- * Best-effort JSON extraction. Some 8B models still wrap the
- * answer in ``` fences or prepend "Sure, here you go:". We grab
- * the first balanced `{...}` block and parse that.
- */
-function extractJsonObject(text) {
-  if (!text || typeof text !== 'string') return null;
-  const start = text.indexOf('{');
-  if (start === -1) return null;
-  let depth = 0;
-  for (let i = start; i < text.length; i++) {
-    const ch = text[i];
-    if (ch === '{') depth++;
-    else if (ch === '}') {
-      depth--;
-      if (depth === 0) {
-        const slice = text.slice(start, i + 1);
-        try {
-          return JSON.parse(slice);
-        } catch {
-          return null;
-        }
-      }
-    }
-  }
-  return null;
-}
-/**
- * Call the HF Inference Providers chat endpoint. Returns the
- * raw assistant message string, or `null` on any error.
- */
-async function callLlm({ messages, model, signal }) {
-  const token = process.env.HF_TOKEN;
-  if (!token) throw new HfTokenMissingError();
-  const body = {
-    model,
-    messages,
-    temperature: LLM_TEMPERATURE,
-    max_tokens: LLM_MAX_TOKENS,
-    // `response_format` is honoured by some providers (Nebius,
-    // Together) but ignored by others. It's a free upgrade when
-    // present, harmless otherwise; the JSON-extractor below is
-    // the real safety net.
-    response_format: { type: 'json_object' },
-  };
-  let res;
-  try {
-    res = await fetch(HF_INFERENCE_URL, {
-      method: 'POST',
-      headers: {
-        'Authorization': `Bearer ${token}`,
-        'Content-Type': 'application/json',
-      },
-      body: JSON.stringify(body),
-      signal,
-    });
-  } catch (err) {
-    console.warn(`[categorize] LLM fetch failed: ${err.message}`);
-    return null;
-  }
-  if (!res.ok) {
-    const detail = await res.text().catch(() => '');
-    console.warn(
-      `[categorize] LLM HTTP ${res.status}: ${detail.slice(0, 200)}`,
-    );
-    return null;
-  }
-  let json;
-  try {
-    json = await res.json();
-  } catch {
-    return null;
-  }
-  return json?.choices?.[0]?.message?.content ?? null;
-}
-/**
- * Public entry point.
- *
- * Returns a string[] of validated slugs (0-3 items), or `null`
- * on transient failure so the caller can mark the entry "needs
- * retry" without writing a misleading empty list.
- *
- * Treat an empty array `[]` as "the LLM looked and concluded
- * none fit" - that's a valid, cacheable outcome.
- */
-export async function categorizeApp({
-  name,
-  description,
-  spaceId,
-  model = DEFAULT_MODEL,
-} = {}) {
-  if (!spaceId) return null;
-  const ctrl = new AbortController();
-  const timeoutId = setTimeout(() => ctrl.abort(), LLM_TIMEOUT_MS);
-  try {
-    const rawReadme = await fetchSpaceReadme(spaceId, { signal: ctrl.signal });
-    const readme = cleanReadme(rawReadme);
-    const messages = buildMessages({ name, description, readme });
-    const reply = await callLlm({ messages, model, signal: ctrl.signal });
-    if (reply == null) return null;
-    const obj = extractJsonObject(reply);
-    if (!obj || !Array.isArray(obj.categories)) {
-      console.warn(
-        `[categorize] ${spaceId}: malformed LLM reply (truncated): ` +
-          `${reply.slice(0, 120)}`,
-      );
-      return null;
-    }
-    return sanitizeSlugs(obj.categories, MAX_CATEGORIES_PER_APP);
-  } finally {
-    clearTimeout(timeoutId);
-  }
-}

server/categoryCache.js DELETED Viewed

@@ -1,290 +0,0 @@
-/**
- * Persistent cache for inferred app categories, backed by a
- * HuggingFace dataset.
- *
- * Why a dataset (not a local file)
- * ────────────────────────────────
- * The website runs in a Docker HF Space. The container's
- * filesystem is wiped on every rebuild (and rebuilds happen
- * on every push, every model update, every Space restart).
- * Re-running 200 LLM calls every cold start would be wasteful
- * and slow the user-visible /api/js-apps for the first 30 s.
- *
- * Pushing the cache to a dataset gives us:
- *   1. Persistence across rebuilds and machine moves
- *   2. A versioned audit log of how categories evolve
- *   3. A single source of truth other tooling can consume
- *      (the mobile shell could even read the dataset directly
- *      if it ever wanted to bypass the website).
- *
- * Storage shape
- * ─────────────
- *   <dataset>/categories.json
- *
- *   {
- *     "version": 1,
- *     "taxonomyVersion": 1,
- *     "updatedAt": "2026-05-10T11:08:42Z",
- *     "entries": {
- *       "<spaceId>": {
- *         "lastModified": "2026-05-08T22:13:01Z",
- *         "categories": ["storytelling", "kids", "voice"],
- *         "categorizedAt": "2026-05-10T11:08:42Z",
- *         "taxonomyVersion": 1
- *       }
- *     }
- *   }
- *
- * In-memory tier
- * ──────────────
- * The Map<spaceId, entry> is the hot path. The dataset is
- * loaded once at boot and only flushed when entries actually
- * change (the warmup batch buffers writes and flushes once
- * at the end). All synchronous access goes through the Map.
- */
-import { commit, createRepo } from '@huggingface/hub';
-import { TAXONOMY_VERSION } from './categories.js';
-// Default location: a per-user dataset that the HF_TOKEN owner
-// definitely has write access to. Override with the env var
-// when promoting to the org-owned `pollen-robotics/...` dataset.
-const DEFAULT_DATASET = 'tfrere/reachy-mini-app-categories';
-const CACHE_FILE_PATH = 'categories.json';
-const CACHE_FORMAT_VERSION = 1;
-class CategoryCache {
-  constructor() {
-    this.entries = new Map();
-    this.repoName = process.env.HF_CATEGORIES_DATASET || DEFAULT_DATASET;
-    this.loaded = false;
-    this.dirty = false;
-    // Concurrency guard for `flush()` - we never want two
-    // commit() calls fighting for the same parent commit.
-    this.flushing = false;
-  }
-  /**
-   * Load the dataset cache into memory. Best-effort: a missing
-   * dataset, a 404, or a malformed JSON all collapse to "start
-   * fresh, the warmup will repopulate". We never let cache load
-   * failure block the server boot.
-   */
-  async load() {
-    if (this.loaded) return;
-    this.loaded = true;
-    const url = `https://huggingface.co/datasets/${this.repoName}/resolve/main/${CACHE_FILE_PATH}`;
-    try {
-      const res = await fetch(url, {
-        // Send the token even on a public dataset: it lets HF
-        // bump our rate limit and keeps the path identical for
-        // a future private dataset migration.
-        headers: process.env.HF_TOKEN
-          ? { Authorization: `Bearer ${process.env.HF_TOKEN}` }
-          : undefined,
-      });
-      if (!res.ok) {
-        if (res.status === 404) {
-          console.log(
-            `[CategoryCache] Dataset ${this.repoName} or ${CACHE_FILE_PATH} ` +
-              `not found yet - starting empty.`,
-          );
-        } else {
-          console.warn(
-            `[CategoryCache] HTTP ${res.status} loading cache from ` +
-              `${this.repoName}, starting empty.`,
-          );
-        }
-        return;
-      }
-      const data = await res.json();
-      const entries = data?.entries || {};
-      let kept = 0;
-      let staleTaxonomy = 0;
-      for (const [id, raw] of Object.entries(entries)) {
-        if (!raw || typeof raw !== 'object') continue;
-        // Drop entries from a previous taxonomy: their slugs
-        // may no longer exist or may have shifted meaning.
-        // The warmup will re-run them.
-        if (raw.taxonomyVersion !== TAXONOMY_VERSION) {
-          staleTaxonomy++;
-          continue;
-        }
-        this.entries.set(id, {
-          lastModified: raw.lastModified || null,
-          categories: Array.isArray(raw.categories) ? raw.categories : [],
-          categorizedAt: raw.categorizedAt || null,
-          taxonomyVersion: raw.taxonomyVersion,
-        });
-        kept++;
-      }
-      console.log(
-        `[CategoryCache] Loaded ${kept} entries from ${this.repoName}` +
-          (staleTaxonomy ? ` (dropped ${staleTaxonomy} stale taxonomy)` : ''),
-      );
-    } catch (err) {
-      console.warn(
-        `[CategoryCache] Load failed (${err.message}); starting empty.`,
-      );
-    }
-  }
-  get(spaceId) {
-    return this.entries.get(spaceId) || null;
-  }
-  /**
-   * Decide whether `spaceId` needs a fresh classification call.
-   * It does when:
-   *   - we have no entry at all, OR
-   *   - the Space's `lastModified` has moved past our cached one
-   *     (the README may have changed - re-classify), OR
-   *   - the taxonomy version moved (handled at load() time, but
-   *     belt-and-braces for hot reloads).
-   */
-  needsCategorization(spaceId, lastModified) {
-    const entry = this.entries.get(spaceId);
-    if (!entry) return true;
-    if (entry.taxonomyVersion !== TAXONOMY_VERSION) return true;
-    if (lastModified && entry.lastModified !== lastModified) return true;
-    return false;
-  }
-  set(spaceId, { categories, lastModified }) {
-    if (!Array.isArray(categories)) return;
-    const next = {
-      lastModified: lastModified || null,
-      categories: [...categories],
-      categorizedAt: new Date().toISOString(),
-      taxonomyVersion: TAXONOMY_VERSION,
-    };
-    const prev = this.entries.get(spaceId);
-    // Skip the dirty flag if nothing actually changed - avoids
-    // a useless commit when a refresh confirms the same labels.
-    if (
-      prev &&
-      prev.lastModified === next.lastModified &&
-      prev.taxonomyVersion === next.taxonomyVersion &&
-      JSON.stringify(prev.categories) === JSON.stringify(next.categories)
-    ) {
-      return;
-    }
-    this.entries.set(spaceId, next);
-    this.dirty = true;
-  }
-  /**
-   * Persist the in-memory cache to the dataset (one commit, one
-   * file). No-op if nothing has changed since the last flush.
-   *
-   * Auto-creates the dataset on first write if it doesn't exist
-   * yet (so a brand-new `HF_CATEGORIES_DATASET` value bootstraps
-   * cleanly without manual setup).
-   */
-  async flush() {
-    if (!this.dirty || this.flushing) return;
-    if (!process.env.HF_TOKEN) {
-      console.warn('[CategoryCache] HF_TOKEN missing; skipping flush.');
-      return;
-    }
-    this.flushing = true;
-    try {
-      const payload = this.serialize();
-      const blob = new Blob([JSON.stringify(payload, null, 2)], {
-        type: 'application/json',
-      });
-      const repo = { type: 'dataset', name: this.repoName };
-      const credentials = { accessToken: process.env.HF_TOKEN };
-      // First attempt: plain commit. If the dataset doesn't
-      // exist yet, the SDK throws and we fall through to
-      // create-then-commit. We never assume the dataset exists
-      // - that lets a fresh deploy auto-bootstrap.
-      try {
-        await commit({
-          repo,
-          credentials,
-          title: `Update categories (${this.entries.size} apps)`,
-          operations: [
-            {
-              operation: 'addOrUpdate',
-              path: CACHE_FILE_PATH,
-              content: blob,
-            },
-          ],
-        });
-      } catch (err) {
-        const msg = err?.message || '';
-        const looksMissing =
-          msg.includes('404') ||
-          msg.toLowerCase().includes('not found') ||
-          msg.toLowerCase().includes('does not exist');
-        if (!looksMissing) throw err;
-        console.log(
-          `[CategoryCache] Dataset ${this.repoName} missing - creating it.`,
-        );
-        await createRepo({
-          repo,
-          credentials,
-          private: false,
-          // Re-using the same blob so the initial commit ships
-          // the cache content (instead of an empty repo
-          // followed by a no-op commit).
-          files: [
-            {
-              path: CACHE_FILE_PATH,
-              content: await blob.arrayBuffer(),
-            },
-          ],
-        });
-      }
-      this.dirty = false;
-      console.log(
-        `[CategoryCache] Flushed ${this.entries.size} entries to ${this.repoName}`,
-      );
-    } catch (err) {
-      // We deliberately swallow flush errors so a HF outage
-      // doesn't break the running server. The next set() will
-      // re-flag dirty=true and the next flush() will retry.
-      console.error(
-        `[CategoryCache] Flush failed: ${err?.message || err}`,
-      );
-    } finally {
-      this.flushing = false;
-    }
-  }
-  serialize() {
-    const entries = {};
-    for (const [id, entry] of this.entries) {
-      entries[id] = entry;
-    }
-    return {
-      version: CACHE_FORMAT_VERSION,
-      taxonomyVersion: TAXONOMY_VERSION,
-      updatedAt: new Date().toISOString(),
-      entries,
-    };
-  }
-  /**
-   * Diagnostic snapshot for /api/js-apps's `categorization`
-   * sub-payload. Lets the mobile shell decide whether to show
-   * "loading categories..." or to render the chips immediately.
-   */
-  stats() {
-    return {
-      total: this.entries.size,
-      dataset: this.repoName,
-      taxonomyVersion: TAXONOMY_VERSION,
-    };
-  }
-}
-// Singleton: there's only one cache per server process.
-export const categoryCache = new CategoryCache();

server/index.js CHANGED Viewed

@@ -1,42 +1,9 @@
 import express from 'express';
-import { existsSync, readFileSync } from 'fs';
 import path from 'path';
 import { fileURLToPath } from 'url';
-import { categorizeApp, HfTokenMissingError } from './categorize.js';
-import { categoryCache } from './categoryCache.js';
 const __dirname = path.dirname(fileURLToPath(import.meta.url));
-// Load `.env` from the repo root in dev. In production (HF Space)
-// the platform already injects the secrets as env vars, so this
-// loader silently no-ops. We avoid the `dotenv` dep on purpose -
-// the format is trivial, and reproducing it inline keeps the
-// runtime closure tiny.
-(function loadDotenv() {
-  try {
-    const envPath = path.join(__dirname, '..', '.env');
-    if (!existsSync(envPath)) return;
-    const text = readFileSync(envPath, 'utf8');
-    for (const line of text.split(/\r?\n/)) {
-      const m = line.match(/^\s*([A-Z0-9_]+)\s*=\s*(.*?)\s*$/i);
-      if (!m) continue;
-      const [, key, raw] = m;
-      let value = raw;
-      if (
-        (value.startsWith('"') && value.endsWith('"')) ||
-        (value.startsWith("'") && value.endsWith("'"))
-      ) {
-        value = value.slice(1, -1);
-      }
-      // Existing env wins (so `HF_TOKEN=foo node …` overrides .env).
-      if (process.env[key] === undefined) process.env[key] = value;
-    }
-  } catch {
-    /* best-effort - missing or malformed .env never blocks boot */
-  }
-})();
 const app = express();
 const PORT = process.env.PORT || 7860;
@@ -47,77 +14,6 @@ const HF_SPACES_API = 'https://huggingface.co/api/spaces';
 // Note: HF API doesn't support pagination with filter=, so we use a high limit
 const HF_SPACES_LIMIT = 1000;
-// Tag that gates the JS-only subset surfaced by /api/js-apps and
-// fed to the LLM categorizer. Mirrors the filter the mobile shell
-// applies today client-side; the route lets us retire that filter
-// from the mobile codebase down the line.
-const JS_APP_TAG = 'reachy_mini_js_app';
-// =====================================================================
-// App icon convention
-// =====================================================================
-//
-// Convention: an app MAY commit `icon.svg` (preferred) or
-// `icon.png` at the root of its HF Space repository. When present,
-// the mobile shell + desktop store render it as the app glyph
-// instead of the front-matter `emoji:` codepoint.
-//
-// We resolve the icon ONCE at indexing time (here) rather than
-// probing per-client because:
-//   1. We already pull `siblings` from `?full=true` (one cheap
-//      hub call returns the file list for every app), so the
-//      lookup is a pure JS filter, no extra network.
-//   2. Clients see a single field (`iconUrl`) in the payload and
-//      don't have to know about HF resolve URLs, LFS pointers,
-//      or the candidate-order race ("SVG wins if both exist").
-//   3. The HF API caps probes at ~hub side; doing it server-side
-//      keeps fanout under a 5-minute TTL behind ONE token, instead
-//      of every mobile shell hammering `huggingface.co/resolve/`
-//      to discover icons.
-//
-// Resolution order: `icon.svg` → `icon.png`. SVG first because the
-// same asset scales cleanly across every mount point (small rail
-// tile, larger pinned tile, iframe header) from a single file.
-// Extra formats can be added to `ICON_CANDIDATES` if needed; order
-// matters - the first match wins.
-const ICON_CANDIDATES = ['icon.svg', 'icon.png'];
-/**
- * Look for a standard app icon file at the root of the Space.
- * Returns the absolute HF resolve URL when found, `null` otherwise.
- *
- * We hit `resolve/main/` (not `raw/main/`) so:
- *   - LFS pointers follow transparently (large PNGs work).
- *   - `Content-Type` comes from the extension, which `<img>` needs.
- *   - The URL is cacheable cross-session by the browser, so
- *     repeated mounts of the same app glyph don't re-fetch.
- */
-function findIconUrl(spaceId, siblings) {
-  if (!spaceId || !Array.isArray(siblings)) return null;
-  // Build a Set of root-level filenames for O(1) candidate
-  // lookups. HF returns `siblings` as `[{ rfilename: "path/in/repo" }, ...]`,
-  // so we filter to repo-root (no slash) before testing.
-  const rootFiles = new Set();
-  for (const s of siblings) {
-    const name = s && typeof s.rfilename === 'string' ? s.rfilename : null;
-    if (!name) continue;
-    if (name.includes('/')) continue;
-    rootFiles.add(name);
-  }
-  for (const candidate of ICON_CANDIDATES) {
-    if (rootFiles.has(candidate)) {
-      return `https://huggingface.co/spaces/${spaceId}/resolve/main/${candidate}`;
-    }
-  }
-  return null;
-}
-// Serialised LLM batch concurrency: we want at most one
-// categorization sweep running at a time, regardless of how many
-// /api/js-apps requests come in. The flag also prevents the
-// startup warm-up and an on-demand refresh from racing each other.
-let categorizationBatchRunning = false;
 // In-memory cache
 let appsCache = {
   data: null,
@@ -157,13 +53,6 @@ async function fetchAppsFromHF() {
       const author = spaceId.split('/')[0];
       const name = spaceId.split('/').pop();
-      // Server-resolved icon URL. Looks for `icon.svg` or `icon.png`
-      // at the repo root via the `siblings` list returned by
-      // `?full=true`. See `findIconUrl()` above for the rationale.
-      // `null` when the author hasn't shipped one; clients fall
-      // back to the front-matter emoji.
-      const iconUrl = findIconUrl(spaceId, space.siblings);
       return {
         // Core fields (used by both website and desktop)
         id: spaceId,
@@ -172,8 +61,7 @@ async function fetchAppsFromHF() {
         url: `https://huggingface.co/spaces/${spaceId}`,
         source_kind: 'hf_space',
         isOfficial,
-        iconUrl,
         // Extra metadata (desktop-compatible structure)
         extra: {
           id: spaceId,
@@ -295,221 +183,6 @@ app.get('/api/apps', async (req, res) => {
   }
 });
-// =====================================================================
-// JS apps + LLM-inferred categories
-// =====================================================================
-//
-// `/api/js-apps` is a curated view on top of `/api/apps`:
-//   1. Filter on the `reachy_mini_js_app` tag (the mobile-embeddable subset).
-//   2. Enrich each entry with `categories` + `categories_source`,
-//      sourced from a persistent dataset cache (see categoryCache.js).
-//
-// Categories are inferred lazily by an LLM from each Space's
-// README. The first request after a cold start may see entries
-// with `categories: null` while the warmup batch is still in
-// flight; subsequent requests pick them up as the cache fills.
-/**
- * Pull the JS-app subset out of the global apps cache and fold
- * in cached categories. Pure, synchronous-ish (the only async
- * call is to the upstream `getApps()` which has its own cache).
- */
-async function getJsApps() {
-  const apps = await getApps();
-  const jsApps = apps.filter((a) => {
-    const tags = a?.extra?.tags;
-    return Array.isArray(tags) && tags.includes(JS_APP_TAG);
-  });
-  return jsApps.map((app) => {
-    const cached = categoryCache.get(app.id);
-    return {
-      ...app,
-      categories: cached ? cached.categories : null,
-      categories_source: cached ? 'inferred' : null,
-      categorized_at: cached ? cached.categorizedAt : null,
-    };
-  });
-}
-/**
- * Run one classification pass over `jsApps`. Skips entries whose
- * cache is still fresh (same `lastModified`, same taxonomy).
- *
- * Serial on purpose: HF Inference Providers don't love bursts
- * from a single token, and total throughput on ~50 apps stays
- * well under a minute. We slip a small jitter between calls to
- * smooth the curve further.
- */
-async function runCategorizationBatch(jsApps) {
-  if (categorizationBatchRunning) {
-    console.log('[Categorize] Batch already running, skipping.');
-    return;
-  }
-  if (!process.env.HF_TOKEN) {
-    console.warn(
-      '[Categorize] HF_TOKEN not set; skipping batch. Set it in .env ' +
-        'or the Space secrets to enable category inference.',
-    );
-    return;
-  }
-  const todo = jsApps.filter((app) =>
-    categoryCache.needsCategorization(app.id, app?.extra?.lastModified),
-  );
-  if (todo.length === 0) {
-    console.log(
-      `[Categorize] All ${jsApps.length} JS apps are already categorized.`,
-    );
-    return;
-  }
-  categorizationBatchRunning = true;
-  console.log(
-    `[Categorize] Starting batch: ${todo.length}/${jsApps.length} app(s) need classification.`,
-  );
-  let success = 0;
-  let failed = 0;
-  let aborted = false;
-  for (let i = 0; i < todo.length; i++) {
-    const app = todo[i];
-    const desc =
-      app.description ||
-      app.extra?.cardData?.short_description ||
-      '';
-    try {
-      const slugs = await categorizeApp({
-        spaceId: app.id,
-        name: app.name,
-        description: desc,
-      });
-      if (slugs == null) {
-        failed++;
-        console.log(
-          `[Categorize]   (${i + 1}/${todo.length}) ${app.id}: transient failure, will retry next pass`,
-        );
-      } else {
-        categoryCache.set(app.id, {
-          categories: slugs,
-          lastModified: app.extra?.lastModified || null,
-        });
-        success++;
-        console.log(
-          `[Categorize]   (${i + 1}/${todo.length}) ${app.id}: ${
-            slugs.length ? slugs.join(', ') : '(no fit)'
-          }`,
-        );
-      }
-    } catch (err) {
-      if (err instanceof HfTokenMissingError) {
-        console.warn(
-          '[Categorize] HF_TOKEN missing mid-batch; aborting cleanly.',
-        );
-        aborted = true;
-        break;
-      }
-      failed++;
-      console.warn(
-        `[Categorize]   (${i + 1}/${todo.length}) ${app.id}: error - ${err.message}`,
-      );
-    }
-    // 250 ms cooldown between calls. Below this, the HF Provider
-    // router occasionally rate-limits a hot token.
-    await new Promise((resolve) => setTimeout(resolve, 250));
-  }
-  console.log(
-    `[Categorize] Batch done: ${success} ok, ${failed} failed${aborted ? ' (aborted)' : ''}.`,
-  );
-  // Persist the new entries even if some failed - partial
-  // progress is strictly better than none, and the failed
-  // entries will be retried on the next pass.
-  await categoryCache.flush();
-  categorizationBatchRunning = false;
-}
-/**
- * Wrap the diagnostic snapshot for the API payload. Lets
- * consumers (mobile shell, website) decide whether to show
- * "loading categories..." or render chips immediately.
- */
-function buildCategorizationStats(jsApps) {
-  let withCategories = 0;
-  for (const app of jsApps) {
-    if (app.categories && app.categories.length >= 0 && app.categories_source) {
-      withCategories++;
-    }
-  }
-  return {
-    enabled: !!process.env.HF_TOKEN,
-    total: jsApps.length,
-    classified: withCategories,
-    pending: jsApps.length - withCategories,
-    inProgress: categorizationBatchRunning,
-    ...categoryCache.stats(),
-  };
-}
-app.get('/api/js-apps', async (req, res) => {
-  try {
-    const apps = await getJsApps();
-    // Background top-up: if any entry is still uncategorized
-    // (or a Space's lastModified moved since we last looked),
-    // fire off a batch. We DO NOT await it - the response goes
-    // out immediately with whatever the cache currently knows.
-    const needsWork = apps.some(
-      (a) =>
-        !a.categories_source ||
-        categoryCache.needsCategorization(a.id, a.extra?.lastModified),
-    );
-    if (needsWork) {
-      // `void` to make it crystal clear we don't expect a value;
-      // the batch logs its own progress.
-      void runCategorizationBatch(apps).catch((err) => {
-        console.error('[Categorize] Background batch crashed:', err);
-      });
-    }
-    res.json({
-      apps,
-      cached: true,
-      cacheAge: appsCache.lastFetch
-        ? Math.round((Date.now() - appsCache.lastFetch) / 1000)
-        : 0,
-      count: apps.length,
-      categorization: buildCategorizationStats(apps),
-    });
-  } catch (err) {
-    console.error('[API] /api/js-apps error:', err);
-    res.status(500).json({ error: 'Failed to fetch JS apps' });
-  }
-});
-// Manual trigger for a categorization sweep, useful when
-// hand-tuning the taxonomy or testing the LLM prompt without
-// waiting for the next /api/js-apps hit.
-app.post('/api/js-apps/refresh-categories', async (req, res) => {
-  try {
-    const apps = await getJsApps();
-    void runCategorizationBatch(apps).catch((err) => {
-      console.error('[Categorize] Manual batch crashed:', err);
-    });
-    res.json({
-      ok: true,
-      message: `Categorization batch kicked off for ${apps.length} JS apps.`,
-      stats: buildCategorizationStats(apps),
-    });
-  } catch (err) {
-    res.status(500).json({ error: 'Failed to trigger refresh' });
-  }
-});
 // OAuth config endpoint - expose public OAuth variables to the frontend
 // (Docker Spaces don't auto-inject window.huggingface.variables like static Spaces)
 app.get('/api/oauth-config', (req, res) => {
@@ -562,29 +235,8 @@ app.get('*', (req, res) => {
 async function warmCache() {
   console.log('[Startup] Pre-warming cache...');
   try {
-    const apps = await getApps();
     console.log('[Startup] Cache warmed successfully');
-    // Categorization warm-up: fire the JS-app batch in the
-    // background so the first /api/js-apps caller doesn't
-    // shoulder the cold-start cost. Order: load the dataset
-    // cache first (cheap, one HTTP call), then run the batch
-    // for stale entries only.
-    void (async () => {
-      try {
-        await categoryCache.load();
-        const jsApps = apps.filter((a) => {
-          const tags = a?.extra?.tags;
-          return Array.isArray(tags) && tags.includes(JS_APP_TAG);
-        });
-        console.log(
-          `[Startup] Found ${jsApps.length} JS apps; checking categories...`,
-        );
-        await runCategorizationBatch(jsApps);
-      } catch (err) {
-        console.error('[Startup] Categorization warm-up failed:', err);
-      }
-    })();
   } catch (err) {
     console.error('[Startup] Failed to warm cache:', err);
   }

 import express from 'express';
 import path from 'path';
 import { fileURLToPath } from 'url';
 const __dirname = path.dirname(fileURLToPath(import.meta.url));
 const app = express();
 const PORT = process.env.PORT || 7860;
 // Note: HF API doesn't support pagination with filter=, so we use a high limit
 const HF_SPACES_LIMIT = 1000;
 // In-memory cache
 let appsCache = {
   data: null,
       const author = spaceId.split('/')[0];
       const name = spaceId.split('/').pop();
       return {
         // Core fields (used by both website and desktop)
         id: spaceId,
         url: `https://huggingface.co/spaces/${spaceId}`,
         source_kind: 'hf_space',
         isOfficial,
         // Extra metadata (desktop-compatible structure)
         extra: {
           id: spaceId,
   }
 });
 // OAuth config endpoint - expose public OAuth variables to the frontend
 // (Docker Spaces don't auto-inject window.huggingface.variables like static Spaces)
 app.get('/api/oauth-config', (req, res) => {
 async function warmCache() {
   console.log('[Startup] Pre-warming cache...');
   try {
+    await getApps();
     console.log('[Startup] Cache warmed successfully');
   } catch (err) {
     console.error('[Startup] Failed to warm cache:', err);
   }

src/pages/Buy.jsx CHANGED Viewed

@@ -41,7 +41,7 @@ const products = {
     price: 449,
     badge: 'Wireless',
     badgeColor: '#0ea5e9',
-    description: 'Self-contained robot with on-board compute. Works wirelessly or wired, perfect for standalone projects and demos. <strong>Ships in 60 days</strong>.',
     buyLink: 'https://buy.stripe.com/9B65kFfFlaKFbY34W873G03',
     image: '/assets/reachy-wireless.png',
     featured: true,
@@ -52,7 +52,7 @@ const products = {
     price: 299,
     badge: 'Lite',
     badgeColor: '#f59e0b',
-    description: 'Connect to your computer via USB. Same expressive robot, powered by your machine. Ideal for development and learning. <strong>Ships in 30 days</strong>.',
     buyLink: 'https://buy.stripe.com/6oUfZj78P1a5e6b0FS73G02',
     image: '/assets/reachy-lite.png',
     featured: false,
@@ -68,7 +68,7 @@ const comparisonFeatures = [
   { name: 'Camera', wireless: 'Wide angle', lite: 'Wide angle' },
   { name: 'Microphones', wireless: '4 microphones array', lite: '4 microphones array' },
   { name: 'Speaker', wireless: '5W speaker', lite: '5W speaker' },
-  { name: 'On-board Compute', wireless: 'Raspberry Pi CM 4 (16GB storage)', lite: false },
   { name: 'Accelerometer', wireless: 'Built-in IMU', lite: false },
   { name: 'Wi-Fi Connectivity', wireless: 'Wi-Fi', lite: false },
   { name: 'Standalone Mode', wireless: true, lite: false },
@@ -90,7 +90,7 @@ const boxContents = [
 const faqItems = [
   {
     question: 'What is the difference between Wireless and Lite?',
-    answer: 'The Wireless version includes a Raspberry Pi CM 4 built-in, allowing it to run standalone without a computer. The Lite version connects to your Mac, Linux, or Windows computer via USB and uses your computer for processing. Both versions have the same mechanical design and audio/video capabilities.',
   },
   {
     question: 'How long does assembly take?',
@@ -338,7 +338,7 @@ function ProductCardsSection() {
                 <Stack spacing={1} sx={{ mb: 3 }}>
                   {key === 'wireless' ? (
                     <>
-                      <FeatureRow icon="✓" text="On-board Raspberry Pi CM 4" highlight />
                       <FeatureRow icon="✓" text="Wi-Fi + USB connectivity" highlight />
                       <FeatureRow icon="✓" text="Built-in IMU" highlight />
                     </>
@@ -379,7 +379,7 @@ function ProductCardsSection() {
           variant="body1"
           sx={{ fontWeight: 600, color: 'text.primary' }}
         >
-          Current Lead time: 30 days for Lite, 60 days for Wireless after purchase
         </Typography>
         <Typography
           variant="body2"

     price: 449,
     badge: 'Wireless',
     badgeColor: '#0ea5e9',
+    description: 'Self-contained robot with on-board compute. Works wirelessly or wired, perfect for standalone projects and demos. <strong>Ships in 90 days</strong>.',
     buyLink: 'https://buy.stripe.com/9B65kFfFlaKFbY34W873G03',
     image: '/assets/reachy-wireless.png',
     featured: true,
     price: 299,
     badge: 'Lite',
     badgeColor: '#f59e0b',
+    description: 'Connect to your computer via USB. Same expressive robot, powered by your machine. Ideal for development and learning. <strong>Ships in 90 days</strong>.',
     buyLink: 'https://buy.stripe.com/6oUfZj78P1a5e6b0FS73G02',
     image: '/assets/reachy-lite.png',
     featured: false,
   { name: 'Camera', wireless: 'Wide angle', lite: 'Wide angle' },
   { name: 'Microphones', wireless: '4 microphones array', lite: '4 microphones array' },
   { name: 'Speaker', wireless: '5W speaker', lite: '5W speaker' },
+  { name: 'On-board Compute', wireless: 'Raspberry Pi 4 (16GB storage)', lite: false },
   { name: 'Accelerometer', wireless: 'Built-in IMU', lite: false },
   { name: 'Wi-Fi Connectivity', wireless: 'Wi-Fi', lite: false },
   { name: 'Standalone Mode', wireless: true, lite: false },
 const faqItems = [
   {
     question: 'What is the difference between Wireless and Lite?',
+    answer: 'The Wireless version includes a Raspberry Pi 4 built-in, allowing it to run standalone without a computer. The Lite version connects to your Mac, Linux, or Windows computer via USB and uses your computer for processing. Both versions have the same mechanical design and audio/video capabilities.',
   },
   {
     question: 'How long does assembly take?',
                 <Stack spacing={1} sx={{ mb: 3 }}>
                   {key === 'wireless' ? (
                     <>
+                      <FeatureRow icon="✓" text="On-board Raspberry Pi 4" highlight />
                       <FeatureRow icon="✓" text="Wi-Fi + USB connectivity" highlight />
                       <FeatureRow icon="✓" text="Built-in IMU" highlight />
                     </>
           variant="body1"
           sx={{ fontWeight: 600, color: 'text.primary' }}
         >
+          Current Lead time: 90 days after purchase
         </Typography>
         <Typography
           variant="body2"

src/pages/Download.jsx CHANGED Viewed

@@ -18,7 +18,6 @@ import CheckCircleIcon from '@mui/icons-material/CheckCircle';
 import OpenInNewIcon from '@mui/icons-material/OpenInNew';
 import ExpandMoreIcon from '@mui/icons-material/ExpandMore';
 import ExpandLessIcon from '@mui/icons-material/ExpandLess';
-import DesktopWindowsIcon from '@mui/icons-material/DesktopWindows';
 import Layout from '../components/Layout';
@@ -66,11 +65,6 @@ function detectPlatform() {
   return 'darwin-aarch64';
 }
-function isMobileDevice() {
-  const ua = navigator.userAgent;
-  return /Android|webOS|iPhone|iPad|iPod|BlackBerry|IEMobile|Opera Mini/i.test(ua);
-}
 // Format date
 function formatDate(dateString) {
   const date = new Date(dateString);
@@ -180,9 +174,6 @@ function parseReleasePlatforms(assets) {
     const name = asset.name.toLowerCase();
     const url = asset.browser_download_url;
-    // Skip signature files entirely
-    if (name.endsWith('.sig')) return;
     // macOS Apple Silicon - prefer .dmg
     if (name.includes('arm64.dmg')) {
       platforms['darwin-aarch64'] = { url };
@@ -190,13 +181,13 @@ function parseReleasePlatforms(assets) {
       platforms['darwin-aarch64'] = { url };
     }
-    // Windows - .msi
     if (name.endsWith('.msi')) {
       platforms['windows-x86_64'] = { url };
     }
     // Linux - .deb
-    if (name.endsWith('.deb')) {
       platforms['linux-x86_64'] = { url };
     }
   });
@@ -321,13 +312,11 @@ export default function Download() {
   const [detectedPlatform, setDetectedPlatform] = useState(null);
   const [loading, setLoading] = useState(true);
   const [showAllReleases, setShowAllReleases] = useState(false);
-  const [isMobile, setIsMobile] = useState(false);
   const [error, setError] = useState(null);
   useEffect(() => {
     setDetectedPlatform(detectPlatform());
-    setIsMobile(isMobileDevice());
     // Fetch latest release info from GitHub API
     async function fetchReleases() {
@@ -532,97 +521,67 @@ export default function Download() {
               </Typography>
             </Stack>
-            {/* Primary download button or mobile notice */}
-            {isMobile ? (
               <Box
                 sx={{
-                  mt: 2,
-                  p: 3,
-                  background: 'linear-gradient(135deg, rgba(255, 149, 0, 0.1) 0%, rgba(139, 92, 246, 0.08) 100%)',
-                  border: '1px solid rgba(255, 149, 0, 0.3)',
-                  borderRadius: 3,
                   maxWidth: 500,
                   mx: 'auto',
                 }}
               >
-                <DesktopWindowsIcon sx={{ fontSize: 40, color: 'rgba(255,255,255,0.5)', mb: 1.5 }} />
-                <Typography
-                  variant="body1"
-                  sx={{ color: 'rgba(255,255,255,0.9)', fontWeight: 600, mb: 1 }}
-                >
-                  Desktop only
-                </Typography>
-                <Typography
-                  variant="body2"
-                  sx={{ color: 'rgba(255,255,255,0.6)' }}
-                >
-                  Reachy Mini Control is a desktop application available for macOS, Windows, and Linux. Please visit this page from a computer to download it.
-                </Typography>
-              </Box>
-            ) : (
-              <>
-                <Button
-                  variant="contained"
-                  size="large"
-                  href={currentUrl}
-                  startIcon={<DownloadIcon />}
-                  sx={{
-                    px: 6,
-                    py: 2,
-                    fontSize: 17,
-                    fontWeight: 600,
-                    borderRadius: 3,
-                    background: 'linear-gradient(135deg, #FF9500 0%, #764ba2 100%)',
-                    boxShadow: '0 8px 32px rgba(255, 149, 0, 0.35)',
-                    transition: 'all 0.3s ease',
-                    '&:hover': {
-                      boxShadow: '0 12px 48px rgba(59, 130, 246, 0.5)',
-                      transform: 'translateY(-2px)',
-                    },
-                  }}
-                >
-                  Download for {currentPlatform?.name}
-                </Button>
                 <Typography
                   variant="body2"
                   sx={{
-                    color: 'rgba(255,255,255,0.4)',
-                    mt: 2,
-                    fontSize: 13,
                   }}
                 >
-                  {currentPlatform?.subtitle} • {currentPlatform?.format?.replace('.', '').toUpperCase()} package
                 </Typography>
-                {/* Beta Warning for Windows and Linux */}
-                {(detectedPlatform?.startsWith('windows') || detectedPlatform?.includes('linux')) && (
-                  <Box
-                    sx={{
-                      mt: 3,
-                      p: 2.5,
-                      background: 'linear-gradient(135deg, rgba(59, 130, 246, 0.1) 0%, rgba(139, 92, 246, 0.08) 100%)',
-                      border: '1px solid rgba(59, 130, 246, 0.3)',
-                      borderRadius: 2,
-                      maxWidth: 500,
-                      mx: 'auto',
-                    }}
-                  >
-                    <Typography
-                      variant="body2"
-                      sx={{
-                        color: 'rgba(255,255,255,0.8)',
-                        fontWeight: 500,
-                      }}
-                    >
-                      {detectedPlatform?.startsWith('windows')
-                        ? <>⚠️ Windows version is currently in Beta - installation requires <strong style={{ color: 'rgba(255,255,255,0.9)' }}>administrator privileges</strong>.</>
-                        : <>⚠️ Linux version is currently in Beta - please report any issues on <a href="https://github.com/pollen-robotics/reachy-mini-desktop-app/issues" target="_blank" rel="noopener noreferrer" style={{ color: '#3b82f6', textDecoration: 'underline' }}>GitHub</a> or <a href="https://discord.gg/HDrGY9eJHt" target="_blank" rel="noopener noreferrer" style={{ color: '#3b82f6', textDecoration: 'underline' }}>Discord</a>.</>
-                      }
-                    </Typography>
-                  </Box>
-                )}
-              </>
             )}
             {/* App screenshot */}
@@ -641,37 +600,35 @@ export default function Download() {
             />
           </Box>
-          {/* All platforms - hidden on mobile */}
-          {!isMobile && (
-            <Box sx={{ mb: 8 }}>
-              <Typography
-                variant="overline"
-                sx={{
-                  color: 'rgba(255,255,255,0.4)',
-                  display: 'block',
-                  textAlign: 'center',
-                  mb: 3,
-                  letterSpacing: 2,
-                }}
-              >
-                Available for all platforms
-              </Typography>
-              <Grid container spacing={2}>
-                {['darwin-aarch64', 'windows-x86_64', 'linux-x86_64'].map((key) => (
-                  <Grid size={{ xs: 12, sm: 4 }} key={key}>
-                    <PlatformCard
-                      platformKey={key}
-                      url={releaseData?.platforms[key]?.url}
-                      isActive={key === detectedPlatform}
-                      onClick={() => setDetectedPlatform(key)}
-                    />
-                  </Grid>
-                ))}
-              </Grid>
-            </Box>
-          )}
           {/* Features / What's included */}
           <Box

 import OpenInNewIcon from '@mui/icons-material/OpenInNew';
 import ExpandMoreIcon from '@mui/icons-material/ExpandMore';
 import ExpandLessIcon from '@mui/icons-material/ExpandLess';
 import Layout from '../components/Layout';
   return 'darwin-aarch64';
 }
 // Format date
 function formatDate(dateString) {
   const date = new Date(dateString);
     const name = asset.name.toLowerCase();
     const url = asset.browser_download_url;
     // macOS Apple Silicon - prefer .dmg
     if (name.includes('arm64.dmg')) {
       platforms['darwin-aarch64'] = { url };
       platforms['darwin-aarch64'] = { url };
     }
+    // Windows - .msi (exclude .sig signature files)
     if (name.endsWith('.msi')) {
       platforms['windows-x86_64'] = { url };
     }
     // Linux - .deb
+    if (name.includes('amd64.deb')) {
       platforms['linux-x86_64'] = { url };
     }
   });
   const [detectedPlatform, setDetectedPlatform] = useState(null);
   const [loading, setLoading] = useState(true);
   const [showAllReleases, setShowAllReleases] = useState(false);
   const [error, setError] = useState(null);
   useEffect(() => {
     setDetectedPlatform(detectPlatform());
     // Fetch latest release info from GitHub API
     async function fetchReleases() {
               </Typography>
             </Stack>
+            {/* Primary download button */}
+            <Button
+              variant="contained"
+              size="large"
+              href={currentUrl}
+              startIcon={<DownloadIcon />}
+              sx={{
+                px: 6,
+                py: 2,
+                fontSize: 17,
+                fontWeight: 600,
+                borderRadius: 3,
+                background: 'linear-gradient(135deg, #FF9500 0%, #764ba2 100%)',
+                boxShadow: '0 8px 32px rgba(255, 149, 0, 0.35)',
+                transition: 'all 0.3s ease',
+                '&:hover': {
+                  boxShadow: '0 12px 48px rgba(59, 130, 246, 0.5)',
+                  transform: 'translateY(-2px)',
+                },
+              }}
+            >
+              Download for {currentPlatform?.name}
+            </Button>
+            <Typography
+              variant="body2"
+              sx={{
+                color: 'rgba(255,255,255,0.4)',
+                mt: 2,
+                fontSize: 13,
+              }}
+            >
+              {currentPlatform?.subtitle} • {currentPlatform?.format?.replace('.', '').toUpperCase()} package
+            </Typography>
+            {/* Beta Warning for Windows and Linux */}
+            {(detectedPlatform?.startsWith('windows') || detectedPlatform?.includes('linux')) && (
               <Box
                 sx={{
+                  mt: 3,
+                  p: 2.5,
+                  background: 'linear-gradient(135deg, rgba(59, 130, 246, 0.1) 0%, rgba(139, 92, 246, 0.08) 100%)',
+                  border: '1px solid rgba(59, 130, 246, 0.3)',
+                  borderRadius: 2,
                   maxWidth: 500,
                   mx: 'auto',
                 }}
               >
                 <Typography
                   variant="body2"
                   sx={{
+                    color: 'rgba(255,255,255,0.8)',
+                    fontWeight: 500,
                   }}
                 >
+                  {detectedPlatform?.startsWith('windows')
+                    ? <>⚠️ Windows version is currently in Beta — installation requires <strong style={{ color: 'rgba(255,255,255,0.9)' }}>administrator privileges</strong>.</>
+                    : <>⚠️ Linux version is currently in Beta — please report any issues on <a href="https://github.com/pollen-robotics/reachy-mini-desktop-app/issues" target="_blank" rel="noopener noreferrer" style={{ color: '#3b82f6', textDecoration: 'underline' }}>GitHub</a> or <a href="https://discord.gg/HDrGY9eJHt" target="_blank" rel="noopener noreferrer" style={{ color: '#3b82f6', textDecoration: 'underline' }}>Discord</a>.</>
+                  }
                 </Typography>
+              </Box>
             )}
             {/* App screenshot */}
             />
           </Box>
+          {/* All platforms */}
+          <Box sx={{ mb: 8 }}>
+            <Typography
+              variant="overline"
+              sx={{
+                color: 'rgba(255,255,255,0.4)',
+                display: 'block',
+                textAlign: 'center',
+                mb: 3,
+                letterSpacing: 2,
+              }}
+            >
+              Available for all platforms
+            </Typography>
+            <Grid container spacing={2}>
+              {['darwin-aarch64', 'windows-x86_64', 'linux-x86_64'].map((key) => (
+                <Grid size={{ xs: 12, sm: 4 }} key={key}>
+                  <PlatformCard
+                    platformKey={key}
+                    url={releaseData?.platforms[key]?.url}
+                    isActive={key === detectedPlatform}
+                    onClick={() => setDetectedPlatform(key)}
+                  />
+                </Grid>
+              ))}
+            </Grid>
+          </Box>
           {/* Features / What's included */}
           <Box

src/pages/GettingStarted.jsx CHANGED Viewed

@@ -1,4 +1,4 @@
-import { useState, useEffect } from 'react';
 import { Link as RouterLink, useLocation } from 'react-router-dom';
 import {
   Box,
@@ -18,7 +18,6 @@ import {
 } from '@mui/material';
 import OpenInNewIcon from '@mui/icons-material/OpenInNew';
 import DownloadIcon from '@mui/icons-material/Download';
-import DesktopWindowsIcon from '@mui/icons-material/DesktopWindows';
 import WifiIcon from '@mui/icons-material/Wifi';
 import UsbIcon from '@mui/icons-material/Usb';
 import CheckCircleIcon from '@mui/icons-material/CheckCircle';
@@ -142,11 +141,6 @@ function YouTubeEmbed({ videoId, title, version = 'wireless' }) {
   );
 }
-function isMobileDevice() {
-  const ua = navigator.userAgent;
-  return /Android|webOS|iPhone|iPad|iPod|BlackBerry|IEMobile|Opera Mini/i.test(ua);
-}
 export default function GettingStarted() {
   const location = useLocation();
   const params = new URLSearchParams(location.search);
@@ -154,11 +148,6 @@ export default function GettingStarted() {
   const [version, setVersion] = useState(
     urlVersion === 'lite' ? 'lite' : 'wireless'
   );
-  const [isMobile, setIsMobile] = useState(false);
-  useEffect(() => {
-    setIsMobile(isMobileDevice());
-  }, []);
   return (
     <Layout transparentHeader>
@@ -339,45 +328,23 @@ export default function GettingStarted() {
                         <Typography variant="caption" sx={{ display: 'block', mb: 2, color: 'warning.main' }}>
                           Desktop App available for macOS (Apple Silicon), Windows & Linux (beta).
                         </Typography>
-                        {isMobile ? (
-                          <Box
-                            sx={{
-                              p: 2,
-                              bgcolor: 'action.hover',
-                              borderRadius: 2,
-                              border: '1px solid',
-                              borderColor: 'divider',
-                              display: 'flex',
-                              alignItems: 'center',
-                              gap: 1.5,
-                            }}
-                          >
-                            <DesktopWindowsIcon sx={{ color: 'text.secondary', fontSize: 20 }} />
-                            <Typography variant="body2" color="text.secondary">
-                              The desktop app can only be downloaded from a computer.
-                            </Typography>
-                          </Box>
-                        ) : (
-                          <>
-                            <Button
-                              variant="contained"
-                              component={RouterLink}
-                              to="/download"
-                              startIcon={<DownloadIcon/>}
-                            >
-                              Download Desktop App
-                            </Button>
-                            <Button
-                              variant="outlined"
-                              href="https://huggingface.co/docs/reachy_mini/SDK/installation"
-                              target="_blank"
-                              startIcon={<OpenInNewIcon/>}
-                            >
-                              Alternative: Python SDK
-                            </Button>
-                          </>
-                        )}
                       </StepContent>
                     </Step>
@@ -433,7 +400,7 @@ export default function GettingStarted() {
               <Typography variant="body1" color="text.secondary" sx={{ mb: 4, maxWidth: 600, mx: 'auto' }}>
                 Follow our visual guide to put together your Reachy Mini.
-                Most people finish in <strong>2-3 hours</strong> - our record is 43 minutes! 🏆
               </Typography>
               <Box
@@ -512,45 +479,23 @@ export default function GettingStarted() {
                         <Typography variant="caption" sx={{ display: 'block', mb: 2, color: 'warning.main' }}>
                           Desktop App available for macOS (Apple Silicon), Windows & Linux (beta).
                         </Typography>
-                        {isMobile ? (
-                          <Box
-                            sx={{
-                              p: 2,
-                              bgcolor: 'action.hover',
-                              borderRadius: 2,
-                              border: '1px solid',
-                              borderColor: 'divider',
-                              display: 'flex',
-                              alignItems: 'center',
-                              gap: 1.5,
-                            }}
-                          >
-                            <DesktopWindowsIcon sx={{ color: 'text.secondary', fontSize: 20 }} />
-                            <Typography variant="body2" color="text.secondary">
-                              The desktop app can only be downloaded from a computer.
-                            </Typography>
-                          </Box>
-                        ) : (
-                          <>
-                            <Button
-                              variant="contained"
-                              component={RouterLink}
-                              to="/download"
-                              startIcon={<DownloadIcon/>}
-                            >
-                              Download Desktop App
-                            </Button>
-                            <Button
-                              variant="outlined"
-                              href="https://huggingface.co/docs/reachy_mini/SDK/installation"
-                              target="_blank"
-                              startIcon={<OpenInNewIcon/>}
-                            >
-                              Alternative: Python SDK
-                            </Button>
-                          </>
-                        )}
                       </StepContent>
                     </Step>
                     <Step active completed={false}>

+import { useState } from 'react';
 import { Link as RouterLink, useLocation } from 'react-router-dom';
 import {
   Box,
 } from '@mui/material';
 import OpenInNewIcon from '@mui/icons-material/OpenInNew';
 import DownloadIcon from '@mui/icons-material/Download';
 import WifiIcon from '@mui/icons-material/Wifi';
 import UsbIcon from '@mui/icons-material/Usb';
 import CheckCircleIcon from '@mui/icons-material/CheckCircle';
   );
 }
 export default function GettingStarted() {
   const location = useLocation();
   const params = new URLSearchParams(location.search);
   const [version, setVersion] = useState(
     urlVersion === 'lite' ? 'lite' : 'wireless'
   );
   return (
     <Layout transparentHeader>
                         <Typography variant="caption" sx={{ display: 'block', mb: 2, color: 'warning.main' }}>
                           Desktop App available for macOS (Apple Silicon), Windows & Linux (beta).
                         </Typography>
+                        <Button
+                          variant="contained"
+                          component={RouterLink}
+                          to="/download"
+                          startIcon={<DownloadIcon/>}
+                        >
+                          Download Desktop App
+                        </Button>
+                        <Button
+                          variant="outlined"
+                          href="https://huggingface.co/docs/reachy_mini/SDK/installation"
+                          target="_blank"
+                          startIcon={<OpenInNewIcon/>}
+                        >
+                          Alternative: Python SDK
+                        </Button>
                       </StepContent>
                     </Step>
               <Typography variant="body1" color="text.secondary" sx={{ mb: 4, maxWidth: 600, mx: 'auto' }}>
                 Follow our visual guide to put together your Reachy Mini.
+                Most people finish in <strong>2-3 hours</strong> — our record is 43 minutes! 🏆
               </Typography>
               <Box
                         <Typography variant="caption" sx={{ display: 'block', mb: 2, color: 'warning.main' }}>
                           Desktop App available for macOS (Apple Silicon), Windows & Linux (beta).
                         </Typography>
+                        <Button
+                          variant="contained"
+                          component={RouterLink}
+                          to="/download"
+                          startIcon={<DownloadIcon/>}
+                        >
+                          Download Desktop App
+                        </Button>
+                        <Button
+                          variant="outlined"
+                          href="https://huggingface.co/docs/reachy_mini/SDK/installation"
+                          target="_blank"
+                          startIcon={<OpenInNewIcon/>}
+                        >
+                          Alternative: Python SDK
+                        </Button>
                       </StepContent>
                     </Step>
                     <Step active completed={false}>

src/pages/Home.jsx CHANGED Viewed

@@ -665,7 +665,7 @@ function ProductsSection() {
               sx={{ mb: 4, textAlign: "left", maxWidth: 280, mx: "auto" }}
             >
               {[
-                "Raspberry Pi CM 4 on-board",
                 "Wi-Fi + USB",
                 "Camera, 4 mics, speaker",
                 "Accelerometer",
@@ -763,7 +763,7 @@ function ProductsSection() {
             fontWeight: 600,
           }}
         >
-          Current Lead time: 30 days for Lite, 60 days for Wireless after purchase
         </Typography>
         <Typography
           variant="body2"

               sx={{ mb: 4, textAlign: "left", maxWidth: 280, mx: "auto" }}
             >
               {[
+                "Raspberry Pi 4 on-board",
                 "Wi-Fi + USB",
                 "Camera, 4 mics, speaker",
                 "Accelerometer",
             fontWeight: 600,
           }}
         >
+          Current Lead time: 90 days after purchase
         </Typography>
         <Typography
           variant="body2"