--- title: Phantom Grid emoji: ðŸ•ĩïļ colorFrom: indigo colorTo: gray sdk: gradio sdk_version: 6.17.3 app_file: app.py python_version: "3.10" pinned: false license: mit hf_oauth: false tags: - thousand-token-wood - delightful - game - agent - minicpm - track:wood - sponsor:openbmb - sponsor:openai - achievement:offgrid - achievement:offbrand - achievement:llama --- # ðŸ•ĩïļ Phantom Grid **An AI-driven noir detective game.** You are a detective hunting a phantom suspect across a stylized London grid. Issue notices, raise lookouts, set blockades, and **interview AI-roleplayed witnesses** whose memories decay over time — all rendered in a custom `gr.Server` HTML/JS board interface > Track: **Delightful — Thousand Token Wood** (an AI-driven game). ## 🏅 Prize-category badges - ðŸŽŪ **Thousand Token Wood / Delightful** — a playable, AI-driven detective game. - ðŸŽĻ **Off Brand** — fully custom `gr.Server` HTML/JS frontend, well beyond the stock components. - ðŸŠķ **Small & Mighty** — runs entirely on a single under-32B model (MiniCPM4.1-8B). ## ðŸĪ– Model & inference - **Model:** [`openbmb/MiniCPM4.1-8B`](https://huggingface.co/openbmb/MiniCPM4.1-8B) (text, bf16 transformers) — ~8B params, well under the 32B cap. - **Inference:** in-process Hugging Face `transformers`, placed on `cuda` at module load (using ZeroGPU's PyTorch CUDA emulation), with the real GPU attached only inside a `@spaces.GPU`-decorated `generate()` call. ## ðŸ–Ĩïļ Hardware Runs on **ZeroGPU** (NVIDIA RTX Pro 6000 Blackwell, `large` / 48 GB VRAM; 40 min/day for Team org members). Each generation is capped at `PHANTOM_GRID_ZEROGPU_DURATION` seconds (default 90). No voice path for now (`PHANTOM_GRID_WITNESS_CHAT_TTS=0`). ## ðŸŽĨ Demo video https://www.youtube.com/watch?v=p8iSjatInXo ## ðŸ“Ģ Social post Launch post on X: https://x.com/unityashtv/status/2066633879109382378 ## â–ķïļ How to play Start a new case, read the briefing, then use the board tools (notices, lookouts, blockades, searches) and question witnesses to corner the suspect before the turn limit. ## Local Deployment For running a version of the game with a locally running llama cpp backend find the code at https://github.com/U4AR/JohnDoe