phantom-grid / README.md
unity4ar's picture
Update README.md
7c640d5 verified
|
Raw
History Blame Contribute Delete
2.26 kB
---
title: Phantom Grid
emoji: ๐Ÿ•ต๏ธ
colorFrom: indigo
colorTo: gray
sdk: gradio
sdk_version: 6.17.3
app_file: app.py
python_version: "3.10"
pinned: false
license: mit
hf_oauth: false
tags:
- thousand-token-wood
- delightful
- game
- agent
- minicpm
- track:wood
- sponsor:openbmb
- sponsor:openai
- achievement:offgrid
- achievement:offbrand
- achievement:llama
---
# ๐Ÿ•ต๏ธ Phantom Grid
**An AI-driven noir detective game.** You are a detective hunting a phantom suspect
across a stylized London grid. Issue notices, raise lookouts, set blockades, and
**interview AI-roleplayed witnesses** whose memories decay over time โ€” all rendered
in a custom `gr.Server` HTML/JS board interface
> Track: **Delightful โ€” Thousand Token Wood** (an AI-driven game).
## ๐Ÿ… Prize-category badges
- ๐ŸŽฎ **Thousand Token Wood / Delightful** โ€” a playable, AI-driven detective game.
- ๐ŸŽจ **Off Brand** โ€” fully custom `gr.Server` HTML/JS frontend, well beyond the stock components.
- ๐Ÿชถ **Small & Mighty** โ€” runs entirely on a single under-32B model (MiniCPM4.1-8B).
## ๐Ÿค– Model & inference
- **Model:** [`openbmb/MiniCPM4.1-8B`](https://huggingface.co/openbmb/MiniCPM4.1-8B)
(text, bf16 transformers) โ€” ~8B params, well under the 32B cap.
- **Inference:** in-process Hugging Face `transformers`, placed on `cuda` at
module load (using ZeroGPU's PyTorch CUDA emulation), with the real GPU
attached only inside a `@spaces.GPU`-decorated `generate()` call.
## ๐Ÿ–ฅ๏ธ Hardware
Runs on **ZeroGPU** (NVIDIA RTX Pro 6000 Blackwell, `large` / 48 GB VRAM; 40
min/day for Team org members). Each generation is capped at
`PHANTOM_GRID_ZEROGPU_DURATION` seconds (default 90). No voice path for now
(`PHANTOM_GRID_WITNESS_CHAT_TTS=0`).
## ๐ŸŽฅ Demo video
https://www.youtube.com/watch?v=p8iSjatInXo
## ๐Ÿ“ฃ Social post
Launch post on X: https://x.com/unityashtv/status/2066633879109382378
## โ–ถ๏ธ How to play
Start a new case, read the briefing, then use the board tools (notices, lookouts,
blockades, searches) and question witnesses to corner the suspect before the turn limit.
## Local Deployment
For running a version of the game with a locally running llama cpp backend find the code at https://github.com/U4AR/JohnDoe