Spaces:
Runtime error
Runtime error
| hackathon: Build Small (2026) | |
| title: Dreadzone | |
| emoji: 💬 | |
| colorFrom: yellow | |
| colorTo: red | |
| sdk: gradio | |
| sdk_version: 6.5.1 | |
| app_file: app.py | |
| pinned: false | |
| suggested_hardware: t4-small | |
| license: artistic-2.0 | |
| short_description: Backrooms-inspired local GGUF experience | |
| team: | |
| - grimjim | |
| tags: | |
| - track:wood | |
| - sponsor:openai | |
| - sponsor:nvidia | |
| - achievement:offgrid | |
| - achievement:llama | |
| social_media_post: https://www.linkedin.com/posts/jim-lai-038249_i-participated-in-the-build-small-hackathon-share-7472113354073853952-LA39/ | |
| An entry for the Build Small Hackathon (2026) | |
| The track taken: Thousand Token Wood | |
| Dreadzone is a Backrooms-inspired interactive fiction prototype that runs a | |
| local GGUF model with `llama-cpp-python` and Gradio ChatInterface. | |
| The app downloads | |
| [`unsloth/NVIDIA-Nemotron-3-Nano-4B-GGUF`](https://huggingface.co/unsloth/NVIDIA-Nemotron-3-Nano-4B-GGUF) | |
| automatically on first launch and streams responses from | |
| `NVIDIA-Nemotron-3-Nano-4B-Q5_K_M.gguf`. | |
| No hosted inference API, OAuth token, secrets, or external inference services are | |
| used. The default dependency pin uses the CUDA 12.4 `llama-cpp-python` wheel for | |
| GPU Spaces. | |
| The Python app owns the lightweight game state: coordinates, turn count, sanity, | |
| zone profile, and encounter rolls. The model receives hidden state each turn and | |
| narrates the result without exposing coordinates or mechanics. There are a few | |
| surprises to keep players on their toes. | |
| ## Runtime settings | |
| The defaults are intentionally conservative while enabling GPU offload: | |
| - `N_CTX=2048` | |
| - `N_BATCH=128` | |
| - `MAX_HISTORY_TURNS=6` | |
| - `GAME_SEED=dreadzone` | |
| - `N_THREADS` defaults to one fewer than the detected CPU count | |
| - `N_GPU_LAYERS=-1` offloads all possible layers to GPU | |
| - `ENABLE_THINKING=false` renders the model chat template with thinking disabled | |
| You can override the model or runtime settings with Space variables: | |
| - `MODEL_REPO` | |
| - `MODEL_FILE` | |
| - `MODEL_DIR` | |
| - `GAME_SEED` | |
| - `N_CTX` | |
| - `N_BATCH` | |
| - `N_THREADS` | |
| - `N_GPU_LAYERS` | |
| - `ENABLE_THINKING` | |
| - `MAX_HISTORY_TURNS` | |
| ## Author | |
| grimjim@huggingface | |
| Assisted by Codex |