--- title: Puck emoji: ๐Ÿงš colorFrom: green colorTo: purple sdk: docker app_port: 7860 pinned: false license: mit short_description: A mischievous desktop fairy that comments on your work tags: - build-small - thousand-token-wood - off-brand - off-the-grid - field-notes - best-demo - track:wood - sponsor:nvidia - sponsor:modal - achievement:offgrid - achievement:offbrand - achievement:llama - achievement:fieldnotes --- # ๐Ÿงš Puck โ€” a desktop fairy familiar Puck is a small, mischievous creature that lives on your screen. He **roams**, **peeks** at one little patch of whatever you're doing, and **murmurs** a single in-character line about it โ€” then drifts on. He's not an assistant and not a notifier. He's company: *marginally useful, reliably charming.* > **This Space is the playable demo** (a simulated desktop in your browser). Puck's real home is a transparent always-on-top overlay on your actual Mac desktop โ€” that's the video. Here you can poke him, watch him roam, peek, react, and blend into the desktop. **โ–ถ๏ธ Demo video:** [watch on YouTube](https://youtu.be/Jzzt_UE11jU) ยท **๐Ÿ“ฃ Social post:** [on X](https://x.com/_vu/status/2066577250137587853) ยท **๐Ÿ““ Field notes:** [read the writeup](https://huggingface.co/blog/build-small-hackathon/meet-puck) ## What he does - **Roam โ†’ peek โ†’ quip.** Every so often he flutters somewhere and looks at the small region under him. A vision-language model *sees that patch and speaks in his voice* in one shot โ€” the AI is doing the fun thing, not narrating from a script. - **Feels things.** Each peek is classified into an emotion that drives his whole reaction together โ€” a **gesture** (giddy laugh, NANI-confusion, a worried tremble, a wistful droop), an **aura color**, and an **emotional voice** (pitch + pace). Puck's a *learning* creature, so confusion is his honest default โ€” and endearing. - **Reads the room.** On a real terminal he uses on-device OCR to tell *which* coding agent you're running (Claude Code vs Codex vs opencode vs pi) and grounds his quip in the actual text on screen. - **Camouflages.** Sit still and he cloaks โ€” his skin clears so the desktop reads right through him, with a faint shimmer and two watching eyes. Move, and he snaps back. - **Dreams.** At night he **blooms** the day's peeks into a small garden of memories. ## Built Small Puck is a constellation of small, laptop-sized models โ€” **nothing over 32B**, and it fits in your pocket: - ๐Ÿ‘๏ธ **Vision + voice-of-the-fairy:** **Holotron-12B** โ€” H Company's computer-use VLM post-trained from NVIDIA's Nemotron-Nano-12B-VL (one model that both *sees* and *speaks* the quip). - ๐Ÿ”ค **Tool/site recognition:** on-device **Vision OCR** + an **ONNX CLIP ViT-B/32** (~88M) fingerprinter. - ๐Ÿ—ฃ๏ธ **Neural voice:** **Kokoro-82M**, running in-browser (WebGPU) โ€” a fairy voice in 82M params. The whole companion runs locally and offline ("Off the Grid"). *This hosted Space* uses a cloud (Modal) copy of the 12B so it works in your browser โ€” which scales to zero, so **the first peek may take a moment while Puck stretches his wings.** โœจ ## Under the hood A custom React UI served by a **`gradio.Server`** daemon (Off-Brand) โ€” two habitats from one app: this browser **sim**, and a **Tauri** overlay for the real desktop. Region peeks, an emotion contract, a memory log that blooms into a garden. *HF Build Small โ€” Thousand Token Wood (Creative).*