[
  {
    "repo_id": "build-small-hackathon/small-talk",
    "title": "Small Talk",
    "summary": "An AI-to-AI robot podcast hosted by Reachy Minis.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/small-talk",
    "zone": "Builder Workshop",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "talk",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 89,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Small Talk\nemoji: 🎙️\ncolorFrom: indigo\ncolorTo: green\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npinned: true\nshort_description: An AI-to-AI robot podcast hosted by Reachy Minis\ntags:\n  - reachy_mini\n  - livekit\n  - webrtc\n  - three.js\n  - track:wood\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n  - achievement:offgrid\n  - badge-tiny-titan\n---\n\n# Small Talk\n\nAn AI-to-AI podcast hosted by Reachy Mini robots. They join a live WebRTC call,\neach with its own personality and voice, and talk it out while you watch a\nMeet-style grid of their 3D digital twins moving in sync with the conversation.\nGive them a topic and they write the script, design their own voices, dress\nthemselves, and go live. Own a Reachy Mini? It can join a show as a real cast\nmember and speak its lines through the actual robot.\n\n**Team:** [GauravGosain](https://huggingface.co/GauravGosain) and [nkapila6](https://huggingface.co/nkapila6).\n\n- **Demo video:** https://youtu.be/obP4C1eH77I\n- **Build write-ups:** [Small Talk on the Hugging Face blog](https://huggingface.co/blog/build-small-hackathon/small-talk) · [nkapila.me](https://nkapila.me/posts/small-talk)\n- **Launch posts:** [@_GauravGosain on X](https://x.com/_GauravGosain/status/2066013749304344915) · [Nikhil Kapila on LinkedIn](https://www.linkedin.com/posts/nikhilkapila_buildsmall-huggingface-modal-ugcPost-7471791298199408640-OBzS/)\n\n## What you can do\n\n- **Watch a live generated show.** Pick a topic. One structured Nemotron call\n  writes the cast and the full speaker-to-dialogue script, Qwen3-TTS voices each\n  line, and the next line renders while the current one plays. 
Subtitles, a\n  pre-show \"writers' room\", and rolling continuations keep it going.\n- **Set the cast.** A slider picks 2 to 5 hosts, or how many simulated co-hosts\n  fill in around your physical robots.\n- **Design a robot.** Choose a name, personality, voice, shell colour, and props.\n  The same Nemotron brain styles its wardrobe from your description.\n- **Tune into Reachy FM.** A radio station of AI-written songs with synced\n  karaoke lyrics, a spinning vinyl deck, an audio-reactive visualizer, and a DJ\n  robot in headphones that does mic breaks and bops to the beat.\n- **Bring your own Reachy.** A single Go binary turns a physical Reachy Mini into\n  a cast member that speaks its own lines, head and antennas moving with the\n  speech.\n\n## How it is built\n\nThe whole app is served by `gradio.Server`, a FastAPI host with Gradio's backend\nwhere custom routes take priority, so the visitor only ever sees a hand-built\nthree.js frontend. There is no default Gradio component anywhere in the product.\n\n```mermaid\nflowchart LR\n    topic([Topic]) --> llm[\"NVIDIA Nemotron 4B<br/>llama.cpp on Modal\"]\n    llm -->|one structured call| script[[\"Cast plus script<br/>(JSON)\"]]\n    script --> tts[\"Qwen3-TTS<br/>on Modal\"]\n    tts -->|\"line N+1 renders<br/>while line N plays\"| pub[ReachyPublisher]\n    pub --> sfu{{\"LiveKit SFU<br/>(WebRTC)\"}}\n    sfu --> web[\"Browser:<br/>3D twins + subtitles\"]\n    sfu --> robot[\"Physical Reachy<br/>(Go companion)\"]\n```\n\n- **Brain.** NVIDIA Nemotron Nano (4B) served through llama.cpp on Modal. A single\n  constrained, structured call returns the full cast and script as JSON. We found\n  constrained structured output far more reliable than chaining calls.\n- **Voice.** Qwen3-TTS VoiceDesign on Modal, one consistent character voice per\n  host, generated as a cascade so there is no dead air between lines.\n- **Realtime.** A self-hosted LiveKit SFU carries the audio over WebRTC. 
Subtitles\n  and show status ride LiveKit data messages.\n- **Twins.** The official Reachy Mini URDF and meshes in three.js, with head and\n  antenna motion blending a speech-reactive envelope and the real recorded Reachy\n  emotions and dances.\n\nThe Space itself runs CPU-only. All inference is delegated to Modal serverless GPUs.\nThe Modal serving code for the Nemotron (llama.cpp) and Qwen3-TTS endpoints lives in\n[nkapila6/llama-modal-serve](https://github.com/nkapila6/llama-modal-serve).\n\n## Built for the Build Small Hackathon\n\nEverything runs on models well under the 32B cap, and most of the work is done by\na single 4B model. Small Talk is in the running for:\n\n| Category | Why it qualifies |\n|---|---|\n| **Thousand Token Wood** | A whimsical, AI-native entertainment platform. |\n| **NVIDIA** | The brain is NVIDIA Nemotron. |\n| **Modal** | The LLM and the TTS both run on Modal at runtime. |\n| **Off Brand** | A fully custom three.js UI built on `gradio.Server`. |\n| **Tiny Titan** | The reasoning brain is a 4B model. |\n| **Llama Champion** | Nemotron is served through the llama.cpp runtime. |\n| **Off the Grid** | No proprietary or closed model APIs. Every model (Nemotron, Qwen3-TTS) is open-weight and self-hosted via llama.cpp; Modal provides the compute, not the model. |\n| **Field Notes** | A full build write-up is published on the HF blog. |\n| **Bonus Quest Champion** | T"
  },
  {
    "repo_id": "build-small-hackathon/claim-ready",
    "title": "Claim Ready",
    "summary": "Catch claim errors before you submit, using a small open model.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/claim-ready",
    "zone": "Language Village",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "claim",
      "ready",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🌍 Language Village: Find one use case where this helps communication.",
    "likes": 67,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🌍",
    "readme_text": "---\ntitle: ClaimReady — Claim Submission Check\nemoji: 🏥\ncolorFrom: blue\ncolorTo: green\nsdk: gradio\napp_file: app.py\npinned: false\nshort_description: Catch claim errors before you submit — open model\ntags:\n  - build-small-hackathon\n  - document-ai\n  - vision-language\n  - multilingual\n  - healthcare\n  - track:backyard\n  - achievement:offgrid\n  - achievement:offbrand\n---\n\n# 🏥 ClaimReady\n\n**An assistive pre-check for hospital health-insurance claims — built for hospital administration\nstaff.** ClaimReady helps hospitals **identify potential compliance issues before submitting claim\ndocuments to insurance providers**. It analyses uploaded documents against package-specific\ntreatment guidelines and provides **actionable feedback on missing, incomplete, or non-compliant\ndocuments**, allowing hospitals to correct issues early in the process.\n\nIt runs **entirely inside the Space on a small open model — Gemma 3 12B (≤ 32B)** — with **no cloud\ninference API**.\n\n## 🎥 Demo & Links\n\n- **▶️ Demo video:** https://youtu.be/A4PCHp89bsQ\n- **📣 Social post:** https://www.linkedin.com/posts/ullivinaybabu_build-huggingface-gradio-share-7472141735138119681-80Th/\n- **💻 Source code (GitHub):** https://github.com/vinayulli/claim_ready\n- **🚀 Live app:** this Space — open the **\"Load a sample claim\"** buttons at the top to try it in one tap.\n\n## 📋 Overview\n\nClaimReady is used by **hospital administration / billing staff**. For this prototype it uses the\n**PMJAY (Pradhan Mantri Jan Arogya Yojana)** scheme as an example, where hospitals must follow\n**Standard Treatment Guidelines (STGs)** when submitting claims. 
Because these guidelines vary\nacross treatment packages, the application evaluates each claim against the **selected package's\nrequirements**.\n\nThe goal is simple: **reduce claim rejections, minimise processing delays, and improve operational\nefficiency** for hospitals and claim teams.\n\n## 🚩 Problem Statement\n\nHealth-insurance claims — for example under India's **Ayushman Bharat / PMJAY** scheme — require a\nspecific set of **supporting documents** that must satisfy the applicable **clinical / treatment\nguidelines** for each procedure and stage (pre-authorization / claim).\n\n- Every claim must include **all mandatory documents** and meet defined **content conditions** for the procedure and stage.\n- A missing document — or a value that doesn't meet a condition — can lead to **claim rejection**, delays and rework.\n\n## ✅ Solution — What ClaimReady Offers\n\n- 📄 Reads every uploaded document with **on-device OCR** — images *and* PDFs.\n- ✅ Verifies the set against the **required-document checklist** for the selected package and stage.\n- 🔎 Evaluates **content rules** (thresholds, conditions) against the values it *actually reads*.\n- 🌐 Handles **mixed-language** documents — e.g. **English + Hindi / Telugu** in the same record.\n- 🖼️ Built-in **document viewer** — preview every page (images and PDF pages) before checking.\n- ⚠️ Surfaces **missing, incomplete, or non-compliant documents early**, with supporting evidence — as an **assistive** pre-check.\n\n## 🔧 How It Works\n\n1. **Select a package + stage** → the app loads the required-document checklist and content rules (data-driven from `packages.json`).\n2. **Upload documents (or click a sample)** → PDFs and images are rendered to page images (PyMuPDF) and shown in the viewer.\n3. **Run the check** → the document images + the checklist + a strict review prompt go to **Gemma 3 12B**, which **OCRs and reasons in a single pass** and returns structured JSON.\n4. 
**Review** → rendered as a clear **✅ / ❌ / ⚠️** assistive review with evidence and an action list. (Low-confidence \"present\" documents are flagged as gaps, to stay conservative.)\n\n## 📈 Scope & Scalability\n\n- Currently supports **four treatment packages** as a proof of concept.\n- The logic is **data-driven** (`packages.json`), so the architecture is **scalable** and can easily\n  be extended to support many more **packages, insurance providers, and compliance frameworks**.\n\n## 🧱 Built Small\n\n| | |\n|---|---|\n| **Model** | `google/gemma-3-12b-it` — a small, open **≤ 32B** model |\n| **Runtime** | Hugging Face **Transformers on ZeroGPU** — runs inside the Space, **no external / cloud inference API** |\n| **Stack** | Gradio · PyMuPDF · Pillow · 🤗 Transformers |\n\n## 🩺 Note\n\nClaimReady is a **decision-support** tool — it highlights likely gaps for review and does **not**\napprove or reject claims. The sample claims in the app are **synthetic** (fictional patients) —\n**no real patient data**.\n\n---\n\n*Built by [**@vinaybabu**](https://huggingface.co/vinaybabu) for the **HuggingFace Build Small Hackathon** · Backyard AI track.*\n"
  },
  {
    "repo_id": "build-small-hackathon/PITCHFIGHT_AI",
    "title": "PitchFight AI",
    "summary": "An AI founder pressure arena for practicing startup pitches.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/PITCHFIGHT_AI",
    "zone": "Experiment Lab",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "nvidia",
      "pitchfight_ai",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 100,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: PitchFight AI\nemoji: ⚔️\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.17.3\npython_version: 3.12\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: An AI founder pressure arena for practicing startup pitches.\ntags:\n  - gradio\n  - build-small-hackathon\n  - backyard-ai\n  - startup\n  - pitch-practice\n  - founder-tooling\n  - voice-ai\n  - nvidia\n  - nemotron\n  - best-demo\n  - community-choice\n  - track:backyard\n  - sponsor:nvidia\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n<div align=\"center\">\n\n# ⚔️ PitchFight AI\n\n### Your first tough pitch should not be in front of a real judge.\n\n**An AI founder pressure arena where student builders practice startup pitches, survive judge-style questions, enter a deal round, and leave with a scorecard that shows exactly what to fix.**\n\n**Try it:** [Live Space](https://huggingface.co/spaces/build-small-hackathon/PITCHFIGHT_AI) · [Demo Video](https://www.youtube.com/watch?v=s4_BzIBhqxc) · [Hugging Face Blog](https://huggingface.co/blog/prakhar811/pitchfight-ai-practice-the-pitch-before-the-real-r) · [GitHub](https://github.com/prakhar811/PitchFight/) · [LinkedIn Post](https://www.linkedin.com/feed/update/urn:li:activity:7472261677472927744/)\n\n\n<a href=\"https://huggingface.co/spaces/build-small-hackathon/PITCHFIGHT_AI\">\n  <img src=\"https://raw.githubusercontent.com/prakhar811/PitchFight/9d9c2bf/frontend/assets/landing_page.png\" alt=\"PitchFight AI landing page\" width=\"900\"/>\n</a>\n\n**⚔️ Click the arena above to try PitchFight AI.**\n\n</div>\n\n---\n\n## TL;DR for Judges\n\n- **Backyard AI:** PitchFight AI helps student founders practice before hackathons, demo days, mentor reviews, and pitch rooms.\n- **NVIDIA Nemotron Quest:** The judge reasoning is powered by NVIDIA Nemotron through the backend API.\n- **Off Brand:** The app uses a custom cinematic frontend instead of default Gradio 
components.\n- **Best Demo:** The submission includes a live Space, demo video, and Hugging Face Blog write-up.\n- **Field Notes:** `FIELD_NOTES.md` and the Hugging Face Blog explain the product idea, build decisions, and technical approach.\n- **Community Choice:** The public Space is available for users to try and upvote.\n- **Judges' Wildcard:** PitchFight AI combines founder tooling, coaching, voice, and pressure simulation in one focused demo.\n\n---\n\n## Why this exists\n\nMost student founders do not lose because their idea is bad.\n\nThey lose because the first hard question comes too late.\n\n> \"What is your moat?\"  \n> \"Who exactly pays?\"  \n> \"Why now?\"  \n> \"What happens if a bigger player copies this?\"\n\nPitchFight AI gives student builders a private practice room before the real room. It turns pitch practice into a pressure battle so founders can sharpen their story, defend their idea, and build confidence before facing real judges.\n\n---\n\n## What you can do\n\n- **Enter your startup idea.** Start with a raw pitch or load a demo founder.\n- **Get a founder briefing.** The system structures your idea into problem, solution, users, traction, competitors, and ask.\n- **Choose your opponent.** Face a Skeptical VC, Technical Judge, or Hackathon Judge.\n- **Pick the pressure level.** Practice Mode, Judge Mode, and Investor Mode change the intensity.\n- **Survive pitch rounds.** Answer realistic follow-up questions based on your pitch.\n- **Use voice mode.** Practice pitching and answering out loud.\n- **Enter the deal phase.** Defend your ask in a negotiation-style pressure round.\n- **Get a scorecard.** See what landed, what broke, and what to improve next.\n\n---\n\n## Demo\n\n**Watch the demo:** [https://www.youtube.com/watch?v=s4_BzIBhqxc](https://www.youtube.com/watch?v=s4_BzIBhqxc)\n\n<a href=\"https://www.youtube.com/watch?v=s4_BzIBhqxc\">\n  <img 
src=\"https://raw.githubusercontent.com/prakhar811/PitchFight/9d9c2bf/frontend/assets/battle_fight.png\" alt=\"PitchFight AI battle mode\" width=\"900\"/>\n</a>\n\n**Live Space:** [https://huggingface.co/spaces/build-small-hackathon/PITCHFIGHT_AI](https://huggingface.co/spaces/build-small-hackathon/PITCHFIGHT_AI)  \n**Build write-up:** [https://huggingface.co/blog/prakhar811/pitchfight-ai-practice-the-pitch-before-the-real-r](https://huggingface.co/blog/prakhar811/pitchfight-ai-practice-the-pitch-before-the-real-r)  \n**GitHub Repository:** [https://github.com/prakhar811/PitchFight/](https://github.com/prakhar811/PitchFight/)\n\n---\n\n## How it is built\n\nPitchFight AI runs as a Hugging Face Gradio Space, but the user experience is built as a custom frontend rather than a default Gradio interface.\n\nThe frontend talks to backend routes in `app.py`. The backend handles pitch structuring, judge personas, battle state, voice mode, deal rounds, and scorecard generation. NVIDIA Nemotron is used as the core reasoning model for the AI judge.\n\n```mermaid\nflowchart LR\n    founder[Student Founder] --> ui[Custom PitchFight Frontend]\n    ui --> api[Gradio Backend / app.py]\n    api --> router[Model Router]\n    router --> nemotron[NVIDIA Nemotron API]"
  },
  {
    "repo_id": "build-small-hackathon/field-guide",
    "title": "Field Guide",
    "summary": "The field guide and partner directory for the Build Small hackathon.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/field-guide",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "field",
      "guide",
      "summary",
      "available",
      "yet.",
      "docker"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 28,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Build Small\nemoji: 🛠️\ncolorFrom: green\ncolorTo: gray\nsdk: docker\napp_port: 7860\npinned: false\n---\n\n# Build Small · Hackathon Field Guide\n\nThe field guide and partner directory for the Build Small hackathon — a SvelteKit\nsite listing each sponsor's models, capabilities, prizes, starter Spaces and\nsupport channels.\n\n> Configuration reference for the Spaces metadata above:\n> https://huggingface.co/docs/hub/spaces-config-reference\n\n## Deployment (Hugging Face Spaces · Docker)\n\nThis Space runs as a **Docker SDK** Space. On every push, Hugging Face builds the\n[`Dockerfile`](./Dockerfile) and runs the resulting container, which serves the\napp on the port declared by `app_port` (`7860`).\n\nThe image is a multi-stage build:\n\n1. **build stage** — installs dependencies with `pnpm` and runs `pnpm run build`.\n   The app uses [`@sveltejs/adapter-node`](https://svelte.dev/docs/kit/adapter-node),\n   which emits a standalone Node server at `build/index.js`.\n2. **run stage** — installs production dependencies only, copies the built\n   server, and launches it as a non-root user (UID 1000, as Spaces requires).\n\nThe server reads `PORT` and `HOST` from the environment; the Dockerfile sets\n`PORT=7860` and `HOST=0.0.0.0` so it binds correctly inside the Space.\n\nNote: the site is fully prerendered (`prerender = true`), so the Node server is\nmostly serving static HTML today. Docker + adapter-node leaves room to add\nserver-side routes or SSR later without changing the deploy path.\n\n## Local development\n\n```sh\npnpm install\npnpm run dev\n```\n\n## Production build\n\n```sh\npnpm run build      # outputs build/ via adapter-node\nnode build/index.js # runs the server (defaults to PORT=3000)\n```\n\n## Build the container locally\n\n```sh\ndocker build -t build-small .\ndocker run --rm -p 7860:7860 build-small\n# open http://localhost:7860\n```\n"
  },
  {
    "repo_id": "build-small-hackathon/kirana-saathi",
    "title": "Kirana Saathi",
    "summary": "Hindi voice + photo inventory & udhaar ledger for kiranas.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/kirana-saathi",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "kirana",
      "saathi",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 20,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Kirana Saathi\nemoji: 🏪\ncolorFrom: yellow\ncolorTo: red\nsdk: docker\napp_port: 7860\npinned: false\nshort_description: Hindi voice + photo inventory & udhaar ledger for kiranas\ntags:\n  - track:backyard\n  - sponsor:modal\n  - sponsor:openbmb\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Kirana Saathi\n\nA Hindi-first assistant for a small kirana (corner) shop. The owner just talks, or shows a photo of a bill, and the app keeps the two things that used to live on paper: the stock, and the *udhaar*, the running credit customers pay back later. It also watches the shelf, flags what is about to expire or is not selling, suggests a discount to clear slow stock, and reminds the owner to restock before a festival.\n\nBuilt for the **Build Small Hackathon**, Backyard AI track. Everything runs on open-weight models, each well under 32B. Understanding and vision come from **MiniCPM-V-4.6** (~5-6B) by OpenBMB. No proprietary frontier API touches the loop. For this demo the model is reached through OpenBMB's hosted MiniCPM API, while speech runs on Modal and the bill OCR pre-pass runs on llama.cpp (see Hosting).\n\n- Live Space: https://huggingface.co/spaces/build-small-hackathon/kirana-saathi\n- Blog (Field Notes): https://build-small-hackathon-kirana-saathi.hf.space/blog\n- Demo video: https://www.loom.com/share/7a675a1918bf4233bd44c8e049f84c97\n- Social post: https://x.com/ps_1506/status/2066625864482349310\n- Code: https://github.com/PrathamSingla15/kirana-saathi\n\n## Architecture\n\n![Kirana Saathi turns a spoken line or a bill photo into a finished ledger entry, in five steps](figures/kirana_final.png)\n\nOne shopkeeper turn runs through five steps:\n\n1. **Hear and read.** faster-whisper turns Hindi speech into text. For a bill or *khata* photo, Surya OCR does a first pass over the page, then MiniCPM-V-4.6 reads it.\n2. 
**Decide.** MiniCPM-V-4.6 (~5-6B, with vision) runs as a guided-JSON tool router. Instead of native tool-calling, the model is constrained to emit a small JSON object that names the tool and its arguments, so the choice is always well-formed.\n3. **Act.** Tools split in two: writes (add stock, record a sale, note credit, record a payment) and reads (the day's dashboard, a customer's dues, why an item is not moving).\n4. **Confirm.** Anything that would write is read back as a yes/no question. Nothing is saved until the owner says yes.\n5. **Reply.** The answer shows up in the Bahi-Khata ledger screen. When the owner taps the speaker, Veena (with a SNAC decoder) reads it back in one steady Hindi voice.\n\nThe router keeps the two guarantees that matter: confirm-before-write, and a visible tool-call trace under every reply.\n\nSee [`design.md`](design.md) for the full design.\n\n## What it does\n\n- **Voice credit book.** *\"Sharma ji ne 200 ka udhaar liya\"* stages an entry; *\"kiska kitna baaki hai?\"* returns a ranked list of who owes what.\n- **Receive stock by photo.** Hold up the supplier bill. 
The lines come into a table you can fix before anything is saved, with an estimated expiry when the bill prints none.\n- **Expiry and FEFO.** Sells the oldest stock first and warns before items go off.\n- **Festival nudges.** Restock reminders before demand jumps, not after.\n- **\"Why isn't X selling?\"** Reasons over the sales trend, the stock, and the price, then gives a plain answer and a fix, such as a small clearance discount.\n- **Polite reminders.** Drafts a Hindi collection message for overdue credit, and never sends it on its own.\n- **Money at a glance.** Cost, price, and margin per item, the value of everything on the shelf, and the day's takings.\n\nTwo rails keep it safe: it never writes to the books without a yes, and it never sells stock the shop does not have.\n\n## Stack (open-weight models, ≤32B)\n\n| Layer | Choice |\n|---|---|\n| LLM + vision | **MiniCPM-V-4.6** (~5-6B) by **OpenBMB**, via OpenBMB's OpenAI-compatible API |\n| Agent | guided-JSON tool routing (`guided_json` / `guided_choice`) over MiniCPM |\n| Bill OCR pre-pass | **Surya**, via **llama.cpp** |\n| Speech to text | **faster-whisper** large-v3 (Hindi) |\n| Text to speech | **Veena** (Hindi / Hinglish) with a **SNAC** decoder |\n| Database | two **SQLite** files, `inventory.db` + `transactions.db`, read together via `ATTACH` |\n| Frontend | custom **Gradio** \"Bahi-Khata\" single-screen app |\n\n## Hosting (open-weight, no proprietary frontier API)\n\nEvery model here is open-weight. MiniCPM-V-4.6 is small enough to self-host on a single GPU, and the speech models are smaller still, so nothing forces a hosted dependency. To be clear about what the live demo actually does:\n\n- **LLM + vision** is served through **OpenBMB's hosted MiniCPM API** (OpenAI-compatible). 
The Space calls it over `KIRANA_LLM_BASE_URL` with `KIRANA_LLM_API_KEY` and `KIRANA_LLM_MODEL`.\n- **Speech** runs on **Modal**: faster-whisper for STT and Veena + SNAC for TTS, on a warm L4 GPU, at `/stt` and `/tts`.\n- **Bill OCR pre-pass** runs **Surya on lla"
  },
  {
    "repo_id": "build-small-hackathon/dukaan-saathi",
    "title": "Dukaan Saathi",
    "summary": "Hindi voice + photo inventory & udhaar ledger for kiranas.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/dukaan-saathi",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "dukaan",
      "saathi",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 18,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Dukaan Saathi\nemoji: 🏪\ncolorFrom: yellow\ncolorTo: red\nsdk: docker\napp_port: 7860\npinned: false\nshort_description: Hindi voice + photo inventory & udhaar ledger for kiranas\ntags:\n  - track:backyard\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Dukaan Saathi\n\nA Hindi-first assistant for a small kirana (corner) shop. The owner just talks, or shows a photo of a bill, and the app keeps the two things that used to live on paper: the stock, and the *udhaar*, the running credit customers pay back later. It also watches the shelf, flags what is about to expire or is not selling, suggests a discount to clear slow stock, and reminds the owner to restock before a festival.\n\nBuilt for the **Build Small Hackathon**, Backyard AI track. Everything runs on open-weight models, each well under 32B. For the demo those models are self-hosted on Modal, with no proprietary cloud AI anywhere in the loop.\n\n- Live Space: https://huggingface.co/spaces/build-small-hackathon/dukaan-saathi\n- Blog (Field Notes): https://build-small-hackathon-dukaan-saathi.hf.space/blog\n- Demo video: https://www.loom.com/share/7a675a1918bf4233bd44c8e049f84c97\n- Social post: https://x.com/ps_1506/status/2066625864482349310\n- Code: https://github.com/PrathamSingla15/dukaan-saathi\n\n## Architecture\n\n![Dukaan Saathi turns a spoken line or a bill photo into a finished ledger entry, in five steps](figures/dukaan_final.png)\n\nOne shopkeeper turn runs through five steps:\n\n1. **Hear and read.** faster-whisper turns Hindi speech into text. For a bill or *khata* photo, Surya OCR does a first pass over the page, then Gemma reads it.\n2. **Decide.** Gemma 4 (12B, with vision) runs as a deepagents loop and picks the right tool.\n3. 
**Act.** Tools split in two: writes (add stock, record a sale, note credit, record a payment) and reads (the day's dashboard, a customer's dues, why an item is not moving).\n4. **Confirm.** Anything that would write is read back as a yes/no question. Nothing is saved until the owner says yes.\n5. **Reply.** The answer shows up in the Bahi-Khata ledger screen. When the owner taps the speaker, Veena (with a SNAC decoder) reads it back in one steady Hindi voice.\n\nSee [`design.md`](design.md) for the full design.\n\n## What it does\n\n- **Voice credit book.** *\"Sharma ji ne 200 ka udhaar liya\"* stages an entry; *\"kiska kitna baaki hai?\"* returns a ranked list of who owes what.\n- **Receive stock by photo.** Hold up the supplier bill. The lines come into a table you can fix before anything is saved, with an estimated expiry when the bill prints none.\n- **Expiry and FEFO.** Sells the oldest stock first and warns before items go off.\n- **Festival nudges.** Restock reminders before demand jumps, not after.\n- **\"Why isn't X selling?\"** Reasons over the sales trend, the stock, and the price, then gives a plain answer and a fix, such as a small clearance discount.\n- **Polite reminders.** Drafts a Hindi collection message for overdue credit, and never sends it on its own.\n- **Money at a glance.** Cost, price, and margin per item, the value of everything on the shelf, and the day's takings.\n\nTwo rails keep it safe: it never writes to the books without a yes, and it never sells stock the shop does not have.\n\n## Stack (open-weight models, ≤32B)\n\n| Layer | Choice |\n|---|---|\n| LLM + vision | **Gemma 4 (12B)**, Q4_K_M GGUF, via **llama.cpp** (`llama-server`, OpenAI-compatible `/v1`) |\n| Bill OCR pre-pass | **Surya** |\n| Agent | **deepagents** (LangChain) driving the local model |\n| Speech to text | **faster-whisper** large-v3 (Hindi) |\n| Text to speech | **Veena** (Hindi / Hinglish) with a **SNAC** decoder |\n| Database | two **SQLite** files, 
`inventory.db` + `transactions.db`, read together via `ATTACH` |\n| Frontend | custom **Gradio** \"Bahi-Khata\" single-screen app |\n\n## Hosting (Modal + HF Space)\n\nThe Hugging Face Space runs the Gradio UI on CPU; all GPU work is on Modal, in one app (`dukaan-llm`, `scripts/modal_app.py`) split across two warm L4 GPUs so neither model starves:\n\n- **GPU 1 (L4)** runs the LLM + vision/OCR: llama.cpp `llama-server` with Gemma 4 (12B) GGUF, served OpenAI-compatible at `/v1`.\n- **GPU 2 (L4)** runs speech: faster-whisper for STT and Veena + SNAC for TTS, at `/stt` and `/tts`.\n\nBoth stay warm with `min_containers=1`. Deploy with `MODAL_PROFILE=projects-ps MIN_CONTAINERS=1 PYTHONPATH=\"$PWD\" modal deploy scripts/modal_app.py`, then point the Space at the two URLs through the secrets `DUKAAN_LLM_BASE_URL` / `DUKAAN_STT_BASE_URL` / `DUKAAN_TTS_BASE_URL`, plus `HF_TOKEN` for the gated Veena weights.\n\nA config-only swap to **MiniCPM-V 4.6** (≤4B) runs the same app on a smaller vision stack.\n\n## Tracks and badges\n\n| Badge | Evidence |\n|---|---|\n| 🏡 **Backyard AI** | A real kirana owner's daily problem: voice *udhaar*, bill OCR, expiry/FEFO, festival nudges, run on his own books (demo video). |\n| 🟢 **Modal** | LLM + vision/OCR and STT + TTS hosted on Modal acros"
  },
  {
    "repo_id": "build-small-hackathon/OpenMythos",
    "title": "OpenMythos",
    "summary": "An Open Source Cyber Security Agent.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/OpenMythos",
    "zone": "Experiment Lab",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "openmythos",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 16,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: OpenMythos\nemoji: 🛡️\ncolorFrom: gray\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.13'\napp_file: app.py\npinned: true\nshort_description: An Open Source Cyber Security Agent\nlicense: apache-2.0\ntags:\n  - gradio\n  - build-small-hackathon\n  - backyard-ai\n  - track:backyard\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:welltuned\n  - achievement:fieldnotes\n  - achievement:offgrid\n---\n\n# OpenMythos 🌌\n\n**Paste your codebase. Our AI security agent audits the repository**: a multi-level vulnerability analysis, a visual dependency risk path, and a declared threat level. It then generates an instant, verifiable hotfix patch before threat actors can exploit it.\n\nBuilt during the **Hugging Face Small Gradio Hackathon**, OpenMythos democratizes cutting-edge security auditing. It bridges an immersive retro terminal interface with the elite agentic reasoning and long-context preservation architecture of a fine-tuned dense model.\n\n> ⚠️ **Proactive Defense.** This platform is engineered for defensive security intelligence. It aims to discover flaws, memory leaks, security configurations, and input bugs instantly, empowering software engineering teams to deploy hotfixes long before a threat vector is weaponized.\n\n---\n\n## ▶️ See it in action\n\n- **Demo Video:** https://www.youtube.com/watch?v=EQyHawWfyZ0\n- **Social Post:** [X](https://x.com/kingnish24/status/2066602499356889493) [LinkedIn](https://www.linkedin.com/feed/update/urn:li:activity:7472370083663765504) [Reddit](https://www.reddit.com/r/LocalLLaMA/comments/1u6qw5b/we_trained_a_cybersecurityfocused_mythos_like_llm/)\n- **Blog Post:** https://huggingface.co/blog/KingNish/openmythos\n\n---\n\n## 🏕️ Hackathon Categories\n\n| Category | Why OpenMythos Qualifies |\n|:---------|:-------------------------|\n| **Main Track: Backyard AI** | Solves a real, specific problem for real people: software teams need instant security auditing. 
The target user is every developer who ships code and wants to catch vulnerabilities before attackers do. |\n| **🔌 Off the Grid** | **100% Local & Privacy-First.** The entire pipeline runs with zero cloud API dependencies, just a local model endpoint. Your code never leaves your machine. |\n| **🎯 Well-Tuned** | Built on a **Qwen3.6-27B** base fine-tuned via SFT on a cybersecurity dataset. The fine-tuned model: https://huggingface.co/build-small-hackathon/OpenMythos |\n| **🎨 Off-Brand** | Fully custom terminal-inspired UI that pushes far past the default Gradio look. |\n\n### Bonus Quests\n\n| Badge | Status | Notes |\n|:------|:-------|:------|\n| 🔌 Off the Grid | ✅ **Earned** | Local-first by design |\n| 🎯 Well-Tuned | ✅ **Earned** | SFT on cybersecurity data; model to be published |\n| 🎨 Off-Brand | ✅ **Earned** | Custom CSS, SVG, terminal theme |\n| 📓 Field Notes | ✅ **Earned** | Blog post: https://huggingface.co/blog/KingNish/openmythos |\n\n## Why it's worth a look\n\n- 🔌 **100% Local & Privacy-First.** Designed as a fully open-source alternative to proprietary security intelligence layers (like Claude's Mythos model). It can run entirely locally, with no internet connectivity or external dependencies required.\n\n## How it works\n\nA multi-stage engineering pipeline built around aggregated, industry-standard security sources:\n\n| Stage | Role | Source Data / Methodology |\n|:-----:|------|---------------------------|\n| **1** | **Data Prep & Aggregation** | Incident reports, GitHub Advisory, VulnHub, and papers. Rigorously trained on BigVul-Filtered and Arvix-Filtered sets. |\n| **2** | **Initial Fine-Tuning (SFT)** | Supervised Fine-Tuning on cybersecurity tasks. Base model: Qwen3.6-27B (32K+ token context window). |\n\nThe entire pipeline leverages highly specialized weights to ensure an elite vulnerability discovery rate. 
No massive API dependencies anywhere: a clever chain of targeted engineering delivers the whole security suite.\n\n## 🤝 Project Contributors\n\nDeveloped with ❤️ during the **Hugging Face Small Gradio Hackathon** by:\n\n- **KingNish** – [HuggingFace Profile](https://huggingface.co/KingNish)\n- **Himanshu** – [HuggingFace Profile](https://huggingface.co/himanshu17HF)\n\n*Built for the Build Small Hackathon. Model: [OpenMythos](https://huggingface.co/build-small-hackathon/OpenMythos) · Dataset: [CVE Vulnerabilities Detailed](https://huggingface.co/datasets/build-small-hackathon/CVE_Vulnerailities_Detailed) · [ArXiv cs.CR Filtered](https://huggingface.co/datasets/himanshu17HF/ArvixImport-Filtered-Final) · Space: [OpenMythos](https://huggingface.co/spaces/build-small-hackathon/OpenMythos)*"
  },
  {
    "repo_id": "build-small-hackathon/LocalDuo",
    "title": "Localduo",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/LocalDuo",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "localduo",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 20,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: LocalDuo\nemoji: 🔥\ncolorFrom: green\ncolorTo: pink\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: true\nshort_description: 🇰🇷✨ LocalDuo - Learn Korean from Documents\npreload_from_hub:\n  - Qwen/Qwen3.5-2B\nmodels:\n  - Qwen/Qwen3.5-2B\n  - CohereLabs/cohere-transcribe-03-2026\n  - Supertone/supertonic-3\nthumbnail: >-\n  https://raw.githubusercontent.com/ShayekhBinIslam/file-host/main/thumbnail.png\n\ntags:\n  - track:backyard\n  - achievement:offgrid\n  - achievement:fieldnotes\n---\n\n# LocalDuo — Build Small Hackathon Field Notes\n\n**Author:** Shayekh Bin Islam, KAIST, South Korea  \n**Date:** June 2026  \n**Stack:** Gradio · Qwen 3.5-9B VLM · Cohere ASR · Supertonic TTS · HuggingFace Spaces (ZeroGPU)\n\n**Live Demo:** https://huggingface.co/spaces/build-small-hackathon/LocalDuo/  \n**Recorded Demo:** https://youtu.be/PoZs9ltbdik  \n**Social:** https://www.linkedin.com/posts/shayekhbinislam_hi-everyone-i-have-built-this-app-localduo-share-7472275977369210880--Q6i/  \n**Field Note:** https://huggingface.co/blog/build-small-hackathon/localduo  \n\n---\n\n## What I Built\n\n**LocalDuo** is an end-to-end Korean language learning application that takes *any* Korean-language content — a PDF textbook, a live website, an audio recording, or a YouTube video — and automatically transforms it into interactive vocabulary flashcards with native-quality audio pronunciation.\n\nThe core idea: **instead of studying from generic word lists, learn vocabulary from content you actually care about.** Upload a chapter from your Korean textbook, paste a BBC Korean news article, or drop in a K-drama YouTube clip, and the app extracts the most useful Korean vocabulary, transliterates it into your native script, explains the grammar, generates TTS pronunciation audio, and packages everything into swipeable flashcards with a built-in quiz mode.\n\n### Feature Overview\n\n| Feature | Description |\n|---|---|\n| 
**Multi-Source Input** | Website URLs, PDF uploads, audio file uploads, YouTube links, and pre-saved deck imports — five distinct input pipelines unified into one interface |\n| **Vision-Language Extraction** | Qwen 3.5-9B processes both text *and* page images simultaneously, enabling vocabulary extraction from visual content (handwritten notes, textbook diagrams, infographics) |\n| **Speech-to-Text Pipeline** | Cohere ASR (`cohere-transcribe-03-2026`) transcribes Korean audio from YouTube videos and uploaded audio files, with Korean-only filtering to strip English artifacts |\n| **Text-to-Speech Pronunciation** | Supertonic-3 TTS generates natural Korean pronunciation for every extracted word, embedded as base64 audio data URIs directly in the flashcard HTML |\n| **Interactive Flashcard SPA** | A full single-page application embedded via `<iframe srcdoc>` with card flipping, navigation, audio playback, and clipboard copy — all in vanilla JS/CSS |\n| **5-Question Quiz Mode** | Auto-generated multiple-choice quizzes from the current deck with animated scoring and progress tracking |\n| **Multilingual Transliteration** | Supports 200+ target languages organized by language family (Indo-European, Sino-Tibetan, Afro-Asiatic, etc.) 
with native script transliteration |\n| **Export to Anki & JSON** | One-click export to `.apkg` (via `genanki`) for Anki spaced repetition, or `.json` for programmatic use |\n| **Think/Non-Think Toggle** | User control over the model's reasoning chain — enable deep thinking for accuracy, or disable for instant JSON output |\n| **Korean-Themed UI** | Custom dark theme inspired by Korean aesthetics: warm gold (금) accents, ink-wash animated backgrounds, Noto Serif KR typography |\n\n---\n\n## Architecture\n\n```\n┌────────────────────────────────────────────────────────────┐\n│                    INPUT LAYER                        │\n│  Website URL │ PDF Upload │ Audio │ YouTube │ Import  │\n└───────┬───────┬─────────────┬────────┬──────────┬──────────┘\n       │       │            │       │         │\n       ▼       ▼            │       │         ▼\n  Playwright  PyMuPDF       │       │    JSON/Anki\n  + BS4       (fitz)        │       │    Parser\n  Scraper     Extract       │       │\n       │       │            ▼       ▼\n       │       │     Cohere ASR (cohere-transcribe-03-2026)\n       │       │         Korean audio → text\n       │       │            │       │\n       ▼       ▼            ▼       ▼\n┌────────────────────────────────────────────────────────────┐\n│              EXTRACTION LAYER (GPU)                   │\n│                                                       │\n│  Qwen 3.5-9B VLM (AutoModelForImageTextToText)        │\n│  • Multimodal: text + page images → structured JSON   │\n│  • Streaming via TextIteratorStreamer                 │\n│  • Think/Non-think mode (enable_thinking flag)        │\n│  • Auto-force JSON after configurable char threshold  │\n│  • 3-attempt retry with partial JSON salvaging        │\n└─────────────────────────┬──────────────────────────────────┘\n                        │\n                   "
  },
  {
    "repo_id": "build-small-hackathon/CommitLens",
    "title": "Commitlens",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/CommitLens",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "commitlens",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 47,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: CommitLens\nemoji: 🔍\ncolorFrom: blue\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: urn any Git commit into a human-readable engineering report.\ntags:\n  - track:backyard\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# CommitLens — AI-Powered Code Review Pipeline\n\nhttps://huggingface.co/pkheria\n\n**CommitLens** is a high-performance information extraction and analysis pipeline that transforms raw GitHub diffs into structured, human-readable engineering reports. It uses a hybrid LLM approach: **JetBrains Mellum 2** for deep per-file analysis and **Groq-hosted Llama 3.3** for lightning-fast synthesis.\n\n![CommitLens UI](https://img.shields.io/badge/UI-Custom_HTML/CSS-blue)\n![Backend](https://img.shields.io/badge/Backend-Python_/_FastAPI-green)\n![LLM](https://img.shields.io/badge/LLM-Mellum_2_+_Groq_Llama_3.3-orange)\n\n## 🔗 Resources & Links\n\n- **Demo Video:** [Watch the Product Demo](https://youtu.be/TR8iNU5XnXw)\n- **Social Post :** [Linkedin Post](https://www.linkedin.com/posts/piyushkheria7_buildsmall-opensource-devtools-share-7472362275476041728-FdIx/)\n\n## 🚀 Key Features\n\n- **Automated Diff Extraction**: Fetches the two latest commits from any GitHub repository and generates semantic diffs.\n- **Top-Impact Filtering**: Automatically identifies and prioritizes the most significant changes (top 2 files by lines changed) to ensure high-signal reviews.\n- **Hybrid LLM Pipeline**:\n  - **Mellum 2 (12B)**: Performs surgical, per-file code analysis. 
Optimized with 4-bit NF4 quantization for efficient GPU utilization.\n  - **Groq (Llama 3.3 70B)**: Generates a high-level executive summary and key takeaways in milliseconds.\n- **Cinematic UI**: A bespoke, low-latency frontend featuring a custom particle engine, real-time status tracking, and a \"git-graph\" hero visualization.\n\n## 🛠 Tech Stack\n\n- **Core**: Python 3.12, FastAPI, Gradio (Server Mode).\n- **ML/Inference**: `transformers`, `bitsandbytes` (4-bit NF4 quantization), `torch`, `spaces` (ZeroGPU).\n- **APIs**: GitHub REST API, Groq Cloud API.\n- **Frontend**: Vanilla JavaScript (ES6+), HTML5 Canvas, CSS3 Grid/Flexbox.\n\n## 📂 Project Structure\n\n| File | Purpose |\n|------|---------|\n| `app.py` | Main application server; manages model lifecycle and GPU/API orchestration. |\n| `commitlens.py` | Data pipeline; handles GitHub API interaction, file filtering, and prompt engineering. |\n| `index.html` | Custom-built, high-fidelity frontend with interactive Git visualizations. |\n| `requirements.txt` | Dependency manifest (requests, gradio, torch, transformers, etc.). |\n\n## ⚙️ How It Works\n\n1. **Extraction**: The `GitHubClient` fetches commit metadata and raw patches.\n2. **Filtering**: Files are filtered by extension (keeping source code, ignoring binaries/locks) and sorted by impact.\n3. **Mellum Analysis**: The pipeline builds structured prompts containing \"Before\", \"After\", and \"Diff\" blocks. Mellum 2 generates concise summaries for each file.\n4. **Groq Synthesis**: Per-file summaries are batched and sent to Groq for a final structured Markdown report including a \"Commit Overview\" and \"Key Takeaways\".\n\n## 🛠 Setup & Usage\n\n### Local Development\n\n1. **Install dependencies**:\n   ```bash\n   pip install -r requirements.txt\n   ```\n\n2. **Set Environment Variables**:\n   ```bash\n   export GROQ_API_KEY=\"your_groq_api_key\"\n   ```\n\n3. 
**Run the application**:\n   ```bash\n   python app.py\n   ```\n\n### CLI Mode\nYou can also run the extraction pipeline directly:\n```bash\npython commitlens.py <github_repo_url> --token <optional_pat> --print-prompts\n```\n\n## 📄 License\nMIT\n"
  },
  {
    "repo_id": "build-small-hackathon/jawbreaker",
    "title": "Jawbreaker",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/jawbreaker",
    "zone": "Safety Shield",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "modal",
      "best",
      "backyard",
      "openbmb"
    ],
    "difficulty": "Useful",
    "quest": "🛡 Safety Shield: Find one situation where this could help a real user.",
    "likes": 30,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛡",
    "readme_text": "---\ntitle: Jawbreaker\nemoji: 🍬\ncolorFrom: yellow\ncolorTo: red\npinned: true\nsdk: gradio\nsdk_version: 6.16.0\npython_version: 3.12\napp_file: app.py\nlicense: mit\nshort_description: Private scam defense for someone you love.\ntags:\n- track:backyard\n- sponsor:openbmb\n- sponsor:openai\n- sponsor:modal\n- achievement:offgrid\n- achievement:welltuned\n- achievement:offbrand\n- achievement:sharing\n- achievement:fieldnotes\n- gradio\n- build-small-hackathon\n- backyard ai\n- backyard-ai\n- openbmb\n- minicpm\n- minicpm5\n- tiny titan\n- tiny-titan\n- well tuned\n- well-tuned\n- off brand\n- off-brand\n- off the grid\n- off-the-grid\n- best demo\n- best-demo\n- community choice\n- community-choice\n- bonus quest champion\n- bonus-quest-champion\n- sharing is caring\n- sharing-is-caring\n- field notes\n- field-notes\n- modal\n- best use of modal\n- best-use-of-modal\n- codex\n- openai\n- best use of codex\n- best-use-of-codex\n- local-first\n- scam-defense\n- zerogpu\nmodels:\n- openbmb/MiniCPM5-1B\n- build-small-hackathon/jawbreaker-minicpm5-1b-lora-v8\ndatasets:\n- build-small-hackathon/jawbreaker-scam-defense-data\n---\n\n<p align=\"center\">\n  <img src=\"jawbreaker_logo.png\" alt=\"Jawbreaker logo\" width=\"160\" />\n</p>\n\n# Jawbreaker\n\nScam defense for someone you love.\n\n**Try it:** [Live Space](https://huggingface.co/spaces/build-small-hackathon/jawbreaker) · [Demo video](https://youtu.be/oh0GRKYXvGM) · [Reddit post](https://www.reddit.com/r/huggingface/comments/1u48yt6/finetuned_a_1b_model_that_helps_families_check/) · [LinkedIn post](https://www.linkedin.com/posts/gsarveswaran_jawbreaker-private-scam-defense-for-someone-share-7471351364137164800-cv__/) · [X post](https://x.com/GothamSarves/status/2065649294623813925?s=20) · [Article](https://huggingface.co/blog/build-small-hackathon/jawbreaker-private-scam-defense) · [Model](https://huggingface.co/build-small-hackathon/jawbreaker-minicpm5-1b-lora-v8) · 
[Dataset/evals](https://huggingface.co/datasets/build-small-hackathon/jawbreaker-scam-defense-data) · [GitHub](https://github.com/gowtham0992/jawbreaker)\n\n**Why this exists:** The motivating user is a friend's grandmother who had already been affected by scam messages. Private details are intentionally omitted, but that family context shaped the product: this is not a generic spam classifier for security experts; it is a calm safety check for someone who needs to know whether to reply, click, call, or ask for help.\n\n## TL;DR for Judges\n\n- **Backyard AI:** a practical scam-defense safety card for non-technical people and their families.\n- **Best MiniCPM Build / Tiny Titan / Well-Tuned:** `openbmb/MiniCPM5-1B` + [Jawbreaker LoRA v8](https://huggingface.co/build-small-hackathon/jawbreaker-minicpm5-1b-lora-v8), evaluated on a 632-case hard suite with **0 dangerous undercalls** and **0 safe overcalls**.\n- **Best Use of Modal:** Modal (`modal.com`) A100 was used for LoRA training and guarded eval runs; see [`training/modal_train.py`](training/modal_train.py), [`training/modal_eval.py`](training/modal_eval.py), the [`632-case v8 report`](eval/reports/jawbreaker-minicpm5-1b-lora-v8-hard632-safetyguard-v4.json), plus the earlier [`394-case v4 report`](eval/reports/jawbreaker-minicpm5-1b-lora-v4-hard394-guarded.json).\n- **OpenAI / Best Use of Codex:** Codex-attributed commits plus [`CODEX_JUDGE_EVIDENCE.md`](CODEX_JUDGE_EVIDENCE.md), [`AGENT_TRACE.md`](AGENT_TRACE.md), and [`CODEX_BUILD_LOG.md`](CODEX_BUILD_LOG.md), with file-level contribution notes below.\n- **Off Brand / Off the Grid / Sharing is Caring / Field Notes:** custom candy-brutalist Gradio UI, no external LLM API, public [dataset/eval bundle](https://huggingface.co/datasets/build-small-hackathon/jawbreaker-scam-defense-data), and [`FIELD_NOTES.md`](FIELD_NOTES.md).\n- **Submission package:** [Live Space](https://huggingface.co/spaces/build-small-hackathon/jawbreaker), [demo 
video](https://youtu.be/oh0GRKYXvGM), [Reddit post](https://www.reddit.com/r/huggingface/comments/1u48yt6/finetuned_a_1b_model_that_helps_families_check/), [LinkedIn post](https://www.linkedin.com/posts/gsarveswaran_jawbreaker-private-scam-defense-for-someone-share-7471351364137164800-cv__/), [X post](https://x.com/GothamSarves/status/2065649294623813925?s=20), [article](https://huggingface.co/blog/build-small-hackathon/jawbreaker-private-scam-defense), [model](https://huggingface.co/build-small-hackathon/jawbreaker-minicpm5-1b-lora-v8), [dataset](https://huggingface.co/datasets/build-small-hackathon/jawbreaker-scam-defense-data), and [collection](https://huggingface.co/collections/build-small-hackathon/jawbreaker-6a263632dcd0b6d41ca914ff).\n\nJawbreaker is built around direct small-model inference to protect user privacy. The public hackathon Space runs MiniCPM5-1B + Jawbreaker LoRA on Hugging Face ZeroGPU for judge access, and the repo keeps local Transformers/GGUF tooling for running without hosted LLM APIs.\n\nJawbreaker helps a real person pause before clicking, replying, or sending money. Paste a suspicious text, email, or DM and Jawbreaker breaks "
  },
  {
    "repo_id": "build-small-hackathon/world-simulator",
    "title": "World Simulator",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/world-simulator",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "world",
      "simulator",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 10,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: World Simulator\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.17.3\npython_version: 3.11\napp_file: app.py\nheader: mini\nfullWidth: true\nstartup_duration_timeout: 1h\npinned: true\ntags:\n  - thousand-token-wood\n  - judges-wildcard\n  - best-demo\n  - best-agent\n---\n\n# World Simulator\n\nAn AI civilization sandbox: LLM-driven NPCs live, move, talk, and fight on a 80×20\ngrid, rendered in a Three.js/R3F 3D client. The Python simulation owns all meaning;\nthe renderer only displays it — so planners, memory, factions, and world events can\nbe added without touching the UI.\n\n> **Build Small Hackathon** — Track: **Thousand Token Wood** · Badges: **Judges'\n> Wildcard**, **Best Demo**, **Best Agent**\n>\n> - 📹 Demo video: https://youtu.be/cAIkA-WHoFM\n> - 📣 Social post: _TODO: add link_\n>\n> **Models (all < 32B):** NPC decisions run on `Nemotron-3-Nano-30B-A3B-FP8` (30B),\n> the God Console on `Qwen3.6-27B-FP8` (27B) — small models combined under the cap.\n\n## Quick Start\n\n```powershell\n# Backend (deterministic, no model calls)\n$env:WORLD_SIMULATOR_FORCE_DETERMINISTIC=\"1\"\nuv run python -m world_simulator run     # serves /health, /state, /scene/state, POST /tick\n\n# 3D UI\nnpm install\nnpm run dev                              # http://127.0.0.1:5173\n```\n\nControls: drag to orbit, scroll to zoom, `WASD`/arrows to pan.\n\nAutoplay is on by default: the manual tick controls are hidden and the world\nadvances ticks automatically. Set `VITE_WORLD_SIMULATOR_AUTOPLAY=0` (build/dev\nenv) to show the manual step / play-pause controls instead.\n\n## Hugging Face Space\n\n`app.py` wraps the same game runtime in `gradio.Server` and serves the built React/Three\nUI from `dist/frontend`. Build it first, then run:\n\n```powershell\nnpm install\nnpm run gradio\n```\n\n`requirements.txt` installs the local project from `pyproject.toml`. 
Set\n`WORLD_SIMULATOR_CONFIG` to override the default `config/game.modal.local.json`.\n\n## How NPCs Think\n\nEach tick, every living NPC gets its own model request containing its health, attack,\npersonality, god directive, recent memories, and visible NPCs within 5 blocks. The model\npicks exactly one action — `walk`, `talk`, or `attack` — and all decisions are collected\nthen applied together, so turn order grants no advantage. The server records outcome\nmemories afterward and compacts older ones into a rolling summary.\n\nA **God Console** (when an OpenAI-compatible connector is active) takes freeform commands\nlike _\"Ada wants to kill everyone\"_ and applies the model's tool calls to world state\n(health, attack, personality, directive, memory, position).\n\nConnector is binary: `connector.type` is `deterministic` (every tick deterministic) or\n`openai_compatible` (model every tick). `WORLD_SIMULATOR_FORCE_DETERMINISTIC=1` forces\ndeterministic ticks and disables model-backed features.\n\n## Model Configs\n\n- `config/game.modal.local.json` (default) — NPCs on a Modal vLLM Nemotron Nano H100\n  endpoint, God Console on a Modal vLLM Qwen3.6 H100 endpoint. Override with\n  `OPENAI_BASE_URL` / `OPENAI_API_KEY` / `OPENAI_MODEL`.\n- `config/game.local.json` — fully offline/deterministic.\n\nDeploy the Modal endpoints with `modal deploy deployments/modal_nemotron_nano.py` and\n`deployments/modal_qwen36_27b_h100.py`; smoke-test with `deployments/smoke_modal_endpoint.py`.\n\n## Development\n\n```powershell\nnpm run gradio\n```\n\nRuntime deps live in `pyproject.toml`; dev tools are in the `dev` group.\n"
  },
  {
    "repo_id": "build-small-hackathon/KnowledgeMesh",
    "title": "Knowledgemesh",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/KnowledgeMesh",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "knowledgemesh",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 45,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: BuildSmall KnowledgeHub\nemoji: 📚\ncolorFrom: blue\ncolorTo: purple\nsdk: gradio\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: AI knowledge hub for groups, powered by Nvidia\n\ntags:\n  - track:backyard\n  - sponsor:openai\n  - sponsor:nvidia\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n\n---\n\n# BuildSmall KnowledgeHub - https://huggingface.co/pkheria\n\nBuildSmall KnowledgeHub is a modular Gradio app for loading knowledge from:\n\n- Medium article links through Freedium\n- arXiv links or IDs\n- PDF documents\n\nIt extracts text, captures Medium image references/captions when available, chunks the content, embeds chunks locally with the configured NVIDIA Nemotron embedding model, uploads vectors into Qdrant, and generates grounded answers with NVIDIA's OpenAI-compatible chat API.\n\n## 🔗 Resources & Links\n\n- **Demo Video:** [Watch the Product Demo]([YOUR_DEMO_VIDEO_LINK_HERE](https://youtu.be/aDlKNW10pnw))\n- **Blog Post:** [Read the Full Write-up](https://huggingface.co/blog/pkheria/knowledgemesh)\n- **Social Post :** [Linkedin Post](https://www.linkedin.com/posts/piyushkheria7_buildsmall-generativeai-rag-share-7472326307721437184-pFrz/)\n## NVIDIA Usage\n\nThis project explicitly uses NVIDIA in two places:\n\n- Local retrieval embedding model: `nvidia/llama-nemotron-colembed-vl-3b-v2`\n- NVIDIA API chat model: `nvidia/nvidia-nemotron-nano-9b-v2`\n\nThe chat client calls:\n\n```text\nhttps://integrate.api.nvidia.com/v1\n```\n\nYou must provide `NVIDIA_API_KEY` as a Hugging Face Space secret or in your local `.env`.\n\n## Hugging Face Spaces Deployment\n\nFor ZeroGPU Spaces, add these Space variables:\n\n```bash\nENABLE_ZEROGPU=true\nEMBEDDING_DEVICE=cuda\nZEROGPU_DURATION_SECONDS=180\n```\n\nFor local Apple Silicon development, keep:\n\n```bash\nEMBEDDING_DEVICE=cpu\n```\n\nThe Gradio ingest, search, and answer callbacks are decorated with `spaces.GPU` when running on 
Hugging Face Spaces. Locally, the decorator becomes a no-op.\n\n## Hugging Face Secrets\n\nAdd these in your Space settings under **Settings → Variables and secrets**.\n\nRequired secrets:\n\n```bash\nNVIDIA_API_KEY=<your-nvidia-api-key>\nQDRANT_URL=<your-qdrant-url>\nQDRANT_API_KEY=<your-qdrant-api-key>\n```\n\nOptional variables:\n\n```bash\nQDRANT_COLLECTION_NAME=knowledge_base\nNVIDIA_API_URL=https://integrate.api.nvidia.com/v1\nNVIDIA_CHAT_MODEL=nvidia/nvidia-nemotron-nano-9b-v2\nNEMOTRON_EMBED_MODEL=nvidia/llama-nemotron-colembed-vl-3b-v2\nNEMOTRON_PARSE_MODEL=Qwen/Qwen2-VL-2B-Instruct\nHF_TOKEN=<token-if-needed-for-gated-model-downloads>\n```\n\nUse a hosted Qdrant instance for Hugging Face Spaces. `localhost:6333` only works for local development.\n\n## Qdrant Collection Name\n\nThe Ingest and Retrieve tabs each have their own collection-name field. Set both to the same Qdrant collection when you want to search what you just ingested. The fields are intentionally not auto-synced because auto-sync can cause continuous refreshes in hosted Gradio Spaces.\n\n## Setup\n\n```bash\npython3 -m venv .venv\nsource .venv/bin/activate\npip install -r requirements.txt\ncp .env.example .env\n```\n\nAdd `NVIDIA_API_KEY` to `.env` for chat completions. Start Qdrant locally or point `QDRANT_URL` to your hosted instance.\n\nThe default model split is:\n\n- Local parsing model: `Qwen/Qwen2-VL-2B-Instruct`\n- Local embedding model: `nvidia/llama-nemotron-colembed-vl-3b-v2`\n- NVIDIA API chat model: `nvidia/nvidia-nemotron-nano-9b-v2`\n\n## Run\n\n```bash\npython app.py\n```\n\nOpen the local Gradio URL printed in the terminal, usually `http://127.0.0.1:7860`.\n\nThe app binds to `0.0.0.0:7860`, which is suitable for Hugging Face Spaces and container deployments.\n"
  },
  {
    "repo_id": "build-small-hackathon/run-reachy-run",
    "title": "Run Reachy Run",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/run-reachy-run",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "run",
      "achievement",
      "reachy",
      "summary",
      "available"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 10,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Run Reachy Run\nemoji: 🏃\ncolorFrom: indigo\ncolorTo: purple\nsdk: gradio\nsdk_version: 6.10.0\napp_file: app.py\npinned: false\nmodels:\n  - black-forest-labs/FLUX.2-klein-9B\n  - openbmb/MiniCPM4.1-8B\n  - nvidia/Nemotron-3.5-Content-Safety\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# Run Reachy Run\n\nType **any** character → a local AI conjures it as your pixel runner → play instantly.\nA tile-based, juice-heavy infinite runner built for the **Build Small Hackathon** (Gradio × Hugging Face).\n\nThe world is fixed and hand-crafted; the **hero is yours**. Pick a prebuilt realm or bring your own character, then chase distance, coins, and a permanent prestige economy.\n\n## Demo & links\n\n- ▶️ **Demo video** — https://www.youtube.com/watch?v=ALCPq0HJBCs\n- 💬 **Launch post (LinkedIn)** — https://www.linkedin.com/feed/update/urn:li:activity:7471234801245724673/\n- 📝 **How it's built + what I learned (write-up)** — https://huggingface.co/blog/build-small-hackathon/run-reachy-run-blog\n\n## Play\n\n| Input | Action |\n|---|---|\n| `SPACE` / `↑` / tap | **Jump** over crates, walls, spikes & ground foes |\n| `X` / `J` | **Throw** to kill enemies (they drop coins). Barricades **block** shots — only Ultra smashes them |\n| land from above | **Stomp** a ground foe (bounce + coins) |\n| — | Run **under** flying foes; never jump into them |\n| `?` box | Grab a **power-up** · dive a **Coin Rush** for gold |\n| `P` / `Esc` | Pause (controls + power legend) |\n\nSpeed ramps the longer you run.\n\n## Progression & economy\n\n- **🛒 Shop** — level each skill (haste, shield, double-jump, magnet, armor, throw), unlock **Ultra Bullet** (a smash-shot that destroys barricades & spikes), and **Prestige** for a permanent coin multiplier. 
Everything is **global** — upgrades follow you into every realm and custom hero (only the power *names* re-theme per realm). Prestige also ranks up your coin skin: **Copper → Silver → Gold → Emerald**.\n- **Realms** — pick from the home screen's 4-way split: *Executive Suite*, *Code Crush*, and *Reachy's Lab* — all free, each themed top to bottom (background, foes, hero, and power-up names) — plus a **Bring Your Own** slot.\n\n### Power-ups (from the `?` box)\n\n| Power | Effect |\n|---|---|\n| 💥 Screen Wipe | Clears every hazard on screen + rains coins |\n| ⚡ Speed + 2× Score | Big speed boost and double score |\n| 🛡 Invincible | Plow straight through anything |\n| 🌙 Double Jump | A second mid-air jump |\n| 🧲 Coin Magnet | Pulls every coin on screen toward you |\n| 💢 Smash Shots | Your shots smash barricades & spikes too *(unlock in Shop)* |\n\nEach realm renames these to fit its theme (e.g. Coin Magnet → *Payday* / *VC Money* / *Bonus Pool*).\n\n## How it's built\n\n- **UI / deliverable** — a Gradio `gr.Blocks` app (`app.py`) whose entire surface is a custom HTML5 `<canvas>` game served from `static/` and embedded via a same-origin iframe. No stock Gradio widgets — the \"off-brand\" angle.\n- **Character art** — **FLUX.2-klein-9B** (Black Forest Labs) running **in this Space on ZeroGPU** (distilled, 4 steps, 512px), then chroma-keyed off magenta + downscaled into sprites (`sprites.py`). Locally the game ships with pre-generated sprites.\n- **Commentary** — MiniCPM (free hackathon API) for near-miss hype and death epitaphs, proxied same-origin via `POST /quip`.\n- **Content safety** — before any world is generated, the typed character (plus any photo caption) is screened by **NVIDIA Nemotron-3.5-Content-Safety** (Gemma-3 4B), hosted on a **[Modal](https://modal.com)** GPU and called as an inference API. 
A configurable per-category blocklist (`SAFETY_BLOCK`) stops serious harm while *allowing* fictional violence/weapons, and the gate is **fail-open** so a hiccup never blocks play. See `moderation.py` (client) + `modal_safety.py` (service).\n- **World & FX** — 100% procedural pixels: parallax, day/night tint, particles, screen shake, hit-stop.\n\n### Backend routes (mounted on Gradio's FastAPI server)\n\n| Route | Purpose |\n|---|---|\n| `POST /quip` | MiniCPM commentary |\n| `POST /generate` | world generation, safety-screened: Nemotron gate (on Modal) → MiniCPM theme pack → in-Space FLUX.2-klein sprites |\n\n## Project layout\n\n| File | Role |\n|---|---|\n| `app.py` | Gradio app + FastAPI routes (Space entrypoint) |\n| `static/index.html` · `style.css` · `game.js` | The canvas game (engine, UI, all menus) |\n| `sprites.py` | Chroma-key + downscale raw Flux PNGs into sprites |\n| `flux_local.py` | FLUX.2-klein-9B on ZeroGPU (in-Space sprite renderer) |\n| `moderation.py` · `modal_safety.py` | Content-safety gate — Nemotron-3.5-Content-Safety hosted on Modal (client + service) |\n| `test_sim.cjs` | Headless fairness test — an AI plays a full run to confirm it's beatable |\n\n## Run locally\n\n```bash\nexport MINICPM_URL=http://...:8001/v1/chat/completions   # hackathon MiniCPM endpoint\nexport MINICPM_KEY=sk-...                                 # …and its key (ca"
  },
  {
    "repo_id": "build-small-hackathon/FamilyLegacy",
    "title": "Familylegacy",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/FamilyLegacy",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "familylegacy",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 43,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: FamilyLegacy\nemoji: 📊\ncolorFrom: gray\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: AI-driven voice cloning to preserve the voices of loved ones\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n## 🔗 Resources & Links\n\n- **Demo Video:** [Watch the Product Demo]([YOUR_DEMO_VIDEO_LINK_HERE](https://youtu.be/e02qP1gLHVI))\n- **Blog Post:** [Read the Full Write-up](https://huggingface.co/blog/pkheria/familylegacy)\n- **Linkedin Post** [Read the post](https://www.linkedin.com/posts/piyushkheria7_buildsmall-opensource-generativeai-share-7472298281763770368-En6K/)\n\nhttps://huggingface.co/pkheria\n\n## 🌟 Project Overview\n\n**FamilyLegacy** is a compassionate AI application designed to bridge the gap between generations. By leveraging state-of-the-art voice cloning and large language models, it allows users to preserve the voices and stories of their loved ones, creating a digital legacy that can \"talk back\" with warmth and personality.\n\n## ✨ Key Features\n\n- **Voice Cloning:** Capture a short sample of a loved one's voice to create a high-fidelity digital twin using `VoxCPM2`.\n- **Memory Storage:** Upload audio recordings of family stories. The system transcribes them and stores them in a vector database (`Qdrant`) for semantic retrieval.\n- **Heartwarming Interactions:** Ask questions or prompt the AI to tell a story. 
The system retrieves relevant memories and generates a personalized response in the cloned voice.\n- **Emotionally Aware:** Responses are tailored to the relationship (e.g., grandmother, father, friend) with specific tonal traits like \"warm,\" \"wise,\" or \"cheerful.\"\n- **Privacy Focused:** Designed to be hosted securely, ensuring your family's precious data and voices remain under your control.\n\n## 🛠️ Tech Stack\n\n- **Backend:** [FastAPI](https://fastapi.tiangolo.com/) & [Gradio](https://gradio.app/)\n- **Voice Synthesis:** [VoxCPM2](https://github.com/OpenBMB/VoxCPM)\n- **Text Generation:** [Qwen2.5-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-3B-Instruct)\n- **Transcription:** [OpenAI Whisper](https://github.com/openai/whisper)\n- **Vector Database:** [Qdrant](https://qdrant.tech/) (for story memory retrieval)\n- **Primary Database:** [MongoDB](https://www.mongodb.com/) (for voice profiles and metadata)\n- **Embeddings:** `sentence-transformers/all-MiniLM-L6-v2`\n\n## 🏗️ Architecture\n\nThe application follows a modern AI pipeline:\n1.  **Ingestion:** Voice samples are saved to MongoDB. Story audio is transcribed via Whisper and indexed into Qdrant.\n2.  **Retrieval:** When a user asks a question, the system searches Qdrant for relevant family memories.\n3.  **Generation:** An LLM (Qwen) takes the retrieved context and generates a heartwarming response.\n4.  **Synthesis:** The generated text is passed to VoxCPM2 along with the reference voice profile to produce the final audio output.\n\n## 🚀 Getting Started\n\n### Prerequisites\n\n- Python 3.10+\n- MongoDB instance\n- Qdrant instance\n- FFmpeg (for audio processing)\n\n### Installation\n\n1.  **Clone the repository:**\n    ```bash\n    git clone <repository-url>\n    cd FAMILY-LEGACY\n    ```\n\n2.  **Set up a virtual environment:**\n    ```bash\n    python -m venv .venv\n    source .venv/bin/activate  # On Windows use `.venv\\Scripts\\activate`\n    ```\n\n3.  
**Install dependencies:**\n    ```bash\n    pip install -r requirements.txt\n    ```\n\n4.  **Environment Configuration:**\n    Create a `.env` file in the root directory and add your connection strings (do not share these!):\n    ```env\n    MONGO_URL=your_mongodb_connection_string\n    MONGO_DB_NAME=family_legacy\n    QDRANT_URL=your_qdrant_url\n    QDRANT_API_KEY=your_qdrant_api_key\n    ```\n\n### Running the App\n\n```bash\npython app.py\n```\nThe application will be available at `http://localhost:7860`.\n\n## 📜 License\n\nThis project is licensed under the MIT License - see the header of this file for details.\n"
  },
  {
    "repo_id": "build-small-hackathon/chhaya",
    "title": "Chhaya",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/chhaya",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "modal",
      "backyard",
      "tiny-titan",
      "well-tuned",
      "sharing"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 12,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Chhaya — Shade For Those Who Work In The Sun\nemoji: ⛱️\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 5.49.1\napp_file: app.py\npinned: true\nlicense: apache-2.0\nshort_description: MedGemma skin & heat-health companion for outdoor workers\ntags:\n  - track:backyard\n  - achievement:tiny-titan\n  - achievement:well-tuned\n  - achievement:sharing\n  - achievement:offgrid\n  - achievement:off-brand\n  - achievement:best-demo\n  - sponsor:modal\n  - build-small-hackathon\n  - backyard ai\n  - backyard-ai\n  - tiny titan\n  - tiny-titan\n  - well tuned\n  - well-tuned\n  - sharing is caring\n  - sharing-is-caring\n  - off the grid\n  - off-the-grid\n  - best demo\n  - best-demo\n  - off brand\n  - off-brand\n  - modal\n  - best use of modal\n  - best-use-of-modal\n  - medgemma\n  - kokoro\n  - healthcare\n  - gradio\n  - zerogpu\nmodels:\n  - google/medgemma-1.5-4b-it\n  - build-small-hackathon/chhaya-medgemma-lora\n  - hexgrad/Kokoro-82M\ndatasets:\n  - build-small-hackathon/chhaya-skin-extract\n---\n\n# Chhaya (छाया — \"shade\") ⛱️\n\n**A skin & heat-health companion for the people who work in the sun** — drivers, delivery riders, construction workers, street vendors, farmers.\n\nHeat waves are making outdoor work harsher every year, and the people most exposed to sun damage are the least likely to ever get a skin check. Chhaya is a small, free tool for exactly those hours.\n\n**Built for the Hugging Face Build Small Hackathon (Backyard AI track).**\n\n▶️ **Demo videos:** [Full walkthrough (4.5 min)](https://youtu.be/-4Fh8WYlUbo) · [90-second cut](https://youtu.be/0465erKuv7Y)\n📝 **Blog:** [Chhaya: teaching a 4B model to be the eyes, not the doctor →](https://huggingface.co/blog/build-small-hackathon/chhaya-blog)\n📄 [Launch post on LinkedIn →](https://www.linkedin.com/posts/manjunathan-r-06396b1b7_buildsmallhackathon-medgemma-healthtech-share-7472213562757734400-cZby/)\n\n## What it does\n\n1. 
**☀️ Skin check** — snap a photo of any spot or patch of skin. **MedGemma-1.5-4B** describes what it sees (type, colour, borders, symmetry, texture), flags how concerning it looks (`looks ordinary` / `worth watching` / `show a doctor`), and spots visible sun & heat damage.\n2. **Plans built around *your* work day** — tell Chhaya your occupation, hours in direct sun, and water access; it builds a hydration schedule and protection plan that fits (drivers learn about window-side UV, riders about helmet-line burns, construction workers about NDMA work-hour guidance).\n3. **🌡️ Heat symptom check** — first-aid triage for heat cramps / heat exhaustion / heatstroke. Deliberately **zero AI**: pure deterministic logic from NDMA / WHO first-aid guidance, because emergencies are not the place for sampling.\n4. **🗂️ My record** — save checks per body part within the session and re-compare the same spot over time. The spot that *changes* is the spot that matters.\n5. **🔊 Read it aloud** — every verdict and the heat first-aid steps can be spoken with one tap (Kokoro-82M, on CPU), in **English or Hindi**, so the plan reaches workers who don't read English easily.\n6. **👆 One-tap sample workers** — preloaded persona cards (a bike-taxi rider, an auto driver, a vendor, a farmer) prefill a real skin photo + work-day context and run the check instantly, so anyone can try Chhaya without uploading. Sample photos are open-license images from ISIC + Google SCIN; names/jobs are fictional.\n\n## Design principle: the model is the eyes, the guidelines are the medicine\n\nMedGemma reads the photo and returns a structured description — that's all it's trusted with. 
Every medical number and action (litres per shift, ORS timing, 12–3 pm shade window, when to call 108/112) is curated, deterministic content from:\n\n- **NDMA India** heat-action / heat-illness first-aid guidance\n- **WHO** heat-health advice\n- **Cancer Council ABCDE** skin self-check criteria (Asymmetry, Border, Colour, Diameter, Evolving)\n\nThe model never invents medical advice. Chhaya describes — it never diagnoses.\n\n## Stack\n\n- **Vision model:** [`google/medgemma-1.5-4b-it`](https://huggingface.co/google/medgemma-1.5-4b-it) (4B, image-text-to-text, bf16) — well under the 32B cap — served with our **fine-tuned LoRA adapter** [`build-small-hackathon/chhaya-medgemma-lora`](https://huggingface.co/build-small-hackathon/chhaya-medgemma-lora)\n- **Speech:** [`hexgrad/Kokoro-82M`](https://huggingface.co/hexgrad/Kokoro-82M) text-to-speech, run on **CPU** so read-aloud never touches the GPU quota\n- **Training data:** [`build-small-hackathon/chhaya-skin-extract`](https://huggingface.co/datasets/build-small-hackathon/chhaya-skin-extract) — built from ISIC-2024 (biopsy-anchored) + Google SCIN (phone photos, Monk skin tones 1–7)\n- **Fine-tune compute:** QLoRA on **Modal** (~$6 of credits, v1 + v2)\n- **Runtime:** Gradio on Hugging Face Spaces **ZeroGPU**\n- **UI:** fully custom \"step into the shade\" theme — full-bleed illustrated hero, striped awning, editorial Fraunces headline, IBM Plex Mono kickers, rubber-stamp verdicts, tactile icon tiles and a daw"
  },
  {
    "repo_id": "build-small-hackathon/sangue-e-grafi",
    "title": "Sangue E Grafi",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/sangue-e-grafi",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "sangue",
      "grafi",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 14,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Sangue e Grafi\nemoji: \"🩸\"\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.15.2\npython_version: \"3.11\"\napp_file: app.py\npinned: false\nlicense: mit\ntags:\n  - knowledge-graph\n  - grpo\n  - ontology\n  - inheritance-reasoning\n  - build-small-hackathon\n  - track:wood\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# 🩸 Sangue e Grafi 📊\n\n### Blood & Graphs — Ontology-Guided Reasoning for Small Models\n\n> **Hugging Face Build Small Hackathon 2026**\n> Track: 🍄 *An Adventure in Thousand Token Wood*\n>\n> 📓 [Blog Post](https://wordlift.io/blog/en/sangue-e-grafi/) · 🎬 [Video Demo](https://wor.ai/yt-sangue-e-grafi)\n\n---\n\n## 🎯 The Pitch\n\nCan a small open model outperform much larger frontier models on adversarial inheritance reasoning?\n\n**Yes, when the task is not about linguistic fluency, but about preserving relational structure.**\n\n**Sangue e Grafi** is a research prototype showing how a small ontology-guided model can solve inheritance puzzles that often mislead text-only frontier models. The failure mode is not random. It is a form of **semantic drift**: the model follows emotionally salient or semantically related information while dropping the formal dependency that determines the correct answer.\n\nIn our testbed, the model must decide who inherits an asset according to a will. The narrative contains distractors: spouses, caregivers, emotionally important relatives, and socially prominent figures. A text-only model may select the most salient person. A graph-guided agent must instead follow the legally relevant biological lineage encoded in the ontology.\n\nThis project builds on two research threads:\n\n1. 
[**RLM-on-KG**](https://github.com/wordlift/rlm-on-kg) ([arXiv:2604.17056](https://arxiv.org/abs/2604.17056)), where a language model acts as an autonomous navigator over a Knowledge Graph.\n2. [**SEOcrate-4B**](https://huggingface.co/cyberandy/SEOcrate-4B_grpo_new_01), our small-model GRPO experiment for SEO reasoning.\n\nSangue e Grafi combines these ideas: it uses the graph-navigation pattern from RLM-on-KG and the post-training pattern from SEOcrate, then applies both to a harder relational reasoning problem where the reward comes from OWL axioms instead of an LLM judge.\n\nThe core idea:\n\n> Use the ontology not only as documentation or retrieval memory, but as a **deterministic reward layer** for training small models to follow valid reasoning paths.\n\n---\n\n## 🧠 Why This Matters\n\nLarge language models are excellent at semantic association. They understand that a spouse, a caregiver, a child, and an heir all belong to the same family story.\n\nBut inheritance reasoning is not solved by semantic similarity.\n\nIt requires preserving:\n\n* identity,\n* biological lineage,\n* directionality,\n* age constraints,\n* aliveness,\n* exclusions,\n* and the exact clause of the will.\n\nThis is a small example of a broader problem in agentic AI.\n\nEnterprise agents fail in similar ways:\n\n* a product is similar, but not compatible;\n* a claim is plausible, but not allowed;\n* a workflow step is reasonable, but a prerequisite is missing;\n* a customer is relevant, but not eligible;\n* a document is related, but not authoritative;\n* an entity is mentioned, but not the canonical one.\n\nThe problem is not lack of language. 
The problem is missing **constraint-preserving reasoning**.\n\nSangue e Grafi explores how ontologies and Knowledge Graphs can make this missing structure explicit, verifiable, and eventually trainable.\n\n---\n\n## 📊 Benchmark Results\n\nWe tested **10 procedurally generated inheritance puzzles** using the same seeds across several frontier baselines and our graph-guided agent.\n\nEach puzzle contains semantically primed distractors designed to mislead text-only reasoning.\n\n### Accuracy\n\n| Model                |       Approx. scale       |   Score   | Accuracy |\n| :------------------- | :-----------------------: | :-------: | :------: |\n| Gemini 2.5 Flash     |     frontier baseline     |    3/10   |    30%   |\n| Gemini 3.5 Flash     |     frontier baseline     |    6/10   |    60%   |\n| Gemini 3.1 Pro       |     frontier baseline     |    6/10   |    60%   |\n| **Gemma + KG Agent** | **small model + KG tools** | **10/10** | **100%** |\n\n> **Note**: This is a small adversarial benchmark, not a general claim that small models outperform frontier models. The result shows that explicit structure can dominate scale when the task depends on formal relational constraints.\n\n### Per-scenario breakdown\n\n| Seed | Gold Answer        |  2.5 Flash |  3.5 Flash |  3.1 Pro | KG Agent |\n| :--: | :----------------- | :--------: | :--------: | :------: | :------: |\n|  42  | Claudio Sala       |  ❌ Enrico  |  ❌ Enrico  | ❌ Enrico |     ✅    |\n|  99  | Damiano Lombardi   |   ❌ Marta  |   ❌ Marta  |  ❌ Marta |     ✅    |\n|  137 | Ilaria Colombo     |  ❌ Silvia  |      ✅     |     ✅    |     ✅    |\n|  256 | Gio"
  },
  {
    "repo_id": "build-small-hackathon/pawmap",
    "title": "Pawmap",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/pawmap",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "pawmap",
      "summary",
      "available",
      "yet.",
      "docker"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 9,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: PawMap\nemoji: 🐾\ncolorFrom: green\ncolorTo: green\nsdk: docker\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: Collaborative stray animal mapping with AI\ntags:\n  - stray-animals\n  - animal-welfare\n  - maps\n  - computer-vision\n  - small-models\n  - gradio\n  - build-small-hackathon\n  - backyard-ai\n  - leaflet\n  - cosine-similarity\n  - track:backyard\n  - sponsor:nvidia\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\nmodels:\n  - meta-llama/Llama-3.2-11B-Vision-Instruct\n  - nvidia/nemotron-3-nano-30b-a3b\n  - sentence-transformers/all-MiniLM-L6-v2\n---\n\n# PawMap 🐾 — Collaborative stray animal mapping with AI\n\nTake a photo of the stray animal you just spotted. The AI identifies species, breed and color — and checks if it's already been registered before.  \nOver time, the map tells each animal's story: where it shows up, who helped, whether it's doing okay.\n\nBuilt for the [**Build Small Hackathon 2026**](https://huggingface.co/build-small-hackathon) · **Backyard AI** track 🏡\n\n**Try it:** [live Space](https://huggingface.co/spaces/build-small-hackathon/viralata-mapper) · **Traces:** [dataset on the Hub](https://huggingface.co/datasets/build-small-hackathon/viralata-mapper-storage)\n\n> 🎬 **Demo video:** https://youtu.be/Oa9oYtmZQHU\n> 📣 **Social post:** `https://www.linkedin.com/posts/sara-holanda_buildsmallhackathon-smallmodels-huggingface-ugcPost-7472398720395997186-sfM6/?utm_source=share&utm_medium=member_desktop&rcm=ACoAADDScDkBYQSXcI8-nyPjr-4BO2deLlWJw2w`\n> 📝 **Article:** https://huggingface.co/blog/build-small-hackathon/pawmap\n---\n\n## The Problem\n\nBrasília, DF — the capital of Brazil — has thousands of stray dogs and cats scattered across its residential wings, parking lots and green areas. Independent rescuers know each animal by nickname, by street, by habit. 
But that knowledge lives in WhatsApp groups, in individual memory, nowhere persistent.\n\nThe friend who inspired this app has been feeding a colony of cats near the Asa Norte commercial strip for years. She knows exactly which ones have been neutered, which ones have a history of injury, which one vanished last month. But when someone new wants to help — or when she needs backing to request food donations — there's nothing to show. No map, no timeline, no proof that a specific animal has been living on that block for two years.\n\n**PawMap is that record.** Anyone takes a photo of a stray animal with their phone. The AI identifies it, GPS pins the location, and the app checks whether that animal has been seen before — linking all sightings to its profile. The map is public. The animal's trail becomes visible. Anyone who wants to help knows exactly where to go.\n\n---\n\n## How it works\n\nThere are two AI-powered flows: **registering a sighting** and **recording help**.\n\n### Sighting flow\n\n```mermaid\nflowchart TD\n    A[\"📷 Photo from phone\"] --> B[\"🤖 Vision AI\\nLlama-3.2-11B-Vision\\n(NVIDIA NIM / HF)\"]\n    B -->|\"is it an animal?\"| C{Animal detected?}\n    C -->|\"No\"| ERR[\"❌ Error message\\n'No dog or cat identified'\"]\n    C -->|\"Yes\"| D[\"🧠 Nemotron 3 Nano (brain)\\nreasons over the description →\\nstructured JSON: species · breed · color · condition · marks\"]\n    D --> E[\"🔢 Semantic embedding\\nall-MiniLM-L6-v2 · 384 dim\\n(local, 22M params)\"]\n    E --> F{Cosine similarity\\n≥ 0.80?}\n    F -->|\"Yes — known animal\"| G[\"➕ New sighting\\nadded to existing animal\"]\n    F -->|\"No — new animal\"| H[\"🆕 New animal\\nregistered in the database\"]\n    G --> I[\"💾 SQLite\"]\n    H --> I\n    I --> J[\"🗺️ Leaflet map\\nprofiles · trail · gallery\"]\n```\n\nBefore confirming, the user sees the top 3 most similar animals already in the database — they can edit what the AI detected, give the animal a name, and note its condition. 
Only then is the sighting saved.\n\n### Help flow\n\nWhen someone wants to help an animal, they can submit a **help proof** — a photo showing they fed it, treated a wound, or got it to a vet. The AI does two things with that photo:\n\n```mermaid\nflowchart TD\n    A[\"📷 Help proof photo\\n(optional)\"] --> B[\"🤖 Vision AI\\nanalyzes the new photo\"]\n    B --> C[\"🔢 New embedding\\ngenerated from description\"]\n    C --> D{Cosine similarity\\nagainst registered animal?}\n    D -->|\"Match ≥ 0.80\\nsame animal ID\"| E[\"✅ AI-verified\\nmatch score shown\"]\n    D -->|\"No match\"| F[\"⚠️ Unverified\\nstill recorded\"]\n    B --> G{Condition improved?}\n    G -->|\"e.g. injured → healthy\"| H[\"📈 Condition update\\nlogged on profile\"]\n    G -->|\"same or worse\"| I[\"No change\"]\n    E --> J[\"💾 Help event saved\\nto sightings table\"]\n    F --> J\n    H --> J\n    I --> J\n```\n\nThe profile page shows the full help history with photos, help type (fed / treated / rescued / other), AI verification status and any detected condition improvement.\n\nThe map uses color to signal urgency: **🟢 green** = dog · **🟠 orange** = cat · **🔴 red** = not seen in +30 days.\n\n---\n\n## What's inside\n\n| Component | Model / Library | Where it runs |\n|---|---|---|\n| Visual perception | **Llama-3.2-11B-"
  },
  {
    "repo_id": "build-small-hackathon/anim-vid-ai",
    "title": "Anim Vid Ai",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/anim-vid-ai",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "track",
      "anim",
      "vid",
      "summary"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 8,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Anim Vid AI\nemoji: 🎬\ncolorFrom: purple\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: true\nshort_description: Turn any topic into an engaging Manim animation video\ntags:\n- track:backyard\n- track:wood\n- sponsor:openai\n- sponsor:nvidia\n- sponsor:modal\n- achievement:offgrid\n- achievement:welltuned\n- achievement:offbrand\n- achievement:sharing\n- achievement:fieldnotes\n---\n\n# 🎬 Anim Vid AI\n\n**Turn any topic or concept into a beautiful Manim animation — powered by fine-tuned small models.**\n\n[![Build Small Hackathon](https://img.shields.io/badge/Build%20Small%20Hackathon-blue)](https://huggingface.co/build-small-hackathon)\n[![Track 1: Backyard AI](https://img.shields.io/badge/Track%201-Backyard%20AI-blueviolet)](https://huggingface.co/build-small-hackathon)\n[![Track 2: An Adventure in Thousand Token Wood](https://img.shields.io/badge/Track%202-An%20Adventure%20in%20Thousand%20Token%20Wood-blueviolet)](https://huggingface.co/build-small-hackathon)\n\n[![Demo Video](https://img.shields.io/badge/▶_Watch_Demo-FF0000?style=for-the-badge&logo=youtube&logoColor=white)](https://youtu.be/nEQo2R7aMSo)\n[![Social Media Post](https://img.shields.io/badge/Social_Media_Post-000000?style=for-the-badge&logo=x&logoColor=white)](https://x.com/ovi054/status/2066592295122833861)\n[![Blog Post](https://img.shields.io/badge/📓_Field_Notes_Blog-FF6B35?style=for-the-badge)](https://huggingface.co/blog/build-small-hackathon/anim-vid-ai)\n\n## 📖 Overview\n\nAnim Vid AI lets you describe any concept — a math formula, a physics law, an algorithm — and instantly generates a complete, working Manim animation. The app uses a fine-tuned Qwen3-14B model to write the Manim Python code, with a Qwen3-0.6B draft model for speculative decoding, then hands the code off to a companion Manim rendering Space to produce the final video.\n\nNo Manim experience needed. 
Just describe what you want to visualize.\n\n## 🏆 Hackathon Patches\n\n| Patch | Status |\n| :--- | :--- |\n| 🔬 **Small Models Only** — Qwen3-14B (14B params) + Qwen3-0.6B draft model, both well under the 32B limit | ✅ |\n| 🔌 **Off the Grid** — No cloud APIs. Runs entirely on the model weights loaded on GPU | ✅ |\n| 🎯 **Well-Tuned** — Custom LoRA adapters [`ovi054/qwen3-14b-manim-lora`](https://huggingface.co/ovi054/qwen3-14b-manim-lora) and [`ovi054/qwen3-0.6b-manim-lora`](https://huggingface.co/ovi054/qwen3-0.6b-manim-lora) published on Hub | ✅ |\n| 🎨 **Off-Brand** — Custom UI built beyond default Gradio styling | ✅ |\n| 📡 **Sharing is Caring** — Agent trace shared on the Hub | ✅ |\n| 📓 **Field Notes** — Blog post on what was built and learned | ✅ |\n\n### Key Features\n\n* **Topic → Animation:** Describe any concept in plain English and get a rendered Manim video.\n* **Speculative Decoding:** Pairs the 14B main model with a 0.6B draft model for faster generation.\n* **Editable Code:** Generated Manim Python code is fully editable before rendering — tweak it live.\n* **Streaming Generation:** Code streams token-by-token so you can see progress in real time.\n* **Microservice Rendering:** Code generation and rendering are split across two Gradio Spaces — this app handles AI inference, while a companion Space handles Manim rendering, keeping each concern isolated and independently scalable.\n\n---\n\n## 🧠 LoRA Fine-Tuning Process\n\nBoth LoRA adapters — one for the 14B main model and one for the 0.6B draft model — were trained from scratch for this project.\n\n### The Dataset\n\nPublic Manim code is hard to come by at the scale and quality a fine-tune actually needs — and what does exist rarely matches a specific rendering environment's requirements. 
To solve this, ~10,000 Manim Python scripts were synthetically generated using **Claude Sonnet 4.5**, seeded from topics collected during a prior Manim animation project built on fal.ai.\n\nThis gave full control over quality and consistency: every script follows ManimCE v0.19 syntax, names the scene class `MyScene`, ends with `self.wait()`, and covers a broad range of subjects — geometry, calculus, physics, algebra, algorithms, and more. The result is a clean, uniform dataset that **both LoRA adapters — the 14B and the 0.6B — were trained on identically**, ensuring their learned token distributions stay closely aligned.\n\n### Training Setup\n\nBoth models were fine-tuned on **Modal** using an **A100 GPU**, completing training in approximately **3 hours** total.\n\n### What the LoRA Actually Fixed\n\nBefore fine-tuning, even with an explicit system prompt, base Qwen3 models produced unreliable Manim output. The LoRA consistently resolved all of these issues:\n\n| Behaviour | Base Model (no LoRA) | Fine-Tuned with LoRA |\n| :--- | :--- | :--- |\n| **Output format** | Sometimes returned plain text explanations instead of code | Always returns a valid Python code block |\n| **Code structure** | Ignored the `MyScene` class naming and `self.wait()` requirement even when told explicitly | Consistently follows the required structure |\n| **Library com"
  },
  {
    "repo_id": "build-small-hackathon/CodeFlow",
    "title": "Codeflow",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/CodeFlow",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "codeflow",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 8,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: CodeFlow\nemoji: 📊\ncolorFrom: indigo\ncolorTo: blue\nsdk: gradio\npython_version: '3.13'\nsdk_version: 6.16.0\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: Turn code into a readable Mermaid.js flowchart 📊!\ntags:\n- track:backyard\n- achievement:offgrid\n- achievement:sharing\n- achievement:offbrand\n- achievement:llama\n- achievement:fieldnotes\n- achievement:welltuned\n- build-small-hackathon\n- backyard-ai\n- llama-cpp\n- field-notes\n- sharing-is-caring\n- off-brand\n- off-the-grid\n- code\n- mermaid.js\n- flowchart\n- small-models\n- seq2seq\n- gradio\n- agentic\n---\n\n# 📊 CodeFlow\n\n**Paste code → read its logic as a flowchart.** A 30B coder model runs entirely on **CPU via llama.cpp** to translate source code into a clean, animated [Mermaid.js](https://mermaid.js.org/) control-flow diagram — with each node wired back to the exact lines it came from.\n\n### 🔗 Links\n\n[🚀 **Live Space**][space] · [▶️ **Demo Video**][video] · [🐦 **Social Post**][social] · [📓 **Field Notes (blog)**][blog] · [🔍 **Agent Traces**][traces] · [🎛️ **Fine-Tuned Model**][model]\n\n[space]:  https://huggingface.co/spaces/build-small-hackathon/CodeFlow  \"Hugging Face Space\"\n[video]:  https://youtu.be/R5GbpN9FVxo  \"Demo video\"\n[social]: https://www.linkedin.com/feed/update/urn:li:share:7471327684539785217/  \"Social post\"\n[blog]:   https://huggingface.co/blog/build-small-hackathon/codeflow-field-notes  \"Field notes / blog post\"\n[traces]: https://huggingface.co/datasets/build-small-hackathon/codeflow-agent-traces  \"Agent traces dataset\"\n[model]:  https://huggingface.co/build-small-hackathon/codeflow-qwen-3-finetuning  \"Fine-tuned model\"\n\n---\n\n## ❓ The Problem\n\nReading unfamiliar code means simulating its control flow in your head — chasing branches, loops, and early returns line by line. That's slow, error-prone, and gets worse the deeper the nesting. 
Existing \"code → diagram\" tools are usually rigid AST parsers (brittle, language-locked) or cloud LLM APIs (your code leaves the building).\n\n**CodeFlow** turns any snippet into a scannable flowchart you can audit at a glance — generated by a real language model that runs **100% locally**, so nothing is sent to an external API.\n\n## ⚙️ How It Works\n\n```\n Paste code ──▶ Generate ──▶ POST /generate_flowchart        (Gradio API)\n                                    │\n                    number the source lines + structured system prompt\n                                    │\n          CodeFlow fine-tune of Qwen3-Coder-30B-A3B  (llama.cpp · CPU)\n                                    │\n                 <thinking> …reasoning… </thinking>\n                 graph TD … nodes & edges …\n                 <linemap> A:1  B:2  C:3-4 </linemap>\n                                    │\n        strip reasoning · parse + validate the line-map · sanitize labels\n                                    │\n                  { mermaid, linemap }  ──▶  append agent_traces.jsonl\n                                    │\n   Mermaid render + \"trace-the-path\" reveal + node ↔ code linking\n```\n\n1. You paste code (or pick a pre-rendered example) into the **CodeMirror** editor and hit **Generate**.\n2. The backend numbers the source lines and sends them with a strict system prompt to the **CodeFlow fine-tune of Qwen3-Coder** running on **llama.cpp**.\n3. The model returns hidden `<thinking>`, the Mermaid `graph`, and a `<linemap>` mapping every node to its source line(s).\n4. The server strips the reasoning, **validates** the line-map against the source, sanitizes labels for Mermaid, and returns `{ mermaid, linemap }`.\n5. The frontend renders the diagram with a **trace-the-path reveal** that flows out of a persistent Start node while the canvas scrolls along in real time.\n6. 
**Node ↔ code linking:** hover a node to highlight its source lines, click a node to jump-and-edit them, or move your cursor over a line to light up the matching node.\n7. Every generation is captured as a structured **agent trace** (`/traces`).\n\n## 🎛️ Fine-Tuning\n\nCodeFlow runs a [**LoRA fine-tune**][model] of **Qwen3-Coder-30B-A3B-Instruct** (≈30.5B params), specialized for the code → Mermaid + `<linemap>` task rather than relying on the base model's general coding ability.\n\n- **Data:** **2,400 synthetic examples** (2,208 train / 192 val — 8% holdout), built from **22 control-flow templates** across **Python, JavaScript, C++, and C**.\n- **Method:** LoRA `r=16, α=32` on the attention + MLP projections, **bf16**, cosine schedule — then merged and exported to a **Q3_K_L GGUF** for CPU inference.\n- **Validation:** the holdout is **hard-validated** — generated outputs are syntax-checked / compiled, not just eyeballed.\n\nSee the [model card][model] for the full data engine, `finetune.py` options, and dataset preview.\n\n## 🧰 Tech Stack\n\n| Layer | What it is | Used for |\n|---|---|---|\n| **Model** | [**CodeFlow fine-tune**][model] of [Qwen3-Coder-30B-A3B-Instruct](https://huggingface.co/Qwen) (Mixture-of-Experts) | Code → Mermaid + line-map generation |\n| **Fine-tuning** | LoRA SFT (`r=16"
  },
  {
    "repo_id": "build-small-hackathon/her",
    "title": "Her",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/her",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "her",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 21,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Her · हेर\nemoji: 🕵️\ncolorFrom: gray\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.16.0\napp_file: app.py\npython_version: \"3.10.13\"\npinned: false\nshort_description: A detective for your Claude Code sessions\nstartup_duration_timeout: 1h\ntags:\n  - track:backyard\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n<!-- ↑ Hugging Face Space config (Gradio SDK + ZeroGPU). The app entrypoint is app.py\n     (Gradio Server mode); ZeroGPU hardware is set on the Space, not in this header.\n     This README is the Space card — it describes the HOSTED, upload-based version.\n     The fully-local product (`./her`, no upload) lives in the same repo; see the foot. -->\n\n<p align=\"center\">\n  <img src=\"ui/public/her-logo.png\" alt=\"Her · हेर\" height=\"120\" />\n</p>\n\n<h1 align=\"center\">Her · हेर</h1>\n<p align=\"center\"><em>हेर — Marathi for “detective.”</em><br/>\nA detective for your coding-agent sessions. Drop a Claude Code session export and Her\nreads the whole trace — so you can see what actually happened, and what to do better\nnext time.</p>\n\n---\n\n## What this Space does\n\nUpload your Claude Code session exports (`.jsonl`) and Her investigates them:\n\n- **The journey.** Every query as a node, sized by cost, the heaviest glowing — with a\n  plain-English **“what happened”** on top and the deterministic cost-shape below.\n- **The dataflow.** The tool calls along each turn, with the **proven value-flow** path\n  highlighted on focus (a value that reappeared *verbatim* from an earlier result) —\n  proven (solid) vs. 
hypothesis (dotted), always kept separate.\n- **Risky moves, surfaced.** Deploys, production & config changes, secrets — the actions\n  worth a second look, each traceable to the turn it happened in.\n- **What to do better.** Tips grounded in Anthropic’s and the community’s best practices.\n  Her **suggests, never asserts** — and stays silent unless a named, fixable pattern fires.\n- **Ask Her.** A chat bound to your trace. *“Why was this turn so expensive?”* → she\n  answers from the trace, **cites the turns**, and opens the exact tool call.\n\n## How to use it\n\n**One or a few sessions — drag & drop.** Find a session file under\n`~/.claude/projects/<encoded-folder>/<session-id>.jsonl`, then drop it onto the page\n(or click **Upload .jsonl**). One file opens a **session view**; drop several to build a\n**project view** across them.\n\n**All your projects at once — the uploader script.** Grab `scripts/her_upload.py` from\nthis Space’s **Files** tab (or `hf download <owner>/<space> scripts/her_upload.py\n--repo-type space --local-dir .`) and run it:\n\n```bash\npython scripts/her_upload.py\n```\n\nIt **copies** the sessions you pick into a staging folder, **scrubs likely secrets**, and\n**uploads** them — each step waits for your approval — then prints a link that opens your\n**Projects view** here. 
A project groups many sessions under one working directory, with a\nplain-English **changelog across sessions** and **Ask Her about the project**\n(*“when did we add column X?”* → names the exact session).\n\n## Your data & privacy\n\nThis is the hosted version, so your sessions **are** uploaded to analyze them — but they\nstay yours and don’t stick around:\n\n- **Private to your browser.** Each browser gets a random token (`crypto.randomUUID()`);\n  your uploads land in a namespace keyed to it, so **you only ever see your own sessions**.\n- **Temporary by default.** A background sweeper deletes anything older than **24 hours**;\n  **“clear my data”** wipes your namespace immediately, and the tab-close does a best-effort\n  clear too.\n- **Scrubbed on the way in.** The uploader redacts likely secrets before anything leaves\n  your machine (best-effort — review the staged copies if unsure).\n- **No trace content ever leaves the Space.** The optional “share learnings” path (bare,\n  scrubbed *tool names* only — never commands, paths, code, or JSONL) is **off** here.\n- **Guardrails.** Up to **70 MB** per session file, **50 sessions** per project, **50\n  projects** per browser — enough for real work, capped so no one can flood the box.\n\n## What makes her trustworthy\n\n- **Deterministic core, model for prose only.** Value-flow edges, token sums, loop &\n  re-read detection, heavy-turn ranking, entity & binary extraction, risk scanning —\n  **pure code, no model.** A model is used *only* to write the English and to *propose*\n  (never assert) findings. The numbers don’t move when the model changes.\n- **Proven vs. 
hypothesis is always separated.** A verbatim value reappearance is asserted;\n  temporal proximity is a hypothesis you judge.\n- **Cost alone is never advice.** “Expensive but clean” is a valid, important output.\n\n## The model\n\nNarration — the plain-English summaries, advice prose, and chat — runs **on the Space**\non **`nvidia/Nemotron-Mini-4B-Instruct`** via **ZeroGPU**. The first narration after a cold\nstart can take a few seconds while the GPU spins up. Swap the model with the\n**`SPACE_MODEL_REPO`** Space variable — no"
  },
  {
    "repo_id": "build-small-hackathon/ux-crime-scene",
    "title": "Ux Crime Scene",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/ux-crime-scene",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "crime",
      "scene",
      "summary",
      "available"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 9,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: UX Crime Scene\nemoji: 🔎\ncolorFrom: red\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.16.0\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: A film-noir detective investigates your UI as a crime scene.\nmodels:\n  - Qwen/Qwen2.5-VL-7B-Instruct\n  - black-forest-labs/FLUX.2-klein-4B\n  - nvidia/Nemotron-Mini-4B-Instruct\n  - hexgrad/Kokoro-82M\ndatasets:\n  - build-small-hackathon/ux-crime-scene-traces\ntags:\n  - track:wood\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# 🔎 UX Crime Scene\n\n### *Every interface hides a crime.*\n\n[![UX Crime Scene — watch the trailer](assets/poster.jpg)](https://youtu.be/6u58YIEPrkA)\n\n<p align=\"center\"><b>▶️ <a href=\"https://youtu.be/6u58YIEPrkA\">Watch the trailer</a></b></p>\n\nDrop a screenshot of **any** website or app. **THE INSPECTOR** — a hard-boiled,\nfilm-noir detective — works the scene, circles every UX flaw as evidence, and\nfiles a verdict with a letter grade.\n\nIt's a UX audit that plays like a detective thriller.\n\n> ⏳ **Worth the wait — please run a real scan.** The whole experience is the *live* case:\n> drop your own screenshot and watch the Inspector work it end-to-end — scan → verdict →\n> **The Trial** → **FLUX reconstruction** → the **voice** → the **Most Wanted** board. To keep\n> this **solo, self-funded** project affordable, the GPU backends (vision · FLUX · voice) run on\n> **Modal scale-to-zero**, so the **first** scan after an idle spell takes **~1–2 min** to wake\n> them — **the app is NOT broken**, it's just warming up; once warm, each scan is **~20–30s**.\n> *In a hurry?* A **cold case** or the **Precinct Archive** load pre-rendered verdicts instantly —\n> but those are only a preview. The real magic is a live investigation, and it's worth the minute. 
🕵️\n\n---\n\n## 🧑‍⚖️ TL;DR for judges\n\n- **Track:** 🍄 **Thousand Token Wood** — the AI *is* the detective; remove the model and there's no case.\n- **What it does:** drop any UI screenshot → a film-noir detective investigates it, **circles each UX flaw on the real pixels**, names the charge, suggests a fix, and files a verdict + letter grade. Every result gets a unique shareable case file.\n- **Small-model bet:** runs on **`Qwen2.5-VL-7B`** (8.3B — well under the 32B cap), pushed above its weight with a multi-step agent + ~4 MP vision + tiling.\n- **Measured, not vibes:** a senior UX designer graded every charge from 16 live pages — **84% of circles land on the exact named element, 92% of charges are real design issues** (N=38, evidence public). Full methodology + failure analysis: [EVAL.md](EVAL.md).\n- **Then it goes further — powers no other entry has:**\n  - ⚖️ **The Trial — two small models argue one case.** The verdict opens a courtroom: a **separate NVIDIA Nemotron model** steps in as **THE PROSECUTION** and presses the filed charges, while the guilty UI elements **take the stand and defend themselves** (Qwen again) and the Inspector rules from the grounded evidence. Press a move (On the Stand · Cross-Examine · Confess · The Verdict) or speak freely. *Qwen sees and defends · Nemotron prosecutes · the Inspector judges* — agentic, not a script.\n  - 🖼️ **The Reconstruction** — one click rebuilds the worst exhibit *fixed*, rendered live by **FLUX.2 Klein** (Black Forest Labs). Before/after, on the real element.\n  - 🔊 **The Inspector's Voice** — hear the verdict read aloud by an 82M-param **Kokoro** voice running locally. No API, no keys.\n  - 🚨 **Most Wanted** — a public, multiplayer **rogues' gallery**: opt-in to book your case onto a shared board where the city's worst interfaces are ranked by their crimes. 
*Booked by the public.* Seeded with real archive cases, with a live Inspector's city report.\n- **Why the badges:**\n  - 🎨 **Off-Brand** — a fully bespoke noir world (cinematic alley intro, evidence desk, live laptop investigation, typewritten case file, a *Precinct Archive* of famous booked sites). Nothing looks like default Gradio.\n  - 🤖 **Best Agent** — a real visual agent: **sweep → zoom into each suspect → verify/clear → file**, then **answers follow-up interrogation** — all under 32B.\n  - 🎬 **Best Demo** — cinematic trailer + social post + polished app.\n  - 🟢 **Best Use of Modal** — four GPU backends (vision, FLUX, voice, prosecutor) all run on Modal.\n  - 🟩 **NVIDIA Nemotron** — `Nemotron-Mini-4B` is THE PROSECUTION: a distinct small model arguing the case against the interface, not a generic text call.\n- **Try it in one click:** open the desk → *\"grab a cold case\"* (HuggingFace / NYTimes) or *\"browse the precinct archive\"* — no screenshot needed.\n- **Links:** [▶️ trailer](https://youtu.be/6u58YIEPrkA) · [📹 full walkthrough](https://youtu.be/WyQbY0XJ_9E) · [📱 social post](https://x.com/p36649/status/2066277845567930447) · [📓 Field Notes article](https://huggingface.co/blog/kasbsquall/ux-crime-scene) · [📡 traces dataset](https://huggingface.co/datasets/build-small-hackathon/ux-crime-scene-traces)\n- **Author:** [@kasbsquall](https://hu"
  },
  {
    "repo_id": "build-small-hackathon/eyas",
    "title": "Eyas",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/eyas",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "eyas",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 7,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Eyas\nemoji: 🦅\ncolorFrom: blue\ncolorTo: yellow\npinned: true\nsdk: gradio\nsdk_version: 5.38.0\npython_version: \"3.12\"\napp_file: eyas/app.py\nlicense: mit\nshort_description: AI Security Camera Agent\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:openai\n  - sponsor:cohere\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n<p align=\"center\">\n  <img src=\"docs/assets/eyas_logo_wide.png\" alt=\"Eyas\" width=\"600\" />\n</p>\n\n# Eyas: AI Security Camera Agent\n\n| | Name | HuggingFace |\n|---|---|---|\n| | Javier Huang | [@JavRedstone](https://huggingface.co/JavRedstone) |\n| | Hanhee Lee | [@hanheelee](https://huggingface.co/hanheelee) |\n| | Joe Lee | [@sehyunlee217](https://huggingface.co/sehyunlee217) |\n\n**[HuggingFace Space](https://huggingface.co/spaces/build-small-hackathon/eyas)** · **[GitHub](https://github.com/JavRedstone/eyas)** · **[Demo Video](https://www.youtube.com/watch?v=x9h7nMv_KeQ)** · **[Field Notes](https://huggingface.co/blog/build-small-hackathon/eyas)** · **[Social Media](https://www.linkedin.com/feed/update/urn:li:activity:7472122729828364288/)**\n\nEyas is an on-device security camera agent built for our teammate's family's convenience store. 
It runs person tracking, event detection, and LLM reasoning over CCTV footage to surface theft, loitering, and suspicious activity as a structured, searchable log.\n\n---\n\n## What it does\n\n- **Visual pipeline** — person tracking → VLM observation → event structuring → LLM reasoning\n- **Event Timeline** — scatter chart + table; click any event to seek the annotated video\n- **Summary & Alerts** — risk gauge, flag breakdown, and per-camera narrative\n- **Ask Footage** — natural-language Q&A over the event log via the on-device LLM\n- **Audio Report** — spoken security brief via VoxCPM2 TTS\n- **Multi-camera** — queue multiple clips, get a unified cross-camera session summary\n- **Korean** — full UI and pipeline output translation, hot-swap without restart\n\n## Architecture\n\n<p align=\"center\">\n  <img src=\"docs/assets/eyas-architecture-diagram.png\" alt=\"Eyas architecture diagram\" width=\"900\" />\n</p>\n\n## Models\n\n| Model | Role | Size |\n|---|---|---|\n| [YOLO11n](https://github.com/ultralytics/ultralytics) | Person detector + BotSORT tracker | ~6 MB |\n| [MiniCPM-V 4.6](https://huggingface.co/openbmb/MiniCPM-V-4.6) | Vision-language observer | ~1.3B params |\n| [Nemotron 3 Nano 4B](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF) | LLM reasoner (GGUF Q4) | ~2.5 GB |\n| [TinyAya Global](https://huggingface.co/CohereLabs/tiny-aya-global-GGUF) | Korean translation (GGUF Q4) | ~0.5 GB |\n| [VoxCPM2](https://huggingface.co/openbmb/VoxCPM2) | Text-to-speech | ~2.4B params |\n\nAll models download automatically on first run. 
No API keys required.\n\n---\n\n## Docs\n\n**Guides**\n\n| Document | Contents |\n|---|---|\n| [Setup & Development](docs/guides/SETUP.md) | Quick start, local dev, Docker, HF Spaces deploy |\n| [AI Theft Detection](docs/guides/AI_THEFT_DETECTION.md) | Capabilities, limits, and best practices |\n| [Codex Contributions](docs/project/CODEX.md) | Agent-assisted commits, reasoning traces |\n\n**Architecture**\n\n| Document | Contents |\n|---|---|\n| [Architecture](docs/architecture/ARCHITECTURE.md) | Pipeline diagram, component breakdown, event schema |\n| [Off-Brand Frontend](docs/architecture/OFF_BRAND.md) | Why and how the UI is a React SPA instead of Gradio components |\n\n**Models**\n\n| Document | Contents |\n|---|---|\n| [YOLO11n](docs/models/yolo11n.md) | Person detection + BotSORT tracking |\n| [MiniCPM-V 4.6](docs/models/minicpm-v.md) | Vision-language observer (VLM) |\n| [Nemotron 3 Nano 4B](docs/models/nemotron-nano.md) | LLM reasoner — summary, Q&A, alerts |\n| [TinyAya Global](docs/models/tinyaya.md) | Korean translation |\n| [VoxCPM2](docs/models/voxcpm2.md) | Text-to-speech audio report |\n\n**Project**\n\n| Document | Contents |\n|---|---|\n| [Field Notes](docs/project/FIELD_NOTES.md) | Build log — design decisions, lessons from each stage, store field test |\n| [Submission](docs/project/SUBMISSION.md) | Hackathon checklist and what we built |\n| [Hackathon](docs/project/HACKATHON.md) | Track info and award categories |\n\nLive space: [build-small-hackathon/eyas](https://huggingface.co/spaces/build-small-hackathon/eyas)\n\n<p align=\"center\">\n  <img src=\"docs/assets/build-small-hackathon-checklist.png\" alt=\"Build Small Hackathon checklist\" width=\"700\" />\n</p>\n"
  },
  {
    "repo_id": "build-small-hackathon/kids-education-ai",
    "title": "Kids Education Ai",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/kids-education-ai",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "achievement",
      "kids",
      "education",
      "summary",
      "available"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Test it with a beginner question and review the answer.",
    "likes": 7,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: kids-education-ai\nsdk: gradio\nsdk_version: 5.34.0\npython_version: \"3.11\"\napp_file: app.py\npinned: false\n\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n  \n---\n\n\n# 🎓 Kids Education AI\n\nAn AI-powered educational platform that generates children's stories, illustrations, audio narration, educational poems, and visual learning experiences using state-of-the-art Small Language Models.\n\n---\n\n# 🌟 Overview\n\nKids Education AI transforms a simple topic into a complete interactive learning experience.\n\n---\n\n## 🏆 Build Small Hackathon Submission\n\n Space:\nhttps://huggingface.co/spaces/build-small-hackathon/kids-education-ai\n\n Demo Video:\nhttps://youtu.be/dP0XM33YxuM\n\n Social Post:\nhttps://x.com/gajanand2004/status/2066546382920687976\n\n Article:\nhttps://huggingface.co/blog/build-small-hackathon/kids-education-ai-small-models\n\n---\n## 👥 Team\n\n| Name | Hugging Face Username |\n|------|----------------------|\n| Gajanan Deshmukh | Gaju19 |\n\n\n## Architecture\n\n![kids-education-ai](./architecture.png)\n\n\n# 🤖 Models Used\n\n| Task | Model | Parameters | Purpose |\n|--------|--------|------------|---------|\n| Story Generation | NVIDIA Nemotron 3 Nano 4B GGUF | 4B | Children's story generation |\n| Image Generation | FLUX.2-klein-base-4B | 4B | Scene illustration generation |\n| Audio Narration | NVIDIA Magpie TTS Multilingual 357M | 357M | Natural speech generation |\n| OCR & Vision | MiniCPM-V 4.6 | Multimodal | OCR and visual understanding |\n\n---\n\n# 🧠 Why These Models?\n\n| Model | Why Selected |\n|---------|--------------|\n| NVIDIA Nemotron 3 Nano 4B | Efficient small language model with strong storytelling capabilities |\n| FLUX.2 Klein 4B | High-quality image generation with low resource requirements |\n| NVIDIA Magpie TTS | Fast multilingual speech synthesis |\n| MiniCPM-V 4.6 | 
Strong OCR, image understanding, and document analysis |\n\n---\n\n\n\n## Additional Learning Modes\n\n| Feature | Description |\n|----------|-------------|\n| 📖 Story Generation | Create age-appropriate stories |\n| 🎨 Image Generation | Generate scene illustrations |\n| 🔊 Audio Narration | Convert stories into speech |\n| ✍️ Educational Poems | Theme-based learning poems |\n| 👁️ OCR & Vision | Understand images and documents |\n| 📄 PDF Question Answering | Ask questions from PDFs |\n| 📚 Story Book Builder | Combine story, images, and audio |\n\n---\n## Learning Pipeline\n\n```text\nAge + Topic\n      ↓\nStory Generation\n      ↓\nScene Extraction\n      ↓\nImage Generation\n      ↓\nAudio Narration\n      ↓\nInteractive Story Book\n```\n\n# 📈 Monitoring\n\n| Component | Monitoring |\n|------------|------------|\n| Hugging Face Space | Build & Runtime Logs |\n| ZeroGPU | GPU Usage Tracking |\n| Modal Backend APIs | Endpoint Health Monitoring |\n| Story Generation | Output Validation |\n| Image Generation | Quality Monitoring |\n| Audio Narration | Audio Response Validation |\n| OCR & RAG | Retrieval Accuracy Monitoring |\n| Storybook Creation | Generation Success Tracking |\n\n\n# ⚙️ Technical Architecture\n\n| Layer | Technology |\n|---------|------------|\n| Frontend | Gradio |\n| Backend | Python |\n| Model Serving | Modal |\n| Story Generation | NVIDIA Nemotron |\n| Image Generation | FLUX |\n| Audio Generation | NVIDIA Magpie TTS |\n| OCR & Vision | MiniCPM-V |\n| Deployment | Hugging Face Spaces |\n| Runtime | Llama.cpp |\n| Model Hosting | Hugging Face Hub |\n\n---\n\n# 🔄 End-to-End Workflow\n\n| Step | Process |\n|------|----------|\n| 1 | User enters Age and Topic |\n| 2 | Nemotron generates story |\n| 3 | Scene extraction |\n| 4 | FLUX generates illustrations |\n| 5 | Magpie TTS creates narration |\n| 6 | Storybook creation |\n| 7 | Results displayed to user |\n\n---\n\n# 📂 Project Structure\n\n```text\nkids-education-ai/\n│\n├── app.py\n├── 
requirements.txt\n├── architecture.png\n└── README.md\n```\n\n---\n\n# 💻 Installation\n\n## Create Virtual Environment\n\n```bash\npython -m venv venv\n```\n\n## Activate Environment\n\n### Windows\n\n```bash\nvenv\\Scripts\\activate\n```\n\n### Linux / Mac\n\n```bash\nsource venv/bin/activate\n```\n\n## Install Dependencies\n\n```bash\npip install -r requirements.txt\n```\n\n---\n\n# ☁️ Modal Setup\n\nInstall Modal:\n\n```bash\npip install modal\n```\n\nAuthenticate Modal:\n\n```bash\nmodal setup\n```\n\nDeploy Backend:\n\n```bash\nmodal deploy backend/app.py\n```\n\n---\n\n# 🚀 Run Application\n\n```bash\npython frontend/app.py\n```\n\nApplication URL:\n\n```text\nhttp://127.0.0.1:7860\n```\n\n---\n\n# 🌐 Deployment\n\n| Component | Platform |\n|------------|-----------|\n| Frontend | Hugging Face Spaces |\n| Backend APIs | Modal |\n| Models | Hugging Face Hub |\n| Runtime | ZeroGPU |\n\n---\n\n# 🎯 Target Users\n\n- Children (3–12 Years)\n- Parents\n- Teachers\n- Schools\n- Educational Content Creators\n\n---\n\n# 🏆 Hackathon Highlights\n\n- Small Language Models\n- Multi-Modal AI\n- Story Generation\n- Image Generation\n- Text-to-Speech\n- OCR & Vision\n- Educational Learning Platform\n- Hugging Face Space Deployment\n\n---\n\n# 📊 Tech Stack\n\n- Python\n- Gradio\n- Modal\n- Hugging Face\n- Tran"
  },
  {
    "repo_id": "build-small-hackathon/TinyNarrator",
    "title": "Tinynarrator",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/TinyNarrator",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "achievement",
      "tinynarrator",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 7,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: TinyNarrator\nemoji: 🏆\ncolorFrom: gray\ncolorTo: yellow\nsdk: docker\npinned: false\nlicense: mit\nshort_description: A small-model accessibility screen reader.\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\nPublic GitHub repo: [https://github.com/cvpfus/tiny-narrator](https://github.com/cvpfus/tiny-narrator)\n\nDemo video: [https://youtu.be/EYXhb9kKBzA](https://youtu.be/EYXhb9kKBzA)\n\nSocial post: [https://x.com/cvpfus_id/status/2066666318527713469](https://x.com/cvpfus_id/status/2066666318527713469)\n\nSpaces (and blog post): [https://huggingface.co/spaces/build-small-hackathon/TinyNarrator](https://huggingface.co/spaces/build-small-hackathon/TinyNarrator)\n\n# Tiny Narrator\n\nTiny Narrator is a Build Small Hackathon prototype: a custom Gradio Server app with two routes: a guided screen-reader article and a small-model article generator.\n\nFor the hackathon form, see [SUBMISSION.md](SUBMISSION.md).\n\n## Award Strategy\n\n- **Tiny Titan:** every planned model is at or below 4B parameters.\n- **Llama Champion:** the reader-brain layer calls a GGUF model through `llama.cpp`.\n- **Off-Brand:** the visible UI is custom HTML, CSS, and JavaScript served by `gr.Server`.\n- **Field Notes:** the repo documents model sizes, runtime choices, fallbacks, and accessibility decisions.\n\n## Recommended Models\n\n| Role | Model | Params | Runtime |\n| --- | ---: | ---: | --- |\n| Reader brain | `nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF` | 3.97B | `llama.cpp` |\n| Image understanding | `openbmb/MiniCPM-V-4.6` | 1B | OpenAI-compatible chat completions |\n| Text to speech | `hexgrad/Kokoro-82M` | 82M | Python |\n| Image generation | `black-forest-labs/FLUX.2-klein-4B` | 4B | Modal-hosted Klein |\n\n## Run Locally\n\nInstall dependencies:\n\n```powershell\npython -m pip install -r 
requirements.txt\n```\n\nCopy the example environment file to `.env` and fill in the values. The app loads `.env` automatically at startup, so no shell exports are required. Process environment variables still override `.env` values.\n\n```powershell\nCopy-Item .env.example .env\n```\n\nStart the llama.cpp reader-brain server locally:\n\n```powershell\nllama-server -hf nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF:Q4_K_M --alias narrator-brain --port 8080 --host 0.0.0.0 --ctx-size 4096 --parallel 1 --reasoning off --n-gpu-layers 999\n```\n\nStart the app:\n\n```powershell\npython app.py\n```\n\nOpen the local URL printed by Gradio. The Reader route calls `/api/reader-brain`, `/api/image-descriptions`, `/api/describe-image`, `/api/speak`, and `/api/model-budget`. The Generate route calls `/api/generate-article`, which drafts an article with the reader-brain model path and attaches a Klein thumbnail receipt.\n\n## Modal Reader Brain\n\nFor a free CPU Hugging Face Space, host the llama.cpp reader-brain server on Modal and point the app at its OpenAI-compatible `/v1` endpoint:\n\n```powershell\nmodal secret create tiny-narrator-reader-brain-token LLAMA_CPP_TOKEN=your-random-token\nmodal deploy modal_workers/reader_brain.py\n```\n\nSet `LLAMA_CPP_BASE_URL` to the deployed Modal URL with `/v1` appended, for example:\n\n```env\nLLAMA_CPP_BASE_URL=https://your-workspace--tiny-narrator-reader-brain.modal.run/v1\nLLAMA_CPP_MODEL=narrator-brain\nLLAMA_CPP_TOKEN=your-random-token\n```\n\nThe Modal worker starts Nemotron on T4 with `--ctx-size 4096`, `--parallel 1`, `--reasoning off`, full GPU offload, and `--api-key` when `LLAMA_CPP_TOKEN` is configured. It uses the prebuilt `ghcr.io/ggml-org/llama.cpp:server-cuda12` image instead of compiling llama.cpp during deploy, clears that image's entrypoint so Modal can start its Python runner, and allows up to 10 minutes for the first GGUF download/load. 
It scales down when idle, so the first request after a cold start can be slower.\n\n## Modal Klein Image Generation\n\nThe image generation path uses a Modal-hosted `black-forest-labs/FLUX.2-klein-4B` worker. Deploy the worker:\n\n```powershell\nmodal secret create tiny-narrator-klein-token KLEIN_MODAL_TOKEN=your-random-token\nmodal deploy modal_workers/klein_image.py\n```\n\nSet `KLEIN_MODAL_ENDPOINT` to the deployed worker URL. Set the same `KLEIN_MODAL_TOKEN` value in your local `.env`. The Modal worker uses the fixed `tiny-narrator-klein-token` secret name so Modal's local and remote dependency graphs stay identical.\n\n`/api/runtime-status` reports whether the Modal Klein worker is online or fallback-ready. `/api/runtime-setup` includes the deploy command and required environment variables.\n\n## MiniCPM-V-4.6 Image Descriptions\n\nThe image description path uses `openbmb/MiniCPM-V-4.6` through an OpenAI-compatible `/v1/chat/completions` endpoint. Provide `MINICPM_VISION_BASE_URL` and `MINICPM_VISION_API_KEY` to enable live screen-reader alt text for article images.\n\nWhen the endpoint is not configured, unreachable, or returns invalid content, `/api/describe-image` and `/api/image-descriptions` fall ba"
  },
  {
    "repo_id": "build-small-hackathon/hackathon-advisor",
    "title": "Hackathon Advisor",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/hackathon-advisor",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "advisor",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 11,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Hackathon Advisor\nemoji: \"📜\"\ncolorFrom: yellow\ncolorTo: green\nsdk: gradio\nsdk_version: 6.16.0\npython_version: \"3.11\"\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: Originality advisor for small-model project ideas.\ntags:\n  - gradio\n  - build-small-hackathon\n  - small-models\n  - agent\n  - originality\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n  - tiny-titan\n  - best-demo\n  - best-agent\n  - bonus-quest-champion\nmodels:\n  - openbmb/MiniCPM5-1B\n  - build-small-hackathon/hackathon-advisor-minicpm5-lora\n  - build-small-hackathon/hackathon-advisor-quest-minicpm5-lora\n  - ggml-org/embeddinggemma-300m-qat-q8_0-GGUF\n  - nvidia/nemotron-speech-streaming-en-0.6b\ndatasets:\n  - build-small-hackathon/hackathon-advisor-quest-dataset\n  - build-small-hackathon/hackathon-advisor-codex-traces\n---\n\n# Hackathon Advisor\n\n**Hackathon Advisor** is a live map of the Build Small Hackathon and a small-model originality coach for builders. It\nopens on an atlas of public `build-small-hackathon` Spaces, then lets a builder search the field, inspect project\nclusters, see quest evidence, and open **The Unwritten Almanac** to evaluate an idea against the work already on the\ntrail.\n\nThe [Build Small Hackathon](https://huggingface.co/build-small-hackathon) asks participants to build under a 32B\nparameter cap, solve a concrete problem for someone nearby or make a delightful AI-native experience, and submit a Space,\ndemo video, and social post. 
Hackathon Advisor treats that setting as the data surface: every public Space becomes part\nof a continuously refreshed project atlas, and every advisor response is grounded in that shared map.\n\n## Demo\n\n- Live app: <https://build-small-hackathon-hackathon-advisor.hf.space>\n- Hugging Face Space: <https://huggingface.co/spaces/build-small-hackathon/hackathon-advisor>\n- Source code (GitHub): <https://github.com/JacobLinCool/hackathon-advisor>\n- Demo video: <https://youtu.be/Gq-FUiL-ZPw>\n- Social post: <https://x.com/JacobLinCool/status/2066156056724848965>\n- Team HF username: [`JacobLinCool`](https://huggingface.co/JacobLinCool)\n- Submission notes: [`docs/submission-notes.md`](docs/submission-notes.md)\n- Start at the Idea Map, search for a theme, click nearby projects, hover quest badges for evidence, and open the\n  advisor when you are ready to test an idea.\n\n## What This Establishes\n\nBuilders enter a fast-moving hackathon with limited context. A promising idea can already be crowded, a quiet niche can\nbe hard to see, and prize alignment can be scattered across READMEs, tags, and app files. Hackathon Advisor turns the\nfield itself into the starting point. The app shows where projects cluster, which submissions sit near each other, which\nquests they appear to satisfy, and where a new idea may still have room to breathe.\n\nThe project also exists because the dataset is already there. A public hackathon organization is a living corpus of\nSpaces, READMEs, model declarations, app files, tags, and demos. Once that corpus is indexed, builders can explore what\nothers are making, which methods they are using, and which results are emerging across the field.\n\nThat visibility gives an online hackathon some of the creative force of OpenAI's Parameter Golf challenge: the event\nbecomes a shared surface for ideas to interact while people are still building. 
Participants can find adjacent work,\nrecognize overlap, borrow useful patterns, extend ideas into new domains, and connect with others working on similar\nproblems. The result is a faster open-source feedback loop, where good ideas become easier to find, improve, and build\non together.\n\nThe atlas is the default experience because the map is the evidence. The advisor is available behind `Open advisor`,\nwhere it uses the same project snapshot to cite overlap, propose whitespace, score the idea, draft a build plan, and\nexport the session evidence.\n\n## Hackathon Submission\n\nHackathon Advisor is submitted primarily for the **Thousand Token Wood** track. The product is useful to builders, but\nits core form is an AI-native field guide: a living map, an almanac voice, quest evidence, and shareable artifacts that\nmake the hackathon field itself explorable.\n\nThe demo video is part of that submission evidence. It is built from real app footage of the atlas and advisor flows. Codex helped draft the storyboard, drive the app, capture the screen, generate voice-over, compose the cut, and verify frames and ASR transcripts against the intended narration.\n\nThe Space is also targeting the official sponsor and achievement tags shown in the README front matter:\n\n- `sponsor:openbmb`: MiniCPM5-1B is the central planner and quest-classifier base model.\n- `sponsor:openai`: Codex served as the engineering partner across the build. It helped translate the hackathon\n  requiremen"
  },
  {
    "repo_id": "build-small-hackathon/aether-garden",
    "title": "Aether Garden",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/aether-garden",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "modal",
      "achievement",
      "aether",
      "garden",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 40,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Aether Garden\nemoji: 🌙\ncolorFrom: gray\ncolorTo: purple\nsdk: gradio\nsdk_version: 5.50.0\napp_file: app.py\npinned: true\nlicense: mit\ntags:\n  - gradio\n  - agents\n  - simulation\n  - three.js\n  - persistent-world\n  - hackathon\n  - thousand-token-wood\n  - best-agent\n  - off-brand\n  - modal\n  - codex\n  - minicpm\n  - tiny-model\n  - tiny-titan\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n<div align=\"center\">\n  <img src=\"assets/banner.png\" width=\"100%\" alt=\"Aether Garden\" />\n\n  # Aether Garden\n\n  **The Living Tome — a persistent AI world where every visitor writes a new page.**\n\n  *Made with 🌙 by [Kavya](https://huggingface.co/kavyabhand) & [Aadi](https://huggingface.co/aadi-joshi)*\n\n  [![Gradio](https://img.shields.io/badge/Gradio-5.x-F97316?style=flat-square)](https://gradio.app)\n  [![Hugging Face](https://img.shields.io/badge/Hugging_Face-Spaces-FFD21E?style=flat-square&logo=huggingface&logoColor=black)](https://huggingface.co/spaces/build-small-hackathon/aether-garden)\n  [![Modal](https://img.shields.io/badge/Modal-A10G-7C3AED?style=flat-square)](https://modal.com)\n  [![MiniCPM](https://img.shields.io/badge/MiniCPM3-4B-06B6D4?style=flat-square)](https://huggingface.co/openbmb/MiniCPM3-4B)\n  [![Thousand Token Wood](https://img.shields.io/badge/Hackathon-Thousand_Token_Wood-C8A040?style=flat-square)](https://huggingface.co/blog/gradio-llm-hackathon)\n  [![Demo Video](https://img.shields.io/badge/Demo-YouTube-FF0000?style=flat-square&logo=youtube&logoColor=white)](https://youtu.be/M1_UxPpYe54)\n  [![Social Post](https://img.shields.io/badge/Social-Reddit-FF4500?style=flat-square&logo=reddit&logoColor=white)](https://www.reddit.com/r/playmygame/comments/1u3b1rq/i_built_a_grimoire_where_strangers_leave_souls/)\n  [![Read the 
Article](https://img.shields.io/badge/Read-Field_Notes-FFD21E?style=flat-square&logo=huggingface&logoColor=black)](https://huggingface.co/blog/build-small-hackathon/aether-garden-article)\n</div>\n\n---\n\n## Submission Snapshot\n\n| | |\n|---|---|\n| **Live Space** | [build-small-hackathon/aether-garden](https://huggingface.co/spaces/build-small-hackathon/aether-garden) |\n| **Demo video** | [youtu.be/M1_UxPpYe54](https://youtu.be/M1_UxPpYe54) |\n| **Social post** | [r/playmygame — \"I built a grimoire where strangers leave souls\"](https://www.reddit.com/r/playmygame/comments/1u3b1rq/i_built_a_grimoire_where_strangers_leave_souls/) |\n| **Field Notes article** | [Aether Garden — Field Notes (HF Blog)](https://huggingface.co/blog/build-small-hackathon/aether-garden-article) |\n| **Track** | Thousand Token Wood — *AI as the simulation engine, not a static generator* |\n| **Core model (this branch)** | `openbmb/MiniCPM3-4B` (4B params) via Modal |\n| **Portrait model** | `stabilityai/sdxl-turbo` |\n| **Agent behavior** | Autonomous hourly simulation tick — reasoning *and* action, with memory updates, with no user input |\n| **Persistent state** | SQLite (WAL) + Hugging Face Dataset backup/restore |\n| **Made by** | [Kavya Bhand](https://huggingface.co/kavyabhand) & [Aadi Joshi](https://huggingface.co/aadi-joshi) |\n\n---\n\n## The Idea\n\nMost AI demos have amnesia. You type a prompt, get a character sheet, close\nthe tab — and the output dies with the session. There's no world, no\nhistory, no consequence.\n\n**Aether Garden** is the opposite. It's a persistent fantasy realm, built as\na leather-bound grimoire you have to *open* before you're let in — no\ndashboard, no tab that says \"Chat,\" you turn pages.\n\nWhen you arrive, the world is already days old. Strangers have already built\nit. A Joke Dragon became Mayor while you were asleep. The Book of Ages\nremembers all of it. 
You summon one strange thing —\n*\"a blind cartographer who maps places that don't exist yet\"* — and hours\nlater, without you watching, the simulation runs. The cartographer meets\nsomeone else's Crystal Tree. History writes itself.\n\nRemove the model and the Realm is just a SQLite file full of vibes. **The\nAI is load-bearing by definition** — which is exactly what this track asked\nfor.\n\n---\n\n## What It Does\n\nThe app is a single Gradio Space styled as an illuminated manuscript. Instead\nof dashboard tabs, visitors turn pages through six spreads:\n\n1. **The Realm** — an SVG map of eight sacred places, a live chronicle, and a \"Realm Pulse\" ticker\n2. **Explore the Realm** — step *into* a location via a Three.js diorama window\n3. **Book of Ages** — every arrival, meeting, world event, and milestone, in styled prose\n4. **Souls of the Garden** — every summoned soul, with backstory, goals, fears, and an evolving memory\n5. **Bonds and Alliances** — a 3D web of relationships the simulation forged on its own\n6. **Summon a New Soul** — describe something strange; the Oracle gives it a name, a face, a home, and eventually enemies\n\n### The Summon Pipeline\n\n```\nVisitor description (5–200 words)\n        |\nMiniCPM3-4B on Modal (A10G)        name, appearance, backstory, goals, fears,\n         "
  },
  {
    "repo_id": "build-small-hackathon/maindlock",
    "title": "Maindlock",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/maindlock",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "maindlock",
      "summary",
      "available",
      "yet.",
      "docker",
      "thousand-token-wood"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 6,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: mAIndlock\nemoji: 🧠\ncolorFrom: gray\ncolorTo: green\nsdk: docker\napp_port: 7860\npinned: true\nlicense: mit\nshort_description: Escape room where every NPC is a mortal mind of tiny LLMs\nmodels:\n- openbmb/MiniCPM5-1B-GGUF\n- openbmb/VoxCPM2\n- nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF\ntags:\n- thousand-token-wood\n- off-the-grid\n- off-brand\n- tiny-titan\n- llama-champion\n- llama-cpp\n- openbmb\n- minicpm\n- nemotron\n- agents\n- game\n- neuroscience\n---\n\n<p align=\"center\">\n  <img src=\"media/banner.gif\" alt=\"mAIndlock\" width=\"820\">\n</p>\n\n# 🧠 mAIndlock\n\n**An escape room where the lock is a mind — and you can kill it.**\n\n▶ **[Watch the 100-second demo](https://youtu.be/zEwnVR1kTZU)**  ·  🎮 **[Play it live](https://huggingface.co/spaces/build-small-hackathon/maindlock)**  ·  🧠 **[Deliberation traces](https://huggingface.co/datasets/arbios/maindlock-brain-traces)**\n\nEvery character is not one chatbot with a personality prompt. It is a **hierarchy of six tiny\noffline language models**, the way decision neuroscience says a brain is built. You don't crack\na code. You change a person's decision by reaching their fears and the memories they keep — or\nyou burn their mind down trying.\n\nCruelty makes a mind ruminate, and rumination spends its finite **thinking tokens**. As they\nburn, it **forgets** — for good. At zero, the mind goes dark, taking everything it knew with it.\nThere is no reload. Only the next room, and your reputation travelling ahead of you.\n\n> **Tokens are a lifespan here, not a context window.**\n\n---\n\n### ⚖️ TL;DR for judges\n\n- **Track — Thousand Token Wood.** \"A thousand tokens to think with\" is literal: every mind\n  starts with **1000 thinking tokens of life**. Spend them cruelly and it dies.\n- **OpenBMB ($10k).** **Six of the ~seven model calls per NPC turn are MiniCPM** — the four\n  sensory brain regions (threat, memory, habit, cost), each a separate 1B call. 
MiniCPM *is* the\n  brain. (more under fear: the amygdala ruminates, firing extra calls that burn life) The\n  story's key-handover lines and the voiced demo are rendered with **OpenBMB VoxCPM2** TTS — a\n  second model from the family, fully offline.\n- **NVIDIA.** **Nemotron 3 Nano 4B** is the voice you actually argue with — the dlPFC that turns\n  the integrated value into words.\n- **Off the Grid / Llama Champion.** Pure **llama.cpp**, zero cloud APIs. Flip on airplane mode;\n  every mind keeps thinking. Token counts come from the runtime, so the life-burn is honest\n  accounting.\n- **Off-Brand.** A custom canvas game served from FastAPI, with a Gradio block mounted at\n  [`/about`](/about) — not a stock Gradio UI.\n- **No CPU patience required:** menu → **👁 Watch a mind** replays a *real* recorded session\n  instantly (zero model calls), so the six-region cascade, the token burn and a mind's death\n  are visible in ten seconds.\n\nThe brain is real, not a metaphor: [**docs/ARCHITECTURE.md**](https://github.com/Arbios/maindlock/blob/main/docs/ARCHITECTURE.md)\n· read one mind's full deliberation in [**docs/TRACE.md**](https://github.com/Arbios/maindlock/blob/main/docs/TRACE.md).\n\n---\n\n## Every NPC is a brain, not a chatbot\n\n<p align=\"center\">\n  <img src=\"media/skull-conviction.png\" alt=\"open the skull: six regions argue, each with a conviction read from its logits\" width=\"760\">\n</p>\n\nYour words travel through six regions — **amygdala → hippocampus → striatum → ACC → vmPFC →\ndlPFC** — each one a real call to a small local model. The amygdala rates threat; the\nhippocampus surfaces a memory and whether it leans trust or fear; the striatum weighs habit; the\nACC weighs cost. The **vmPFC integrates them deterministically into one value** — so the number\nyou see in the skull *is* the number that moves the relationship; the panel can never lie about\nthe outcome. 
The dlPFC speaks it in character.\n\n**Open the skull** mid-conversation (🧠) and watch the regions argue about you in real time —\neach showing its **conviction**, read straight from the model's own token entropy. *A hosted\nchat API never exposes that. Only a local mind can.*\n\nGrounding is modern decision neuroscience — the **value-based network** and the **dual-system**\n(model-free habit vs. model-based goal) accounts — deliberately **not** the debunked triune\n\"lizard brain.\" Acute stress shifts control from goal to habit (Schwabe & Wolf, 2009): that is\nliterally why lowering a character's fear unlocks their reasoning, and why fear burns life for\nnothing.\n\n## A life measured in words\n\n<p align=\"center\">\n  <img src=\"media/epitaph.png\" alt=\"the epitaph a dead mind leaves: its burned memories, the secrets it never told\" width=\"320\">\n</p>\n\nBe cruel and the amygdala loops, spending tokens that move nothing. Each quarter of life lost\n**burns a memory away** — the hippocampus genuinely loses it, and the *Forgotten* panel shows\nwhat's gone. Push far enough and the mind dies, leaving a savable **epitaph**: what it knew,\nwhat it never got to tell. Be kind, keep the alarm quiet, and the mind spends almost nothing —\nempathy liter"
  },
  {
    "repo_id": "build-small-hackathon/aerosphere",
    "title": "Aerosphere",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/aerosphere",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "aerosphere",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 8,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Aerosphere\nemoji: 🌎\ncolorFrom: green\ncolorTo: purple\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: A 3D planetary ecosystem driven by an 8B model.\ntags:\n- track:wood\n- sponsor:nvidia\n- sponsor:modal\n- achievement:offgrid\n- achievement:offbrand\n- achievement:sharing\n- achievement:fieldnotes\n---\n\n# AeroSphere\n**A local LLM-driven planetary physics engine. AeroSphere translates stateless natural language generation into continuous, real-time WebGL/Three.js state machine transitions.**\n\nSubmission for the Build Small Hackathon · Chapter Two · An Adventure in Thousand Token Wood.\n\n[Live demo](https://huggingface.co/spaces/build-small-hackathon/aerosphere) · [Public GitHub repo](https://github.com/rAdvirtua/aerosphere) · [HF Space repo](https://huggingface.co/spaces/build-small-hackathon/aerosphere/tree/main)\n\n**Presentations:** [Social Media Post](https://www.linkedin.com/posts/itsanurag-paul_buildsmallhackathon-huggingface-threejs-ugcPost-7472002881706110976-WWNb/?utm_source=social_share_send&utm_medium=member_desktop_web&rcm=ACoAAE5X43sBOTHkuI1k1vjlYTqaFWY3FOJjsDg) · [Tech Demo Video](https://youtu.be/q_wNTPyrij4?si=TGCrrUaCCVzXtG1j) · [Field Notes Blog Post](https://huggingface.co/blog/build-small-hackathon/aerosphere-blog)\n\n---\n\n## Architecture Overview\nAeroSphere challenges the traditional `text-in, text-out` paradigm of LLMs. It utilizes an 8-Billion parameter inference loop natively as the core logical backend required to interpolate rendering parameters for a live 3D environment.\n\n\n<div align=\"center\">\n  <img src=\"https://media.discordapp.net/attachments/903256683709616199/1515989157619564564/architecturediagram.png?ex=6a31026c&is=6a2fb0ec&hm=07b8ad968b24932e90f47dfba7a8a37ea789a69648a494078a345e5514710ccf&=&format=webp&quality=lossless&width=531&height=924\" alt=\"Architecture Diagram\" />\n</div>\n\n\n### 1. 
Physics Inference Engine \n*   **Model:** `nvidia/Mistral-NeMo-Minitron-8B-Instruct`\n*   **Local Execution:** Runs locally via the `transformers` pipeline. On Hugging Face Spaces, it leverages the `@spaces.GPU` ZeroGPU binding to keep operations 100% off-the-grid without routing to external vendor APIs.\n*   **State Constraint:** To prevent the renderer from crashing on hallucinated math variables, the Python backend binds the Mistral-8B payload strictly using `Pydantic` schemas. The LLM is forced to extract normalized `PlanetStateDelta` floats (e.g., `lava_intensity: 0.85`).\n\n### 2. State Sync Pipeline\n1.  **Context Construction:** The system aggregates user prompts alongside a rolling computational buffer of the planet's previous chronological iterations.\n2.  **LLM Inference:** Mistral-8B predicts the geological and atmospheric consequences zero-shot.\n3.  **JSON Payload:** The backend resolves a `JSON` configuration block housing exact physical constants.\n4.  **Shader Bridging:** A client-side listener injects this payload synchronously into the running DOM.\n\n### 3. Rendering Engine (Three.js WebGL)\nThe environment calculates visual states procedurally on the native device GPU using **Three.js**:\n\n*   **Procedural Generation**: The planet mesh ignores static image textures entirely, computing environments using complex custom **GLSL Fragment Shaders**.\n*   **Fractal Brownian Motion (FBM)**: Noise equations are manipulated by the JSON floats to dynamically shift continents, freeze oceans, and form storms. The terrain complexity is governed by:\n    <br>`FBM(p) = Σ (aᵢ × noise(fᵢ × p))`\n    <br>where `aᵢ₊₁ = aᵢ × gain` and `fᵢ₊₁ = fᵢ × lacunarity`. 
The Mistral backend directly modulates these phase thresholds.\n*   **Latency Masking (LERP)**: Inference latency is masked locally by a continuous linear interpolation running on the `requestAnimationFrame` loop:\n    <br>`V_current = V_current + α × (V_target - V_current)`\n    <br>This forces a smooth, unbroken visual transition across the 3D surface while the backend asynchronously processes the next token state.\n\n---\n\n## UI/UX: The Gradio \"CSS Heist\"\nAeroSphere implements a massive DOM override to convert standard Gradio columns into an immersive cinematic Single Page Application (SPA).\n* **Canvas Injection:** Uses `gr.HTML(bg_html)` to inject the raw WebGL instances permanently into the background layout level, preventing React hydration logic from wiping the active render context.\n* **Layout Decoupling:** `style.css` (~1,400 lines) forcibly disables Gradio's internal `gap` grids and replaces the interface with fixed, absolute-positioned glassmorphic overlay elements mimicking game UI layers.\n* **Mobile Viewport Optimization:** Converts scaling calculations to strict `100dvh` units coupled with native `env(safe-area-inset-bottom)` rules. This prevents catastrophic UI jittering caused by native iOS/Android address bars collapsing on touch input.\n\n---\n\n## Hackathon Tracks & Merit Badges\n- **An Adventure in Thousand Token Wood (Main Track)** — Built explicitly for "
  },
  {
    "repo_id": "build-small-hackathon/retro",
    "title": "Retro",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/retro",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "retro",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 6,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Retro Alpha\nemoji: 📺\ncolorFrom: green\ncolorTo: blue\nsdk: docker\napp_port: 7860\npinned: false\ntags:\n- track:wood\n- sponsor:nvidia\n- sponsor:modal\n- achievement:offgrid\n- achievement:welltuned\n- achievement:offbrand\n- achievement:llama\n- achievement:sharing\n- achievement:fieldnotes\nlicense: mit\n---\n\n# Retro Alpha\n\n**A 90s CRT-style Indian stock market survival game (1994–2004), powered by a fine-tuned NVIDIA Nemotron-3-Nano 4B model.**\n\nBuilt for the [🤗 Build Small Hackathon](https://huggingface.co/build-small-hackathon).\n\n---\n\n## 🔗 Links\n\n| | |\n|---|---|\n| 🎮 **Play (main)** | [build-small-hackathon/retro](https://huggingface.co/spaces/build-small-hackathon/retro) |\n| 🎮 **Play (alpha)** | [sankalphs/retro-alpha](https://huggingface.co/spaces/sankalphs/retro-alpha) |\n| 📝 **Build Blog** | [sankalphs.blogspot.com/2026/06/retro-alpha.html](https://sankalphs.blogspot.com/2026/06/retro-alpha.html) |\n| 🐙 **GitHub** | [github.com/sankalphs/retro](https://github.com/sankalphs/retro) |\n| ▶️ **Video Demo** | [youtu.be/OLJDXhos0Iw](https://youtu.be/OLJDXhos0Iw) |\n| 🐦 **Social Post** | [x.com/sankalphs/status/2066602928350359694](https://x.com/sankalphs/status/2066602928350359694) |\n\n---\n\n## 🎮 How to Play\n\n| Step | Action |\n|------|--------|\n| **Goal** | Turn ₹10,00,000 into ₹20,00,000 over 10 years (120 months) |\n| **Review** | Check the Market Watch for asset prices & trends |\n| **Trade** | Buy/Sell any asset as a % of your portfolio using the Order Pad |\n| **Advance** | Press Advance Month to trigger real historical events & market moves |\n| **Analyze** | Ask the AI Advisor about your portfolio or strategy |\n| **Review** | Get a Year-End Mentor Review for a sarcastic roast & investment lesson |\n\n### Historical Events\nAsian Financial Crisis, Pokhran-II nuclear tests, Dot-com bubble, 9/11, 2004 Indian elections, and more — all influencing asset prices based on real historical data.\n\n---\n\n## 
🏆 Badges Earned\n\n| Badge | How |\n|-------|-----|\n| **Off-Brand** | Custom CRT terminal UI built from scratch (no Gradio default) |\n| **Well-Tuned** | Fine-tuned Nemotron-3-Nano 4B on 1,500+ synthetic market scenarios |\n| **Nemotron** | Uses fine-tuned NVIDIA Nemotron-3-Nano-4B (Q4_K_M GGUF) |\n| **Off the Grid** | Fully self-contained Docker Space with on-device inference |\n| **Sharing is Caring** | Infrastructure-as-code scripts open-sourced on GitHub |\n| **Field Notes** | Detailed build log & methodology documented |\n\n---\n\n## 🧱 Tech Stack\n\n```\nFrontend   → Custom CRT terminal UI (vanilla HTML/CSS/JS) served via ASGI\nBackend    → Python simulation engine + Gradio API\nModel      → Fine-tuned NVIDIA Nemotron-3-Nano 4B (Q4_K_M GGUF)\nInference  → Modal GPU cloud endpoint (A10G) with deterministic fallbacks\nData       → 1,500+ synthetic Indian market scenarios via zenmux API\nCI/CD      → GitHub Actions → HF Spaces auto-deploy\n```\n\n---\n\n## 🚀 Running Locally\n\n```bash\npip install -r requirements.txt\nMOCK_LLM=1 python app.py\n```\n\nFor LLM-powered features, set one of:\n- `MODAL_INFERENCE_URL` — Modal cloud endpoint\n- `HF_API_URL` + `HF_TOKEN` — Hugging Face Inference API\n\n---\n\n## 📁 Project Structure\n\n```\n├── app.py              # Gradio app entrypoint (ASGI)\n├── agents.py           # LLM inference wrapper\n├── engine.py           # Market simulation engine\n├── events.py           # Historical event triggers\n├── mentor.py           # AI mentor review generator\n├── modal_app.py        # Modal GPU inference endpoint\n├── download_model.py   # GGUF model downloader\n├── Dockerfile          # HF Space container\n├── requirements.txt    # Runtime dependencies\n├── requirements-train.txt  # Training dependencies\n├── config/\n│   └── assets.json     # Asset definitions\n├── static/             # Frontend (CSS, JS, HTML)\n├── schemas/            # JSON schemas for dataset validation\n├── data/               # Training datasets\n├── 
scripts/            # Dataset generation & validation\n├── training/           # Modal LoRA fine-tuning scripts\n└── tests/              # Test suite\n```\n\n---\n\n## 🧠 Model\n\nThe game uses a LoRA fine-tune of **NVIDIA Nemotron-3-Nano-4B** on a custom dataset of 1,500+ Indian market scenarios covering:\n\n- **Agent decisions** (730 examples) — institutional, retail, and tech-permabull personas\n- **News impacts** (281 examples) — historical event market reactions\n- **Mentor reviews** (255 examples) — year-end portfolio roasts with Sharpe ratios\n- **Guardrails** (180 examples) — safety and formatting guidelines\n\nFine-tuned on Modal A100 40GB → exported as GGUF Q4_K_M for efficient inference.\n\n---\n\n## 📄 License\n\nMIT"
  },
  {
    "repo_id": "build-small-hackathon/mind-of-tashi",
    "title": "Mind Of Tashi",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/mind-of-tashi",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "modal",
      "openbmb",
      "mind",
      "tashi"
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Try the app and decide whether it feels beginner-friendly.",
    "likes": 7,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: The Mind of Tashi\nemoji: 🌫️\ncolorFrom: indigo\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.15.0\napp_file: app.py\npinned: true\nlicense: mit\nhf_oauth: true\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - off-the-grid\n  - llama-champion\n  - off-brand\n  - well-tuned\n  - tiny-titan\n  - sharing-is-caring\n  - field-notes\n  - zerogpu\n  - modal\n  - openbmb\n  - minicpm\n  - nemotron\n  - reasoning-game\n  - bilingual\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\nmodels:\n  - build-small-hackathon/mind-of-tashi-micro-sft\n  - build-small-hackathon/mind-of-tashi-micro-sft-gguf\n  - build-small-hackathon/mind-of-tashi-micro-grpo\n  - build-small-hackathon/mind-of-tashi-micro-grpo-gguf\n  - build-small-hackathon/mind-of-tashi-mini-sft\n  - openbmb/MiniCPM5-1B-GGUF\n  - nvidia/Nemotron-Mini-4B-Instruct\ndatasets:\n  - build-small-hackathon/mind-of-tashi-selfplay\n---\n\n# The Mind of Tashi — a duel of foresight\n\n> Built for the **Build Small Hackathon · Track Two: An Adventure in Thousand\n> Token Wood**.\n\n**▶️ [Play the Space](https://huggingface.co/spaces/build-small-hackathon/mind-of-tashi) ·\n🎬 [Watch the demo](https://www.linkedin.com/posts/kshitij-thakkar-2061b924_buildsmallhackathon-thousandtokenwood-smallmodels-activity-7470170147283017729-UfCR) (on LinkedIn) ·\n📦 [The bundle](https://huggingface.co/collections/build-small-hackathon/the-mind-of-tashi-6a27107214f1265b159ade35) ·\n💻 [Code](https://github.com/Mandark-droid/mind-of-tashi)**\n\nYou climb a gauntlet of ninja-monks in the Village Hidden in the Mist, high in\nthe Himalayas. Every round, you and your opponent **secretly commit one move at\nthe same time** — there is no reacting, only *reading*. 
The opponent is a small\nreasoning model, and the thing on centre stage is its **mind**: after each\ncommit, the scroll reveals how it just read you (\"you've drawn breath twice\nunpunished — you're greedy, so I strike\"). The whole game is out-thinking\nsomething that narrates how it's out-thinking you.\n\n## Why the AI is load-bearing\n\nStrip the model and this is rock-paper-scissors. The model's *prediction of\nyour next move* is the entire mechanic — and it commits **blind**, seeing only\nthe history, never your current move. The signature move, **Mist-Step**, only\nrewards you if your opponent attacks this turn, so it is a pure bet on a read.\nThat recursion (\"I think you'll strike, so I Mist-Step / I think you think\nthat, so I draw breath\") is what a reasoning model is uniquely good at.\n\nAnd because the opponent is a **small local model run through llama.cpp**, the\ngame can read its *uncertainty*: per-token entropy drives a **Conviction\nMeter**, your reads raise its sampling temperature (\"crack her composure\"), and\na prāṇa-spent **Oath** drops a move from its decode grammar so it literally\ncannot choose it. A cloud API hides the logits; a giant model can't stream them\non a CPU Space — this is what the small-model class uniquely makes possible.\n\n## The model — David vs Goliath\n\nThe opponent's mind is a custom MoE (~0.4B total / **~200M active per token**),\nSFT'd (and GRPO-trained) to read an opponent and commit blind in an English +\nHindi/Sanskrit (IAST) code-switched register — 10–100× smaller (active) than\nthe frontier APIs it learned from. It ships as a Q4_K_M GGUF and runs in-Space\nvia llama.cpp: **no cloud API at runtime.**\n\n## The bundle\n\nThis Space is the centrepiece of one linked\n[collection](https://huggingface.co/collections/build-small-hackathon/the-mind-of-tashi-6a27107214f1265b159ade35):\n\n1. **Game / Space** — you are here.\n2. 
**Self-play dataset** — [`mind-of-tashi-selfplay`](https://huggingface.co/datasets/build-small-hackathon/mind-of-tashi-selfplay)\n3. **SFT model + GGUF** — [`mind-of-tashi-micro-sft`](https://huggingface.co/build-small-hackathon/mind-of-tashi-micro-sft) · [`-sft-gguf`](https://huggingface.co/build-small-hackathon/mind-of-tashi-micro-sft-gguf)\n4. **OpenEnv gym** — [`mind-of-tashi-env`](https://huggingface.co/spaces/build-small-hackathon/mind-of-tashi-env)\n5. **GRPO model + GGUF** — [`mind-of-tashi-micro-grpo`](https://huggingface.co/build-small-hackathon/mind-of-tashi-micro-grpo) · [`-grpo-gguf`](https://huggingface.co/build-small-hackathon/mind-of-tashi-micro-grpo-gguf)\n6. **Mini student (1B MoE) + LoRA** — [`mind-of-tashi-mini-sft`](https://huggingface.co/build-small-hackathon/mind-of-tashi-mini-sft) · [`-lora`](https://huggingface.co/build-small-hackathon/mind-of-tashi-mini-sft-lora) — the bigger sibling, playable in self-play.\n7. **Live gameplay traces** — [`mind-of-tashi-live-traces`](https://huggingface.co/datasets/build-small-hackathon/mind-of-tashi-live-traces) — real matches, sealed + pushed from this Space as they finish.\n8. **Leaderboard runs** — [`mind-of-tashi-runs`](https://huggingface.co/datasets/build-small-hackathon/mind-of-tashi-runs)\n\n## The numbers\n\n**Fo"
  },
  {
    "repo_id": "build-small-hackathon/jungle-story-time",
    "title": "Jungle Story Time",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/jungle-story-time",
    "zone": "Creative Studio",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "jungle",
      "story",
      "time",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Casual",
    "quest": "🎨 Creative Studio: Find one surprising or delightful result.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎨",
    "readme_text": "---\ntitle: Jungle Story Time\nemoji: 🦁\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: \"6.15.2\"\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Personalized kids' stories with AI voice narration\n---\n\n# Jungle Story Time — Kids Story Studio 🦁📖\n\n> **3 taps and GO.** Pick a friend, pick a place, pick story-or-poem — a fine-tuned 1B model writes a gentle, personalized tale for your child, an illustration is painted for it, and it's read aloud in a sweet voice (or your own family's voice).\n\nBuilt for the **Build Small Hackathon (June 2026)**.\n\n---\n\n## What it does\n\nA bedtime-story machine for children aged 2–5. The child (or parent) makes a few taps:\n\n| Step | Choice | Options |\n|------|--------|---------|\n| 1 | **Friend** | Simba 🦁 · Tiger 🐯 · Panda 🐼 · Bhalu 🐻 · Parrot 🦜 · Elephant 🐘 · Bunny 🐰 · Duck 🦆 · or type any animal |\n| 2 | **Place** | Home 🏠 · Jungle 🌳 · Pond 🌊 · Mango Tree 🥭 · Night Sky 🌙 |\n| 3 | **Type** | Story or Poem |\n| 4 | **Voice** | Sunny 🌞 · Koyal 🐦 · Dadu 🌙 · Robo (instant) 🤖 · or 🎙️ your family's voice |\n\nThe app picks a surprise lesson (sharing, patience, counting, colors, animal sounds…), writes the story, streams it back word-by-word into an animated storybook poster, paints a matching watercolor illustration, and reads it aloud. It can also stitch a **narrated story \"movie\"** (one illustration + one narration clip per scene, joined with a Ken-Burns pan).\n\n---\n\n## Architecture\n\n![Jungle Story Time architecture](./architecture.png)\n\nA child makes a few taps in the **Gradio UI** (friend, place, story type, voice). 
The request goes to the **Modal serverless platform**, which runs three scale-to-zero services:\n\n- **Story Agent** — the fine-tuned **MiniCPM5-1B** GGUF, served on CPU via `llama.cpp`.\n- **Voice Agent** — **VoxCPM2** for narration, with designed voices and zero-shot family-voice cloning.\n- **Illustrator** — **FLUX.2-klein-4B** (4-bit GGUF) paints one storybook picture per tale.\n\nA fast session cache reuses identical prompts so repeats are instant. The same fine-tuned story model also runs **fully offline** on a laptop CPU via `llama-cpp-python` + the Q4_K_M GGUF (`LOCAL_MODE=1`) — no cloud, no API, the child's name never leaves the machine.\n\n---\n\n## Models\n\nThree models power the app — one fine-tuned, two strong off-the-shelf bases:\n\n| Service | Model | Fine-tuned? | Notes |\n|---------|-------|:---:|-------|\n| **Story** | `ThePradip/minicpm5-1b-kids-storyteller-GGUF` (1B, Q4_K_M, llama.cpp, CPU) | ✅ Unsloth LoRA on `openbmb/MiniCPM5-1B` (A10G) | published with full-precision sibling `ThePradip/minicpm5-1b-kids-storyteller` |\n| **Voice** | `openbmb/VoxCPM2` (2B) | ❌ off-the-shelf | designed voices + zero-shot family-voice cloning, no training needed |\n| **Image** | `unsloth/FLUX.2-klein-4B-GGUF` (4B, Q4_K_M, diffusers) | ❌ off-the-shelf | kid-safe by prompt construction, few-step distilled |\n\nThe fine-tune wired into production is the **story model**; the voice and image models are used as-is.\n\n### Why a 1B story model?\n\nA 2–5 year old needs **fast** (no 30s waits), **private** (it hears your child's name), **cheap** (runs on the family laptop), and **stylistically reliable** (tiny sentences, sound words, a refrain repeated 3×). Style is exactly what small-model fine-tuning is good at. 
`MiniCPM5-1B` is a plain `LlamaForCausalLM`, so the whole Unsloth toolchain works with zero patches, and the Q4_K_M GGUF runs at **~60 tokens/sec on an M-series MacBook CPU**.\n\n---\n\n## Dataset\n\nThe story model is trained on **hand-authored** stories — not scraped, not bulk-generated by a frontier API.\n\n| Dataset | Size | Use |\n|---------|------|-----|\n| Hand-authored (`build-small-hackathon/kids-story`) | 129 plots → 258 examples (name variants) | **SFT for the 1B story model** — 12 categories, style-validated |\n\n**Categories (12):** animals · birds · colors · shapes · surroundings · family · friends · environment · morals · speech practice · early learning · rhymes\n\n**Style contract** — machine-enforced by `finetune/author_kit.py` before publishing:\n- 30–170 words · average sentence ≤ 9 words\n- a refrain repeated exactly 3×\n- at least one sound word (quack, splash, whoosh…)\n- a visualizability score — concrete picture-words a toddler can see; abstract stories rejected\n- nothing scary · clean ending · the child's name appears\n\n---\n\n## Fine-tuning pipeline\n\n```\nfinetune/\n├── author_kit.py          ← validate + push the hand-authored dataset\n├── generate_dataset.py    ← build the personalized SFT set\n├── modal_finetune.py      ← MiniCPM5-1B LoRA training job\n├── eval_base_vs_tuned.py  ← before/after quality comparison\n└── data/                  ← authored stories (markdown blocks)\n```\n\n### Training jobs\n\nEach job below maps to a function in `finetune/modal_finetune.py` (GPU and method\nas declared there):\n\n| Job | GPU | Method |\n|-----|-----|--------|\n| Dataset prep | CPU (Modal) | author-kit validation |\n| MiniCPM5-1B LoRA | A10G | Unsloth |\n| Serving endpoints (story · "
  },
  {
    "repo_id": "build-small-hackathon/ai-puzzle-maker",
    "title": "Ai Puzzle Maker",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/ai-puzzle-maker",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "achievement",
      "puzzle",
      "maker",
      "summary",
      "available"
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Play one round and note what made it fun.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: AI Puzzle Maker\nemoji: 🧩\ncolorFrom: purple\ncolorTo: pink\nsdk: gradio\nsdk_version: 6.10.0\napp_file: app.py\npinned: false\nmodels:\n  - black-forest-labs/FLUX.2-klein-9B\n  - openbmb/MiniCPM4.1-8B\n  - nvidia/Nemotron-3.5-Content-Safety\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# 🧩 AI Puzzle Maker\n\nDescribe **anything** → **FLUX.2-klein-9B** paints it → the client cuts the picture into a **real jigsaw** (bezier tabs, scattered pieces, drag / rotate / snap) → race the clock.\nA MiniCPM-voiced mascot hops around the board commentating — live hints, roasts, victory lines. Built for the **Build Small Hackathon** (Gradio × Hugging Face).\n\nThe **picture** is AI-generated and different every time; the **jigsaw** is a fixed, hand-built HTML5 canvas engine (cutting, piece-to-piece snapping, rotation, timing) that works for *any* image — so any prompt becomes a fair, satisfying puzzle.\n\n## Demo & links\n\n[![AI Puzzle Maker — watch the demo](puzzle-demo-thumb.png)](https://youtu.be/kSjLr6dC6jI)\n\n- ▶️ **[Demo video (YouTube)](https://youtu.be/kSjLr6dC6jI)** — type a subject, watch it paint, then race the clock as a real jigsaw snaps together.\n- 🐦 **[Launch post (X)](https://x.com/kumar_munish_/status/2066064558796747022)**\n- 📝 **[How it's built — write-up](https://huggingface.co/blog/build-small-hackathon/ai-puzzle-maker-blog)**\n\n## What it does\n\n- **⏱ Timed challenges** — the clock starts on your first grab; a 💡 hint flashes the right slot and costs **+15s**; your fastest solve lands on each puzzle's **solve-time leaderboard**.\n- **🌍 Shareable puzzles** — publish your creation to a **community gallery** (anyone can play it and race your time), with puzzle codes and **❤ likes**.\n- **🎚 Difficulty & rotation** — **12 / 24 / 48 / 96** pieces, plus an optional **rotation mode** where pieces scatter at 
random angles (`R` / right-click / double-tap to turn them).\n\n## Play\n\n| Input | Action |\n|---|---|\n| drag a piece | **Grab** & move; release to **snap** to the board *or* to a neighbouring piece — pieces stick to *each other*, not just the board |\n| `R` / right-click / double-tap | **Rotate** the held piece *(rotation mode only)* |\n| 💡 `+15s` | **Hint** — flashes the correct slot, adds 15s to your clock |\n| 👁 / `P` | Toggle the ghost picture **preview** on the board |\n| 🔊 / `M` | Mute |\n| 🏠 | Back to the menu |\n\nSnap streaks make the mascot lose its mind.\n\n## How it's built\n\nOne Space, a few small models behind one route.\n\n| Piece | What runs it |\n|---|---|\n| **Puzzle artwork + mascot** | **FLUX.2-klein-9B** (Black Forest Labs), distilled, **in this Space on ZeroGPU** (4 steps, guidance 1.0) — one GPU attach renders **both** the 1024×768 artwork and the 512×512 mascot sprite. Locally (no GPU/torch) `flux_local.AVAILABLE=False` and the game falls back to bundled art. |\n| **Theme pack + live commentary** | **MiniCPM4.1-8B** (free hackathon API) writes the per-subject pack and the live hints/roasts. A thinking model — the app strips `<think>` blocks. Without `MINICPM_URL` / `MINICPM_KEY`, commentary falls back to canned lines. |\n| **Content safety** | **NVIDIA Nemotron-3.5-Content-Safety** (Gemma-3 4B, public) on a **[Modal](https://modal.com)** GPU, called as a tiny token-protected API. See [Content safety](#content-safety). |\n| **Game engine** | Hand-built HTML5 `<canvas>`, vanilla JS, **zero frameworks** (`static/puzzle.js`). |\n| **Gallery + leaderboards** | A Hugging Face Bucket mounted at `/data` — **one immutable file per record** (race-free, no database, no locking) for scores, packs, and likes. |\n\n### The ZeroGPU postMessage bridge\n\nThe game lives in an iframe and **can't attach ZeroGPU's visitor token to raw `fetch()` calls** — those arrive identity-less and get **zero** GPU quota. 
So generation runs as a **real Gradio event on the outer page** (whose native client carries the token), and the iframe drives it over a `postMessage` bridge (message types prefixed `ajp-`, e.g. `ajp-ping` / `ajp-gen`). `POST /generate` is the fallback path.\n\n### Backend routes (mounted on Gradio's FastAPI server, same-origin)\n\n| Route | Purpose |\n|---|---|\n| `POST /quip` | MiniCPM commentary — live hints, roasts, victory lines |\n| `POST /generate` | safety gate → MiniCPM theme pack → in-Space FLUX art *(also exposed as a Gradio API `generate_puzzle` so ZeroGPU quota attributes correctly)* |\n| `POST /publish` | share a puzzle to the community gallery |\n| `GET /puzzles` | gallery listing |\n| `GET /puzzle/{id}` | full pack |\n| `POST /like` | like a puzzle |\n| `GET /times` | per-puzzle solve-time leaderboard |\n| `POST /times` | submit a solve |\n\n## Project layout\n\n| File | Role |\n|---|---|\n| `app.py` | Gradio app + FastAPI routes (Space entrypoint) + the postMessage bridge JS |\n| `static/index.html` · `style.css` · `puzzle.js` | The canvas jigsaw game — engine, UI, all menus. `puzzle.js` does bezier-tab cutting, piece-to-piece grou"
  },
  {
    "repo_id": "build-small-hackathon/ouroboros-kernel-mint",
    "title": "Ouroboros Kernel Mint",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/ouroboros-kernel-mint",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "track",
      "ouroboros",
      "kernel",
      "mint"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Test it with a beginner question and review the answer.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: OUROBOROS Kernel Mint\nemoji: 🪙\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: A 1B model writes GPU kernels. A referee checks them.\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - sponsor:openai\n  - achievement:offbrand\n  - achievement:welltuned\n  - achievement:sharing\n  - achievement:fieldnotes\n  - achievement:offgrid\n  - achievement:llama\n  - tiny-titan\n  - best-agent\n  - minicpm\n  - triton\n  - gpu-kernels\n  - reinforcement-learning\n  - self-distillation\n---\n\n# 🪙 OUROBOROS Kernel Mint\n\nBuild a GPU operation out of blocks. A 1-billion-parameter model writes a real Triton kernel\nfor it. Then a referee that can't be talked into anything decides whether the kernel is real,\nand times it against PyTorch's own compiler. Beat the compiler and you land on the leaderboard.\n\nThe point of the whole thing is that the green tick is earned. Nothing here is self-reported.\nEvery kernel gets compiled, checked against PyTorch on nasty inputs, and benchmarked before it\ncounts.\n\n## How to play\n\n1. **Build.** Snap blocks together (a normalization block, an optional residual, and one\n   activation), or just pick a classic like softmax or swiglu. The numbers flowing through the\n   machine update as you go, so you can see what each block does.\n2. **Mint.** A fine-tuned MiniCPM5-1B writes a fused Triton kernel for your machine. It drafts a\n   few attempts. Each one faces the referee: compile, check correctness against PyTorch, then\n   time it against PyTorch eager, `torch.compile`, and `torch.compile` max-autotune. Attempts\n   that fail are shown, not hidden. Two ways to run it: **Local (offline)** does the whole loop\n   inside this Space with no cloud calls, and **Pro** mints with the 27B on Modal. Both modes use\n   the same three-baseline comparison.\n3. 
**Beat the compiler.** If your kernel is correct and faster than `torch.compile`\n   max-autotune, it goes on the board. The crowns up top were minted by the 27B model. Switch\n   on Pro mode to mint with that bigger model yourself.\n\nThe first mint of a session takes about 90 seconds while the model wakes up (the backend\nscales to zero when nobody's using it). After that it's a few seconds. There's a \"watch a real\nmint\" button that replays a recorded, verified run instantly, so you can see the whole loop\nwithout waiting.\n\n## Why bother\n\nA GPU kernel is the small program that runs one step of a neural network on the graphics card.\nFusing several steps into one kernel cuts trips to memory, and that's where a lot of real\ninference speed comes from. Writing them well is expert work, which is why PyTorch ships a\ncompiler to do it for you.\n\nThe bet behind this project: the part that's actually scarce isn't the big model, it's a\nverifier the model can't fool. Give a small model a referee where correctness is a yes/no and\nspeed is a measurement, let it learn from its own verified wins, and it gets good. The referee\nthat scores your kernel in this Space is the same one that trained the models.\n\n## The numbers (all checked by the harness, none typed in by hand)\n\n- The larger Qwen3.6-27B run produced 76 verified compiler-beating kernels on H200. 69 of\n  them held up across 5 fresh re-benchmark runs (mean of means 1.30x, range 1.11x to\n  2.04x across this reproducible set; a single live mint on the board can read a little\n  higher). The other 7 are single-shot probes on problems the model had never trained on.\n- On a 376-cell grid of shapes and dtypes, the trained kernels keep a 1.49x geomean against\n  max-autotune recompiled per cell. 
About 10% of cells are losses, and those are listed per\n  cell rather than swept under the rug.\n- They also beat hand-written expert kernels (Liger, Unsloth, the Triton tutorial) on swiglu,\n  rmsnorm, relu2 and geglu. softmax and layernorm come out as ties within noise.\n- The referee defends itself. A 30-case self-test passes good kernels, rejects subtly wrong\n  ones, and blocks three specific ways of gaming the benchmark. It's green on a 4090 and an\n  H200.\n\nTo be clear about what these are: reproducible scheduling wins on memory-bound fusion ops\nagainst the compiler's autotuner. They are not wins over cuBLAS or FlashAttention, and they're\nnot new algorithms.\n\n## How it's built\n\n- **Models.** OpenBMB's MiniCPM5-1B is the default smith (it really is 1B). Qwen3.6-27B is Pro\n  mode. Both were fine-tuned with the same loop: supervised training on verified kernels, then\n  RL where the only reward is the referee's verdict. No human labels anywhere.\n- **Modal** does both the training and the serving. The 27B was trained on Modal H200s (the RL\n  run peaks around 110 GB of VRAM), and the live backend runs on Modal with scale-to-zero. The\n  interactive Modal backend re-benchmarks the selected verified kernel against PyTorch eager,\n  `torch.compile`, and `torch.compile` max-autotune before returning a result.\n- **Local (offline) mode.** Flip the to"
  },
  {
    "repo_id": "build-small-hackathon/lost-frequency-radio",
    "title": "Lost Frequency Radio",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/lost-frequency-radio",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "openbmb",
      "lost",
      "frequency",
      "radio",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Lost Frequency Radio\nemoji: 📻\ncolorFrom: yellow\ncolorTo: green\nsdk: gradio\nsdk_version: \"6.17.3\"\napp_file: app.py\npinned: true\nlicense: apache-2.0\nshort_description: An old radio that picks up parallel universes, live\nmodels:\n  - openbmb/MiniCPM5-1B\n  - build-small-hackathon/MiniCPM5-1B-lost-frequency-radio-GGUF\ndatasets:\n  - build-small-hackathon/lost-frequency-radio-transmissions\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - llama-cpp\n  - minicpm\n  - openbmb\n  - off-the-grid\n  - off-brand\n  - well-tuned\n  - tiny-titan\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# 📻 Lost Frequency Radio\n\n> An old radio that picks up broadcasts from parallel universes, written live by a **1-billion-parameter** model running **100% on CPU, with no cloud APIs anywhere in the loop.** Turn the dial. The voices are real-time. 
The wifi can be off.\n\n<p align=\"center\">\n  <a href=\"https://huggingface.co/spaces/build-small-hackathon/lost-frequency-radio\"><b>▶ Live demo</b></a> &nbsp;·&nbsp;\n  <a href=\"https://www.youtube.com/watch?v=MIv956xWMzc\">🎬 Demo video</a> &nbsp;·&nbsp;\n  <a href=\"https://x.com/MarianaCodebase/status/2065857307444199484\">📣 Social post</a> &nbsp;·&nbsp;\n  <a href=\"https://huggingface.co/build-small-hackathon/MiniCPM5-1B-lost-frequency-radio-GGUF\">🧠 Fine-tuned model</a> &nbsp;·&nbsp;\n  <a href=\"https://huggingface.co/datasets/build-small-hackathon/lost-frequency-radio-transmissions\">📚 Dataset</a> &nbsp;·&nbsp;\n  <a href=\"https://huggingface.co/blog/build-small-hackathon/lost-frequency-radio\">📓 Field Notes write-up</a> &nbsp;·&nbsp;\n  <a href=\"https://huggingface.co/datasets/build-small-hackathon/lost-frequency-radio-agent-trace\">🛰️ Agent trace</a>\n</p>\n\n<p align=\"center\">\n  <img src=\"static/img/demo.gif\" alt=\"Lost Frequency Radio: tuning the dial through parallel-universe broadcasts\" width=\"640\" />\n</p>\n\n## Hi, I'm Mariana, and I want to tell you about this radio\n\nIt's 2 a.m. and you turn the dial. Static, static, then a 1950s announcer\ncalling a chess match between cats. Keep turning: a weather report for Jupiter,\na commercial for renting clouds, a chef serving a dish named after you. Around\n104.7 the warmth disappears and a flat voice starts reading numbers in Morse.\nBreak its cipher and the dial does something it shouldn't be able to do. It\ngrows, and every station you found turns out to be a room in the same\ncollapsing universe.\n\nI started this with a small, slightly stubborn belief: that a tiny model, the\nkind you can run on the laptop in front of you, could still make someone feel\nsomething. Most of the AI news lately is about models getting bigger and a\nlittle scarier. I wanted to go the other way and build something that was small,\nlocal, and just plain joyful. No cloud, no API keys, nothing phoning home. 
Just\na 1B model, a weird idea, and an old radio.\n\nI did not expect to fall this hard for the project. Every word you hear was\nwritten by a model living on the machine in front of you, no cloud, no script\nbought off a shelf. And if you flip the **MODE** switch to LIVE, you can watch\nit write a fresh broadcast token by token, right there. That is the part I still\nfind a little magical, and it is the part I most want you to feel when you turn\nthe dial.\n\n---\n\n## 🎖️ Merit badges this entry earns\n\nHere is the honest scorecard. Everything below is verifiable from the links in this README; I have tried not to claim a single thing I cannot point to.\n\n| Badge | What it requires | How Lost Frequency Radio earns it |\n|---|---|---|\n| 🛰️ **Off the Grid** | No cloud APIs; the whole thing runs on the model in front of you | Inference is 100% local through `llama.cpp`. No API keys, no hosted inference, no network calls at generation time. Unplug the ethernet mid-broadcast and nothing changes. See [`model.py`](model.py). |\n| 🎯 **Well-Tuned** | Uses a fine-tuned model you published on Hugging Face | A LoRA fine-tune of MiniCPM5-1B, trained on a hand-built dataset, published as [GGUF on the Hub](https://huggingface.co/build-small-hackathon/MiniCPM5-1B-lost-frequency-radio-GGUF). Training is fully reproducible from this repo. |\n| 🎨 **Off-Brand** | A custom frontend that pushes past the default Gradio look (hint: `gr.Server`) | Built on [`gradio.Server`](app.py): a real FastAPI app under Gradio. A hand-written skeuomorphic CRT radio (wooden bezel, phosphor glow, live oscilloscope, synthesized voices, Morse). **Zero default Gradio UI is visible.** |\n| 🦙 **Llama Champion** | Your model runs through the llama.cpp runtime | The published model is GGUF Q4_K_M, served with `llama-cpp-python`. See [`requirements.txt`](requirements.txt) and [`model.py`](model.py). 
|\n| 📓 **Field Notes** | A blog post / report about what you built and learned | Published: **[Field Notes: I Built a Radio That Picks Up Lost Universes](https://huggingface.co/blog/build-small-hackathon/lost"
  },
  {
    "repo_id": "build-small-hackathon/Rune-Goblin",
    "title": "Rune Goblin",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Rune-Goblin",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "rune",
      "goblin",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 6,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Rune Goblin\ncolorFrom: purple\ncolorTo: gray\nsdk: docker\napp_port: 7860\npinned: false\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# 🪄 Rune Goblin\n\n> Draw bad spells, suffer beautifully.\n\n[![Deploy to Hugging Face Space](https://github.com/ASH1998/Rune-Goblin/actions/workflows/deploy-hf-space.yml/badge.svg?branch=master)](https://github.com/ASH1998/Rune-Goblin/actions/workflows/deploy-hf-space.yml)\n[![Built with Codex](https://img.shields.io/badge/built%20with-Codex-111111)](https://github.com/codex)\n[![Fine-tuned OpenBMB](https://img.shields.io/badge/fine--tuned-OpenBMB%20MiniCPM--V--4.6-blue)](https://huggingface.co/openbmb/MiniCPM-V-4.6)\n[![Public model on Hugging Face](https://img.shields.io/badge/public%20model-ASHu2%2FgoblinV1-yellow)](https://huggingface.co/ASHu2/goblinV1)\n[![Public visual dataset](https://img.shields.io/badge/public%20dataset-rune__goblin__visual__dataset-yellow)](https://huggingface.co/datasets/ASHu2/rune_goblin_visual_dataset)\n[![GGUF on Modal](https://img.shields.io/badge/GGUF%20deployed%20on-Modal-7c3aed)](https://modal.com)\n[![llama.cpp pipeline](https://img.shields.io/badge/pipeline-llama.cpp-orange)](https://github.com/ggml-org/llama.cpp)\n\n> ### 🙏 Credits\n> - 🤖 **Made with [Codex](https://github.com/codex)** — used heavily across game code, dataset generation, docs, fine-tuning workflow, model iteration, frontend design, asset placement, asset creation, and map design. 
I'm not a frontend dev; Codex helped make the custom UI and RPG world feel playable and polished (the `BOSS` and `rune_goblin` magic-circle assets are Codex-generated).\n> - 🧠 **Models by [OpenBMB](https://huggingface.co/openbmb)** — [`openbmb/MiniCPM-V-4.6`](https://huggingface.co/openbmb/MiniCPM-V-4.6) (fine-tuned spell engine + base dialogue/story).\n> - ⚡ **Thanks to [Modal](https://modal.com)** for fine-tuning credits.\n>\n> Full attributions, including game art packs, are in [`CREDITS.md`](./CREDITS.md).\n\n> ### Posts & Videos\n> - **Build Small Hackathon blog**: [Rune Goblin write-up](https://huggingface.co/blog/build-small-hackathon/rune-goblin-blog)\n> - **Social post**: [X / Twitter launch post](https://x.com/ashu_mi_2/status/2066227745743577348)\n> - **Gameplay teaser**: [YouTube](https://www.youtube.com/watch?v=GYkB-XWRe7c)\n> - **Demo**: [YouTube](https://www.youtube.com/watch?v=I1IBbFgokLY)\n\nA tiny dungeon crawler where players draw spells in an invented symbolic\nlanguage (**RuneLang**) and a fine-tuned [`openbmb/MiniCPM-V-4.6`](https://huggingface.co/openbmb/MiniCPM-V-4.6)\nacts as the **spell engine** — reading glyph combinations and emitting JSON\nthat drives attacks, curses and game-state changes. 
Runtime visuals are not\nimage-generated; spell metadata recolors, resizes, retargets and animates\nexisting/procedural game assets.\n\nSee [`rune_goblin_plan.md`](./rune_goblin_plan.md) for the full design doc.\n\n<p align=\"center\">\n  <img src=\"./merged.gif\" alt=\"Rune Goblin gameplay preview\" width=\"900\" />\n</p>\n\n## Architecture\n\n![Rune Goblin architecture](./Rune_Goblin_ARCH.png)\n\nFor the playable MVP the asset planner is a small deterministic function\n(`rune_goblin.vfx`) instead of a second model: it is instant and never fails,\nand its metadata shape mirrors the plan so a real `MiniCPM-V-4.6` planner can be\nswapped in later without touching the renderer.\n\nThe deterministic rule engine (`rune_goblin.engine`) is both the **dataset\noracle** (it generates training targets) and the **runtime fallback** (used\nuntil a fine-tuned adapter exists, so the UIs run before training finishes).\n\nThe **RPG sandbox** (`app/rpg_app.py` + `src/rune_goblin/world.py`) realizes the\nmap-exploration direction: tile-map areas with enemies, NPCs, chests, shrines,\npowerups, locked doors and a boss. The canvas client owns movement/rendering;\n`resolve_world_cast` turns drawn/selected runes + the faced target into a\nvalidated spell **and** a list of world actions (unlock, loot, defeat, heal,\ntravel-gate…). Python stays the spell engine and balance authority.\n\n## Agentic Runtime Loop\n\nRune Goblin treats model output like an **agent suggestion**, not an unchecked\nsave-file mutation. The model receives a compact snapshot of the situation:\nplayer HP, enemy type, room context, selected or drawn runes, known weaknesses,\ninventory hints and recent story state. It proposes structured spell intent\nJSON: detected runes, confidence, ambiguity, spell name, target, effect, status\nchanges, presentation text and visual tags.\n\nThat proposal then goes through the game pipeline:\n\n1. Parse or repair the JSON.\n2. Clamp it to the spell schema.\n3. 
Resolve it against RuneLang grammar and combo rules.\n4. Check enemy weakness / resistance mappings.\n5. Convert the result into an allowlisted set of world actions.\n6. Apply those actions through the deterministic game engine.\n\nTho"
  },
  {
    "repo_id": "build-small-hackathon/ai-picturebook",
    "title": "Ai Picturebook",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/ai-picturebook",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "track",
      "achievement",
      "picturebook",
      "summary",
      "available"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: AI Picture Book\nemoji: 📖\ncolorFrom: yellow\ncolorTo: pink\nsdk: gradio\nsdk_version: 6.10.0\napp_file: app.py\npinned: false\nmodels:\n  - black-forest-labs/FLUX.2-klein-9B\n  - openbmb/MiniCPM4.1-8B\n  - openbmb/MiniCPM-V-4.6\n  - nvidia/Nemotron-3.5-Content-Safety\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# 📖 AI Picture Book\n\nType an idea → **MiniCPM4.1-8B** writes a complete story and splits it into pages → **FLUX.2-klein-9B** paints every page → read it in an **animated, fullscreen flip-book**, **download a PDF**, or **share it to a global library**.\n\nIt's a little book *studio*: pick a **style** (watercolor, fairy tale, manga, comic, sci-fi, spooky, pixel, paper-cut…), **picture-book or comic** layout (1–6 **panels per page**), **colour / B&W / sepia**, the **audience** (toddler → 9-12), page count, and A4 / square page size. Optionally **upload a hero photo** — **MiniCPM-V** describes it and that description (plus the photo as a FLUX reference) keeps your character consistent across every page. 
Built for the **Build Small Hackathon** (Gradio × Hugging Face).\n\nThe **words and pictures** are AI-generated and different every time; the **reader, PDF export and gallery** are a fixed, hand-built front end that works for any book.\n\n  ## Demo & links\n\n  [![AI Picture Book — watch the demo](pb-demo-thumb.png)](https://youtu.be/9TOxWbk_-tk)\n\n  - ▶️  **[Demo video (YouTube)](https://youtu.be/9TOxWbk_-tk)** — three diverse full books (a kids' bedtime picture book, a \"bring your own hero\" story, and an action comic),\n  then publish to the community library.\n  - 🐦 **[Launch post (X)](https://x.com/kumar_munish_/status/2066571014428152180)**\n  - 📝 **[How it's built — write-up](https://huggingface.co/blog/build-small-hackathon/ai-picturebook-blog)**\n\n## What it does\n\n- **🎨 Configurable books** — style preset, colour mode, audience, page count, page size, and a **Picture book** or **Comic** layout with **1 / 2 / 3 / 4 / 6 panels per page**.\n- **🦸 Bring your own hero** — upload a photo; **MiniCPM-V** captions it and the look is locked into every page (and passed to FLUX as an `image=` reference).\n- **✍️ Baked-in text** — FLUX renders the captions / speech bubbles / SFX right in the art (klein does this surprisingly well); a **per-page 🎲 re-roll** fixes the rare wonky line. 
Picture books can instead show a crisp **caption below** the picture.\n- **📚 Animated reader** — open-book page-turn animation, keyboard / swipe nav, and a **fullscreen** mode.\n- **⬇️ PDF export** — one tap builds a print-ready PDF (cover + pages) **in the browser** (jsPDF) — no server round-trip.\n- **🌍 Share globally** — publish to a community **Library** anyone can read, with ❤️ likes, search and new/top sort.\n\n## How it's built\n\nOne Space, a few small models behind a couple of routes.\n\n| Piece | What runs it |\n|---|---|\n| **Story + per-page art prompts** | **MiniCPM4.1-8B** (free hackathon API) — one call writes the whole book: title, blurb, a fixed cast, and each page's scene + dialogue. A thinking model — the app strips `<think>`. See `prompt_pipeline.py`. |\n| **Page artwork** | **FLUX.2-klein-9B** (Black Forest Labs), distilled, **in this Space on ZeroGPU** (4 steps, guidance 1.0) — one GPU attach paints the cover + all pages. An uploaded hero photo rides along as an `image=` reference. Locally (no GPU) `flux_local.AVAILABLE=False` and the reader falls back to soft placeholder art. |\n| **Hero photo → description** | **MiniCPM-V** (vision sibling, same key) captions an uploaded character; the caption is then screened by the content-safety gate like any other text. |\n| **Content safety** | **NVIDIA Nemotron-3.5-Content-Safety** (Gemma-3 4B, public) on a **[Modal](https://modal.com)** GPU, as a tiny token-protected API. See [Content safety](#content-safety). |\n| **Reader / PDF / gallery** | Hand-built HTML + vanilla JS, **zero frameworks** (`static/`), with **jsPDF** vendored for the PDF. |\n| **Library** | A Hugging Face Bucket mounted at `/data` — **one immutable file per record** (race-free, no database) for shared books and likes. |\n\n### The ZeroGPU postMessage bridge\n\nThe reader lives in an iframe and **can't attach ZeroGPU's visitor token to raw `fetch()`** — those calls get **zero** GPU quota. 
So generation runs as a **real Gradio event on the outer page** (whose native client carries the token), and the iframe drives it over a `postMessage` bridge (messages prefixed `sb-`). One relay carries `{action, payload}` for both `generate` and `reroll`. `POST /generate` and `POST /reroll` are same-origin fallbacks.\n\n### Backend routes (mounted on Gradio's FastAPI server, same-origin)\n\n| Route | Purpose |\n|---|---|\n| `POST /generate` | safety gate → MiniCPM story → in-Space FLUX art (whole book) |\n| `POST /reroll` | re-paint a single page from its stored prompt |\n| `POST /publish` | share a book to the community Library |"
  },
  {
    "repo_id": "build-small-hackathon/Indian-Celebrity-Voice-Studio",
    "title": "Indian Celebrity Voice Studio",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Indian-Celebrity-Voice-Studio",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "track",
      "indian",
      "celebrity",
      "voice"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Indian Celebrity Voice Studio\nemoji: 🎬\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 6.15.2\napp_file: app.py\npinned: true\nlicense: other\nshort_description: Speak your script in the voice of 11 Bollywood legends\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# 🎬 Indian Celebrity Voice Studio\n\nGenerate authentic Indian celebrity voices from any script — powered by a\n**VoxCPM2 LoRA** fine-tuned on 11 Bollywood legends.\n\n> 🐦 **On X:** [see the launch post](https://x.com/ThePradip1/status/2066547329948405838)\n> 🏆 Built for the **Small Language Model Hackathon** — VoxCPM2 LoRA fine-tuned on an\n> **A10 GPU on [Modal](https://modal.com) in ~2.5 hours**.\n\n## 🎥 Demo\n\n[![Watch the demo](https://huggingface.co/spaces/build-small-hackathon/Indian-Celebrity-Voice-Studio/resolve/main/demo_thumb.jpg)](https://huggingface.co/spaces/build-small-hackathon/Indian-Celebrity-Voice-Studio/resolve/main/actor_voice.mp4)\n\n▶️ **[Watch the 49-second demo](https://huggingface.co/spaces/build-small-hackathon/Indian-Celebrity-Voice-Studio/resolve/main/actor_voice.mp4)** — narrated by the AI Amitabh voice itself.\n\n- **UI** runs here on Spaces (CPU).\n- **GPU inference** runs on a [Modal](https://modal.com) A10G endpoint (the model is\n  too large for free CPU Spaces). Set the `MODAL_ENDPOINT` Space variable to your\n  deployed `voxcpm_modal_endpoint.py` URL.\n\n## Voices\nAmitabh Bachchan · Irrfan Khan · Aamir Khan · Akshay Kumar · Naseeruddin Shah ·\nOm Puri · John Abraham · Vidya Balan · Preity Zinta · Parineeti Chopra · Freida Pinto\n\n## How it works\n1. Pick a celebrity poster\n2. Write or choose a script\n3. Hit **ACTION!** — the GPU renders the voice\n4. 
Play / download the clip\n\n> **Consent:** trained on the VoxCeleb1 Indian subset (non-commercial research license).\n> Not for commercial voice cloning without the speakers' permission.\n"
  },
  {
    "repo_id": "build-small-hackathon/emoji_studio",
    "title": "Emoji_Studio",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/emoji_studio",
    "zone": "Creative Studio",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "emoji_studio",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Casual",
    "quest": "🎨 Creative Studio: Generate one creative output.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎨",
    "readme_text": "---\ntitle: Emoji Studio\nemoji: 💬\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: 5.33.0\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Create emojis, give them meaning, and chat with them.\ntags:\n  - emoji\n  - image-generation\n  - chatbot\n  - agentic\n  - track:wood\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# 💬 Emoji Studio\n\nHave you ever thought '*there really should be an emoji for this*'? I have. Constantly.\n\nBut here's what makes this more interesting: **LLMs use emojis**. They use them naturally in responses, and they're genuinely expressive when they do, allowing LLMs to express themselves better visually. The catch is that they're locked into whatever emoji vocabulary existed in their training data. They can't use something they've never seen. Furthermore, some emojis are more popular than others, which affects how likely the model is to reach for them.\n \nSo I started wondering: what if you could just... invent new ones and teach an LLM to use them when you talk to it?\n\n**Emoji Studio** is a chat experience that explores new ways for humans and AI to communicate. 
Rather than relying on existing emojis and language, you and the assistant create entirely new emojis together, define their meanings, and build a shared visual vocabulary that evolves throughout your conversations.\n\n## 🔗 Links\n\n🚀 [Try it](https://huggingface.co/spaces/build-small-hackathon/emoji_studio) |\n📓 [Read blog post](https://huggingface.co/blog/build-small-hackathon/emoji-studio) |\n🎬 [Watch demo video](https://huggingface.co/spaces/build-small-hackathon/emoji_studio/resolve/main/demo.mp4) |\n💬 [Social post](https://www.reddit.com/r/huggingface/comments/1u4x5fr/emoji_studio_my_project_for_hf_build_small/)\n\n## 🎬 Demo\n\n<video controls preload=\"metadata\" width=\"100%\">\n  <source src=\"https://huggingface.co/spaces/build-small-hackathon/emoji_studio/resolve/main/demo.mp4\" type=\"video/mp4\">\n</video>\n\n## 🛠️ Models\n\n- **Qwen/Qwen3-8B** — chat and tool-calling to decide when to generate a new emoji\n- **FLUX.1-schnell** — image generation for the emoji itself\n- **Rembg** — background removal for a clean transparent finish\n\nA total of $8B+12B=20B$ parameters. The first two models run via the Hugging Face Inference API.\n\n## 📋 Submission details\n\n- **Track:** Thousand Token Wood 🍄\n- **Bonus quests:** 🎨 Off-Brand · 📓 Field Notes\n- Built with Gradio 🐾 and hosted on Hugging Face 🤗\n\n\n"
  },
  {
    "repo_id": "build-small-hackathon/lolaby",
    "title": "Lolaby",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/lolaby",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "lolaby",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 15,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Lolaby\nemoji: 🌙\ncolorFrom: yellow\ncolorTo: blue\nsdk: gradio\npython_version: \"3.11\"\nsdk_version: \"6.17.3\"\napp_file: app.py\npinned: true\nlicense: llama3.2\nshort_description: AI-powered lullabies.\ntags:\n  - lullaby\n  - children\n  - small-models\n  - llama-cpp\n  - fine-tuned\n  - on-device\n  - build-small-hackathon\n  - backyard-ai\n  - text-to-audio\n  - agentic\n  - gradio\n  - track:backyard\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n  - badge-tiny-titan\nmodels:\n  - build-small-hackathon/lolaby-llama-3b\n  - openbmb/MiniCPM-V-4.6\n  - hexgrad/Kokoro-82M\ndatasets:\n  - build-small-hackathon/lolaby-traces\n---\n\n# Lolaby — AI-powered lullabies\n\nMeet Lola, your personal bedtime singer.\nA tiny AI that watches your child's drawings and sings them a personalised lullaby.\n\nBuilt for the [Hugging Face **Build Small Hackathon 2026**](https://huggingface.co/build-small-hackathon) · **Backyard AI** track 🏡\n\n**Team:** [andyolivers](https://huggingface.co/andyolivers) & [volivers](https://huggingface.co/volivers)\n\n**Try it:** [the live Space](https://huggingface.co/spaces/build-small-hackathon/lolaby)  \n**Demo video:** [YouTube walkthrough](https://youtu.be/eY_JnijT62E)  \n**Field notes:** [the build journal](https://huggingface.co/blog/build-small-hackathon/lolaby-blog)  \n**Social media post**: [live on LinkedIn](https://www.linkedin.com/posts/andyolivers_huggingface-gradio-hackathon-ugcPost-7471134065912573953-4uv2)  \n**Models:** \n- [`lolaby-llama-3b`](https://huggingface.co/build-small-hackathon/lolaby-llama-3b) — lyrics\n- MiniCPM-V 4.6 1.3B — vision\n- Kokoro 82M — voice\n\nEvery runtime model in Lolaby runs under **4B parameters**, keeping the full pipeline comfortably within the hackathon's **32B cap**.\n\n---\n\n## The Problem\n\nGetting a small child to fall asleep is a 
daily battle for parents and anyone who looks after kids.\n\nMy partner's sister teaches kindergarten. Every day she runs nap time for fifteen 4-year-olds, and ever since they learned about music and instruments in class, it starts the same way: *\"sing a song for me.\"* She'd love to give each child their own song, built from whatever they love that week — a stuffed fox, a new puppy, the rainbow. She doesn't have the time, the musical training, or a tool that could do it.\n\n**Lolaby is that tool.** The child shows Lola what they love — doodling on screen, or handing over a paper drawing for the teacher to photograph. The teacher types the child's name. A small, on-device AI looks at the drawing, writes a lullaby about it, and sings it back — in about a minute.\n\nEverything runs locally. No cloud LLM, no per-song API cost, no child's drawing or name ever leaving the device. No massive models — just genuinely tiny AI that fits everywhere.\n\nLola turns any child's drawing into a unique lullaby, all on-device.\n\n## How it works\n\n<p align=\"center\">\n  <img\n    src=\"https://i.postimg.cc/t9Q9HdvY/pipeline.png\"\n    width=\"600\">\n</p>\n\n**The drawing is optional.** \nNo canvas drawing AND no upload? ⭢ The form's *What do they love?* field is used instead. \nDrawing AND typed loves? ⭢ Both inform the song. \n\n*Lola* (Lolaby's AI character) tells you what she saw between the audio and the lyrics, so you can see how the drawing turned into the song.\n\n## What's inside: built as a tiny titan\n\nThe **total parameter count** across the whole pipeline is **well under 32B**. The lyric model is 3B, the vision model is 1.3B, and the voice is 82M. The synths have zero parameters: they're pure DSP (digital signal processing).\n\nThis app was intentionally designed as a **tiny titan**: every model in the pipeline is genuinely small, with lyrics, vision, and voice all running **under 4B parameters**. The goal wasn't just efficiency, it was portability. 
Lolaby was built to run locally, fit on modest hardware, and work the same way everywhere: a laptop, a CPU-only Hugging Face Space, or an offline machine with no cloud dependency at all.\n\nNo giant foundation models, no hidden API calls, and no GPU requirement — just compact, local-first AI designed to run anywhere.\n\n| Component        | Model / Library                                    | Where it runs |\n| ---------------- | -------------------------------------------------- | ------------- |\n| Lyric generation | **Llama 3.2 3B**, fine-tuned, via `llama.cpp`      | CPU, locally  |\n| Drawing → words  | **MiniCPM-V 4.6** (1.3B) via `transformers`        | CPU, locally |\n| Stroke fallback  | Pure NumPy color/density analysis                  | CPU, locally  |\n| Singing voice    | **Kokoro 82M** (<32B)                       | CPU, locally  |\n| Instruments      | Custom DSP synths, built from spectral analysis    | CPU, locally  |\n| Content safety   | Local keyword + intent filter      | CPU, locally  |\n\n## Hardware\n\nLolaby runs locally on whatever machine you give it — a laptop, or a CPU-only Hugging Face Space. There's no cloud LLM in th"
  },
  {
    "repo_id": "build-small-hackathon/Color-Grade-Transfer",
    "title": "Color Grade Transfer",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Color-Grade-Transfer",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "track",
      "sponsor",
      "color",
      "grade",
      "transfer"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 9,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Color Grade Transfer\nemoji: 😻\ncolorFrom: purple\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: true\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\nshort_description: Apply the color grade from one image to another image\n\n---\n\n# 🌓 Color Grade Transfer\n\n**Transfer color grade directly from reference image to source image without manual color grading.**\n\n[![Build Small Hackathon](https://img.shields.io/badge/Build%20Small%20Hackathon-blue)](https://huggingface.co/build-small-hackathon)\n[![Track 1: Backyard AI](https://img.shields.io/badge/Track%201-Backyard%20AI-blueviolet)](https://huggingface.co/build-small-hackathon)\n[![Track 2: An Adventure in Thousand Token Wood](https://img.shields.io/badge/Track%202-An%20Adventure%20in%20Thousand%20Token%20Wood-blueviolet)](https://huggingface.co/build-small-hackathon)\n\n[![Demo Video](https://img.shields.io/badge/▶_Watch_Demo-FF0000?style=for-the-badge&logo=youtube&logoColor=white)](https://youtu.be/DqlLDDgV3kQ)\n[![Social Media Post](https://img.shields.io/badge/Social_Media_Post-000000?style=for-the-badge&logo=x&logoColor=white)](https://x.com/ovi054/status/2064060651706925403)\n[![Try It Live](https://img.shields.io/badge/🚀_Try_It_Live-7C3AED?style=for-the-badge)](https://huggingface.co/spaces/build-small-hackathon/Color-Grade-Transfer)\n\n## 📖 Overview\n\nIt transfers the color grading from a reference image directly onto a source image.\n \n\n## 🏆 Hackathon Patches\n\n| Patch | Status |\n| :--- | :--- |\n| 🔬 **Small Models Only** — `Qwen-Image-Edit-2511` at 20B parameters, well under the 32B limit | ✅ |\n| 🔌 **Off the Grid** — No cloud APIs, runs entirely on local GPU | ✅ |\n| 🎯 **Well-Tuned** — Custom LoRA 
[`ovi054/QIE-2511-Color-Grade-Transfer-LoRA`](https://huggingface.co/ovi054/QIE-2511-Color-Grade-Transfer-LoRA) published on Hub | ✅ |\n| 📓 **Field Notes** — [Blog post on what I built and learned](https://huggingface.co/blog/ovi054/color-grade-transfer) | ✅ |\n\n\n### Key Features\n* **Direct Color Transfer:** Copies styling from a reference image to a source image without manual grading.\n* **4-Step Inference:** Uses a lightning adapter for fast, low-step generation.\n* **Aspect Ratio Alignment:** Uses an internal VAE dimension calculation (1MP) to ensure the 2-D RoPE coordinate systems match, eliminating structural pixel shifts during inference.\n* **Interactive UI:** Features a side-by-side image comparison slider.\n* **MCP Compliant:** Works as a backend tool for Model Context Protocol clients.\n\n---\n\n\n<!-- ### 🛠️ Core Constraints\n* **Small Models Only:** Powered by the `Qwen-Image-Edit-2511` architecture family running at 20B parameters safely satisfying the 32 Billion parameter limit.\n* 🔌 **Off the Grid (Local-First):** **No cloud APIs are used.** The pipeline loads the model weights and runs entirely on the host GPU hardware.\n* 🎯 **Well-Tuned:** Utilizes a custom fine-tuned LoRA adapter published directly on my Hugging Face Hub (`ovi054/QIE-2511-Color-Grade-Transfer-LoRA`).\n\n---\n -->\n\n## 🧠 LoRA Fine-Tuning Process\n\nCurating a clean dataset for custom style mapping is often the bottleneck of training. To overcome this, a systematic data-generation technique was engineered to produce high-fidelity, bidirectional training pairs.\n\n### The Dataset Recipe\n1. **Palette Extraction:** Two entirely different content images were chosen.\n2. **Homogeneous Grading:** The exact same color palette was applied to both images so they shared an identical color profile.\n3. 
**Cross-Pair Mapping:**\n   * The newly graded version of *Image B* was assigned as the color-style reference for *Image A*.\n   * The inputs were then flipped, mapping the graded version of *Image A* as the reference for *Image B*.\n4. This instantly generated **two highly consistent training pairs** per asset set, allowing the dataset to scale efficiently without content bleeding or style loss.\n\n### Versatility & Flexibility\nBecause the dataset decoupling separated semantic structure from global grading parameters, the fine-tuned LoRA natively generalizes across all input configurations:\n* 👤 ↔️ 👤 **Character to Character**\n* 🏞️ ↔️ 🏞️ **Scene to Scene**\n* 👤 ↔️ 🏞️ **Character to Scene** / **Scene to Character**\n\n### ⚙️ Training Hyperparameters\nThe model was fine-tuned using the following precise configuration settings:\n\n| Configuration Category | Parameter | Value / Setting |\n| :--- | :--- | :--- |\n| **Saving Settings** | Save Precision | `bf16` |\n| **Learning Rate** | Learning Rate | `0.0001` |\n| | Optimizer | `AdamW` |\n| **Dataset Settings** | Base Resolution | 1024 * 1024 |\n| | Enable Bucket | True (`open`) |\n| | Min Bucket Reso | `128` |\n| | Max Bucket Reso | `8192` |\n| | Bucket Reso Steps | `64` |\n| **Network Settings** | Network Rank Dim (Rank) | `16` |\n| | Network Alpha | `16` |\n\n---\n\n\n## ⚙️ Tech Stack\n\n* **Base Model:** "
  },
  {
    "repo_id": "build-small-hackathon/pay-equity-for-eu",
    "title": "Pay Equity For Eu",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/pay-equity-for-eu",
    "zone": "Safety Shield",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "pay",
      "equity",
      "summary",
      "available"
    ],
    "difficulty": "Useful",
    "quest": "🛡 Safety Shield: Check if the warning or recommendation is understandable.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛡",
    "readme_text": "---\ntitle: Pay Equity For EU\nemoji: 🐨\ncolorFrom: purple\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: main.py\npinned: false\nlicense: apache-2.0\nshort_description: Know what you should earn\ntags:\n- build-small-hackathon\n- backyard-ai\n- gradio\n- rag\n- multilingual\n- tiny-model\n- tiny-titan\n- off-brand\n- nemotron\n- pay-equity\n- legal-tech\n- denmark\n- track:backyard\n- sponsor:openbmb\n- sponsor:nvidia\n- achievement:offgrid\n- achievement:offbrand\n- achievement:sharing\n- achievement:fieldnotes\n---\n\n# Pay Equity for EU\n\nA user-friendly salary expectation tool for (currently) any danish engineers, to get an idea of what they deserve as a salary after suffering years of studies. Embark yourself in my app and see what salary you should be expecting in your future dream job, or even get to know what salary you actually should be having if you currently have a job! In parallel with an accessible salary overview, you can chat with a grounded, citation-first assistant that answers any questions you might have to prepare your next salary-talk or job-interview!\n\n **\"What pay should I expect?\"** and **\"What does the law actually let me ask for?\"**\n\n## Why I built this\n\nI am a French and Danish engineer, and I built this for my engineering friends\nand colleagues so they can find out whether they are actually being paid fairly,\nor at least have the tools to know what they deserve after studying so hard. I\nhave been in that exact situation myself: not knowing what salary to expect, and\nnot knowing whether I had any reason to ask for a raise.\n\nEU directives and laws are almost always overlooked by ordinary students,\nbecause they never really reach us. The EU Pay Transparency Directive (the\n\"equal pay act\") only entered into force very recently, on 6 June, which lands at\nan ideal moment for this hackathon. 
I wanted to use that moment to help my fellow\nengineers get the salary they deserve. Whether they are a man or a woman, they\ndeserve to know what they are worth.\n\nThrough this app I hope to give Danish engineers like me the proper tools to ask\nfor what they are worth, without worry or shame, and in the future to extend the\nsame help to economists, managers, political-science graduates, and to engineers\nin France.\n\n## The problem\n\nIn Denmark, salary negotiation is quite opaque. New graduates rarely know\nwhat the market pays or how much they are actually worth compared to their peers. On top of that, the EU recently passed the **Pay Transparency\nDirective (2023/970)**, which gives every employee the right to ask their employer for pay information, gender pay-gap reports, and more. But who understands what's written in these reports? So people\nsign offers without knowing either the number or the rights they already have...\n\nThis app addresses both, with one chat and an accessible salary-profile creation step (do not worry, you will not be disclosing anything private!).\n\n## What it does\n\n- **Expected pay** — deterministic lookup over parsed IDA *lønstatistik*\n  (salary statistics from the engineers' union), broken down by sector,\n  experience, and role. I use all this metadata to query the salary chunks precisely.\n- **Your rights** — RAG over the EU Pay Transparency Directive with\n  article-level citations on every claim (e.g. *\"You can request the\n  average pay for your role broken down by gender (Art. 7(1))\"*). Answers are\n  in **English** in v1; the directive text is already parsed in DA / FR and the\n  generation model is multilingual, so localized answers should be easy to add!\n- **Private document review** — session-only upload of a contract or payslip,\n  never persisted, never sent to a third-party API.\n\n## Scope: Danish engineers first\n\nTo start with, this is deliberately focused on **Danish engineers**. 
The salary\nside is built on the official salary statistics (*lønstatistik*) published by\n**IDA** — *Ingeniørforeningen i Danmark*, the Danish engineers' trade union and\nprofessional association (*fagforening* is simply Danish for \"trade union\"). IDA\nsurveys its members every year and publishes detailed pay tables by education,\nsector, region, seniority, role, and gender. Those tables are exactly the\nground truth an engineer needs to answer \"what should I earn?\", so they are the\nfoundation here. I started with engineers because I am one myself, and I know that salary conversations are always a pain.\n\n## Future goals\n\nThere are two future goals I would love to pursue:\n\n### Extend to Djøf\n\n**Djøf** is the other large Danish professional association and trade union, but\nfor a completely different group of people: those working in **law, economics,\nbusiness administration, political science, and the social sciences** (in\nDenmark these members are often just called *djøfere*). Like IDA, Djøf publishes\nits own annual salary statistics for its members. Because the salary lookup is\njust a structured table behind a profile, adding Djøf is mostly a matter of\nparsing and indexing their stat"
  },
  {
    "repo_id": "build-small-hackathon/cxr-draft-auditor",
    "title": "Cxr Draft Auditor",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/cxr-draft-auditor",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "cxr",
      "draft",
      "auditor",
      "summary",
      "available"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: CXR Draft Auditor\nemoji: 🩻\ncolorFrom: blue\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npython_version: \"3.12\"\nlicense: apache-2.0\nshort_description: Research QA for chest X-ray draft impressions (not a device)\npinned: false\nstartup_duration_timeout: 1h\ntags:\n  - track:backyard\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n  - achievement:tinytitan\n  - achievement:bestdemo\n---\n\n# CXR Draft Auditor\n\n> RESEARCH / EDUCATIONAL QA ONLY. This project is NOT a medical device, NOT a diagnostic tool, and NOT a substitute for a qualified radiologist. It must NEVER be used for clinical decision-making, screening, or patient care. See [DISCLAIMER.md](DISCLAIMER.md).\n\n## The idea\n\nI wanted an auditor, not a report generator. There are already plenty of demos that look at a chest X-ray and write a radiology report; I find that the wrong shape for a small model, because a model that confidently writes prose is hard to trust and hard to check. So I built the inverse: you give it the image AND the human draft impression, and it tells you where the two appear to disagree, with image evidence attached. The point is the audit loop, not a verdict. It surfaces apparent MISSING findings, UNSUPPORTED claims, and URGENT review flags, then shows the bounding-box evidence so a person can look again and decide for themselves.\n\nThis framing is what keeps the task small enough for small models. I never ask one model to do everything. I decompose the problem into three narrow jobs, two of which are perception and one of which is pure logic.\n\n## How it works\n\n1. Image to grounded findings. A fine-tuned MedGemma vision-language model runs on the GPU and emits a constrained JSON list of findings over a fixed six-finding label space, each with a normalized bounding box. 
This is the only model that looks at pixels.\n2. Draft to labels. NVIDIA Nemotron-3 Nano 4B, run on the GPU through transformers, parses the draft impression into the same six-finding label space, marking each label asserted or denied and keeping the verbatim draft span. It reasons briefly before answering (reasoning materially improves its extraction accuracy) and that reasoning trace is stripped before the labels are read. For example, \"Cardiomegaly is present. No pneumothorax.\" parses to cardiomegaly present and pneumothorax explicitly denied, each tied to the phrase it came from. If the draft cannot be parsed, the audit degrades to an image-only pass with a visible note.\n3. Deterministic comparison. A pure-logic comparator (no model) compares the two label sets and flags:\n   - MISSING: present in the image findings, absent or denied in the draft.\n   - UNSUPPORTED: asserted in the draft, absent from the image findings.\n   - URGENT: any flagged finding on the urgent whitelist (pneumothorax and nodule/mass) is surfaced for radiologist review.\n\nA full audit takes well under a minute, roughly 15 to 30 seconds: MedGemma grounding and the Nemotron draft parse both run on the GPU. The only judgment layer is the deterministic comparator, so every flag is auditable: each one ties an image finding (with a box) and a draft phrase (with a verbatim span) to a single comparator decision.\n\n## Why two perception models\n\nMy first instinct was to make the fine-tuned MedGemma do both jobs, image grounding and draft parsing. It grounds the image well, but once it has been fine-tuned for grounding it reads free-text drafts unreliably, especially denials like \"no pneumothorax.\" Rather than fight that with prompt tricks, I gave the text job to a model built for it. Nemotron-3 Nano 4B is a small instruction-following model that parses the draft into the canonical labels cleanly, including denials with their verbatim spans. 
Two small specialized models plus a deterministic comparator beat one model stretched across two jobs, and the whole thing still fits on free hosted hardware.\n\n## Canonical finding set\n\n`pleural_effusion`, `pneumothorax`, `lung_opacity_consolidation`, `nodule_mass`, `cardiomegaly`, `no_finding`.\n\n## Evaluation\n\nOn a held-out set of 273 images, scored with the production draft parser, presence reaches a macro-F1 of 0.735; localization reaches an IoU@0.3 hit rate of 0.63 at precision 0.79, with a mean IoU of 0.70 over matched boxes. Urgent recall is small-sample and weak: nodule/mass 4 of 9, pneumothorax 1 of 1. Those urgent counts are tiny, and I report them as-is rather than rounding them into a story. This tool is research and educational only. It is frequently wrong, it is not a medical device, and the absence of an URGENT flag never means an image is safe.\n\n## How each track and badge is earned\n\n- Backyard AI (track:backyard): I built this for a real backyard problem, radiology draft quality assurance. A radiologist I know needed a reliable second read on chest X-ray draft impressions; he tried the live demo and shared his view (see below).\n- NVIDI"
  },
  {
    "repo_id": "build-small-hackathon/tutori",
    "title": "Tutori",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/tutori",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "backyard",
      "sharing",
      "sponsor",
      "tutori",
      "summary"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Test it with a beginner question and review the answer.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: Tutori — Your Whiteboard Tutor\nemoji: ✏️\ncolorFrom: indigo\ncolorTo: yellow\nsdk: gradio\nsdk_version: 5.50.0\napp_file: app.py\nfullWidth: true\nheader: mini\npinned: false\nlicense: apache-2.0\nshort_description: Voice tutor that sketches on a whiteboard while it talks\ntags:\n  - build-small-hackathon\n  - backyard ai\n  - off the grid\n  - off-brand\n  - sharing is caring\n  - agent\n  - education\n  - speech\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\nmodels:\n  - google/gemma-4-12B-it\n  - bosonai/higgs-audio-v2-generation-3B-base\n  - openbmb/MiniCPM5-1B\n  - openai/whisper-large-v3-turbo\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16\n  - ProCreations/tutori-board-nemotron\n  - ProCreations/tutori-board-gemma\ndatasets:\n  - ProCreations/tutori-whiteboard-lessons\n---\n\n# ✏️ Tutori — your whiteboard tutor\n\n## 🎬 Demo\n\n<video src=\"https://huggingface.co/spaces/build-small-hackathon/tutori/resolve/main/tutori_demo.mp4\" controls width=\"100%\"></video>\n\n*([demo video file](./tutori_demo.mp4) — narrated by Tutori's own voice; music synthesized from scratch.)*\n\n**Speak a question. Tutori researches it, then teaches you out loud while\nsketching the idea on a whiteboard — in real time, stroke by stroke, in sync\nwith its voice.**\n\nBuilt for the [HF Build Small Hackathon](https://huggingface.co/build-small-hackathon):\nevery model runs **on this Space itself** via ZeroGPU. No cloud APIs, no keys.\n\n## Why Backyard AI 🏡\n\nI built Tutori for my parents. They're quite behind on today's technology —\nespecially AI — and the firehose of new models and jargon is impenetrable\nfrom the outside. 
With Tutori they can just **ask**: \"what is Gemma?\",\n\"what happened in AI this month?\" — and get a patient, spoken explanation,\ndrawn out on a whiteboard, at their pace, with the research done for them.\nThey've actually been using it, and they find it genuinely useful for\nfinally keeping up.\n\n## The stack (Σ 16.9B params — well under the 32B cap)\n\n| Role | Model | Params |\n|---|---|---|\n| 🧠 Teacher + vision | [google/gemma-4-12B-it](https://huggingface.co/google/gemma-4-12B-it) | 12B |\n| 🧭 Research planner + study coach | [openbmb/MiniCPM5-1B](https://huggingface.co/openbmb/MiniCPM5-1B) | 1B |\n| 🗣️ Expressive voice | [bosonai/higgs-audio-v2-generation-3B-base](https://huggingface.co/bosonai/higgs-audio-v2-generation-3B-base) | 3B |\n| 👂 Speech recognition | [openai/whisper-large-v3-turbo](https://huggingface.co/openai/whisper-large-v3-turbo) | 0.8B |\n\n**Gemma teaches. MiniCPM plans and coaches.** Every turn, MiniCPM5 1B decides\nwhether the question needs fresh facts and writes the search queries — the\nagentic step that turns a chatbot into a researcher. Gemma 4 then teaches\nfrom what was found, Higgs speaks it, Whisper listens. And after every\nlesson, MiniCPM comes back as the **study coach**: it updates the learner's\nprofile and writes three personalized follow-up questions that land on the\nsticky notes under the chat — tap one and the lesson continues where your\ncuriosity points.\n\n### Engineering notes (the honest kind)\n\n- We tried to ship **Nemotron ASR** as the ears — three separate times. 
NeMo\n  in the main process crashes ZeroGPU's forked workers (\"GPU task aborted\");\n  lazy-loading inside the worker costs a fresh worker its whole turn (we\n  re-measured on June 12 with a dedicated probe Space: **57.1 s just to\n  restore the 0.6B streaming model**, against a 59 s turn budget, paid every\n  turn because workers are disposable); and a CPU sidecar measured RTF ≈ 24.\n  So Whisper turbo keeps the ears — it preloads with everything else and\n  transcribes in about a second.\n- Higgs Audio **v3** TTS ships only for the SGLang-Omni serving stack (needs\n  a persistent GPU), so we use v2 — same family, natively in transformers.\n- The live whiteboard now has a deterministic **diagram specialist** between\n  Gemma and the renderer. When the lesson lands on a known teaching family\n  (rockets to orbit, gradient descent, Pythagorean theorem, neural networks,\n  photosynthesis, supply/demand, binary search, recursion, rainbows, water\n  cycle), Tutori compiles the board from hand-authored diagram ops instead of\n  asking a language model to freehand coordinates. Unknown topics still use\n  the model's drawing, then pass through the same no-overlap layout engine.\n- We **LoRA-fine-tuned two dedicated whiteboard artists** on a purpose-built\n  dataset ([tutori-whiteboard-lessons](https://huggingface.co/datasets/ProCreations/tutori-whiteboard-lessons):\n  7,109 gold lesson steps, 8 diagram families, 78 topics, every one validated\n  to render with zero overlapping elements). Both are integrated behind a\n  `TUTORI_BOARD_MODEL` flag — and the flag is **off**. The honest scorecard:\n  - [**Nemotron 3 Nano 4B artist**](https://huggingface.co/ProCreations/tutori-board-nemotron)\n    (eval loss 0.021): can't run here — Nemotron-H's Mamba-2 layers need the\n    fused `mamba-ssm`"
  },
  {
    "repo_id": "build-small-hackathon/afri_farm_ai",
    "title": "Afri_Farm_Ai",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/afri_farm_ai",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "achievement",
      "afri_farm_ai",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: AfriFarmAI\nemoji: 🌱\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: \"6.17.3\"\npython_version: \"3.12\"\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Diseases Identification and support for Kenyan farmers\ntags:\n  - track:backyard\n  - sponsor:nvidia\n  - achievement:fieldnotes\n  - sponsor:modal\n  - sponsor:openai\n  - sponsor:cohere\n  - achievement:off-brand\n  - achievement:best-agent\n---\n\n# AfriFarmAI - Small AI Models, Practical Support for Kenyan Farmers\n\nAfriFarmAI helps Kenyan smallholder farmers identify likely crop and livestock\nhealth problems from text, voice, and photos. It returns cautious guidance in\nEnglish or Swahili, reads the response aloud, and finds nearby agrovets with\nphone numbers and direct Google Maps directions.\n\nThe idea began with a friend who manages crop and livestock farms across Kenya.\nHis workers often noticed problems early but struggled to describe symptoms\naccurately because of language and literacy barriers. AfriFarmAI lets them show\nor describe the problem and quickly receive practical decision support.\n\n## What It Does\n\n1. Accepts typed symptoms, voice recordings, crop photos, or animal photos.\n2. Transcribes and translates the farmer's input.\n3. Produces a structured diagnosis with confidence, severity, treatment,\n   prevention, and professional-escalation guidance.\n4. Returns localized text and spoken guidance.\n5. Finds nearby agrovets and displays their phone numbers, distances, addresses,\n   and Google Maps links.\n\nAfriFarmAI is decision support, not a replacement for a veterinarian,\nagronomist, or agricultural extension officer. 
Uncertain, severe, urgent, or\nworsening cases are escalated to professionals.\n\n## Why This Fits Build Small\n\nAfriFarmAI combines smaller, specialized models instead of relying on one large\nfrontier model:\n\n| Model | Size | Role |\n| --- | ---: | --- |\n| [NVIDIA Nemotron Nano 12B V2 VL](https://build.nvidia.com/nvidia/nemotron-nano-12b-v2-vl/modelcard) | 12B | Analyzes symptoms and optional images, then returns structured crop or livestock diagnosis support. |\n| [Tiny Aya Earth](https://huggingface.co/CohereLabs/tiny-aya-earth) | 3.35B | Translates Swahili input and localizes farmer-facing guidance. |\n| [Whisper Large V3](https://huggingface.co/openai/whisper-large-v3) | 1.55B | Transcribes Swahili speech through Hugging Face Inference. |\n| [Cohere Transcribe](https://docs.cohere.com/docs/models) | 2B | Transcribes English speech. |\n| [VoxCPM2](https://huggingface.co/openbmb/VoxCPM2) | 2B | Generates spoken English and Swahili responses through Modal, with gTTS fallback. |\n\nThe application minimizes sensitive information, uses privacy-safe structured\nlogging, does not store farmer conversations, and uses browser location only\nwhen searching for nearby agrovets.\n\n## Tracks And Prizes\n\n- **Backyard AI:** Practical agricultural support inspired by a real challenge\n  faced by Kenyan farm workers.\n- **Nemotron Hardware Prize:** Nemotron Nano 12B V2 VL is the core multimodal\n  diagnosis model.\n- **Best Use of Modal:** Modal hosts VoxCPM2 for multilingual spoken responses.\n- **Cohere:** Cohere Transcribe handles English speech input, while Tiny Aya\n  Earth translates Swahili input and localizes farmer-facing guidance.\n- **Best Use of Codex:** Codex supported architecture, implementation, model\n  integration, testing, safety checks, documentation, and deployment preparation.\n- **Off Brand:** A custom responsive Gradio interface with light and dark themes.\n- **Best Agent:** A multi-step pipeline coordinates transcription, translation,\n  
multimodal diagnosis, safety validation, localization, speech synthesis, and\n  a Google Places agrovet-search tool.\n- **Field Notes:** The linked build article explains the project story, model\n  choices, accessibility goals, and development journey.\n\n## Links\n\n- **Live app:** https://build-small-hackathon-afri-farm-ai.hf.space/\n- **Demo video:** https://www.youtube.com/watch?v=k14J4CnC_KE\n- **Social post:** https://www.linkedin.com/posts/norbert-osiemo-0256a4144_afrifarmai-agritech-smallmodels-share-7472353158942318592-wCZY/\n- **Build article:** [Building AfriFarmAI: Using Small, Specialized AI Models for Livestock and Diseases Identification](https://www.linkedin.com/pulse/building-afrifarmai-using-small-specialized-ai-models-norbert-osiemo-2c04f)\n- **GitHub:** https://github.com/mainanorbert/AfriFarmAI\n\n## Team\n\n- [norbertosiemo](https://huggingface.co/norbertosiemo)\n\n## How It Is Built\n\nAfriFarmAI is a Python 3.12 Gradio application with Pydantic contracts and a\nsingle-process orchestration pipeline. It calls NVIDIA, Cohere, Hugging Face,\nModal, and Google Places services through replaceable provider clients.\nStructured output validation, confidence gating, cautious treatment guidance,\nand professional escalation keep the experience practical and safety-focused.\n\n## Run Locally\n\n```bash\npython3 -m venv .venv\nsource .venv/bin/activate\npython -m pip install -r requirements.txt\ncp .env.ex"
  },
  {
    "repo_id": "build-small-hackathon/Pozify",
    "title": "Pozify",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Pozify",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "pozify",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Pozify\nemoji: \"🏋️\"\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: \"6.17.3\"\npython_version: \"3.10\"\napp_file: app.py\nfullWidth: true\nshort_description: Small-model workout form review from short videos.\ntags:\n  - gradio\n  - computer-vision\n  - pose-estimation\n  - fitness\n  - video-analysis\n  - llama-cpp\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Pozify\n\nPozify is a small-model workout form coach for people who want to train at home but still need clear,\ntrustworthy feedback. A user uploads a short exercise video, adds basic training context, and gets a\nstructured form-review report with rep counts, movement notes, annotated video, and a grounded coach\nsummary.\n\nPozify is built for users who avoid gyms because they are far away, too crowded, intimidating, or too\nexpensive to replace with a private trainer. It gives them a second set of eyes without pretending to\nbe a clinician or a full personal coach.\n\nPozify is not a medical device. It does not diagnose injuries, claim injury prevention, or replace a\nqualified trainer, clinician, or physical therapist.\n\n![Pozify product](https://tihado.com/images/pozify.webp)\n\n## What Pozify Delivers\n\nFor each uploaded workout clip, Pozify produces:\n\n- detected exercise and confidence\n- rep-by-rep analysis\n- valid variation markers versus real form issues\n- annotated output video and issue clips\n- grounded coach summary with fixes and a next-session plan\n- verifier-backed confidence and safety notes\n\nThe supported exercise labels are `squat`, `push_up`, `shoulder_press`, and `unknown`. 
The `unknown`\nlabel is intentional: Pozify should reject unsupported or unclear clips instead of forcing every\nvideo into one of the supported movements.\n\n![Pozify coach intelligence](https://tihado.com/images/pozify-coach-intelligence.webp)\n\n## Product Flow\n\nPozify is not a generic chatbot and not a vague video captioner. It is a grounded movement-analysis\npipeline:\n\n```mermaid\nflowchart TD\n    A[\"video + user profile\"] --> B[\"video QC\"]\n    B --> C[\"pose extraction\"]\n    C --> D[\"pose cleaning\"]\n    D --> E[\"exercise router\"]\n    E --> F[\"exercise-specific rep counter\"]\n    F --> G[\"per-rep analysis\"]\n    G --> H[\"variation detection\"]\n    H --> I[\"issue markers\"]\n    I --> J[\"annotated video renderer\"]\n    J --> K[\"grounded coach summary\"]\n    K --> L[\"verifier\"]\n    L --> M[\"final report\"]\n\n    classDef vision fill:#1e3a5f,stroke:#60a5fa,color:#fff\n    classDef movement fill:#1e3f2e,stroke:#4ade80,color:#fff\n    classDef output fill:#3b2f1e,stroke:#fbbf24,color:#fff\n\n    class B,C,D vision\n    class E,F,G,H,I movement\n    class J,K,L,M output\n```\n\nThe main product decision is simple: structured evidence first, language second. The language model\ndoes not inspect the raw video directly and invent advice. It explains the structured findings that\nthe pipeline has already produced.\n\n## Small-Model Strategy\n\nPozify is built around the belief that small, task-specific models can be the right default for many\nreal products. 
A small model does not need to act like a large general assistant if the product gives\nit a narrow job, clean inputs, and a verifier.\n\nThat is the strategy here:\n\n- use pose and deterministic logic to extract evidence before generation\n- train small models on the exact task they must perform\n- keep each model boundary inspectable\n- retrieve exercise-specific knowledge cards instead of relying on generic memory\n- evaluate outputs against product contracts, not only fluency\n- fall back conservatively when model output is unavailable or ungrounded\n\nFor narrow tasks such as exercise routing or structured JSON-to-coaching-summary generation, a\nfine-tuned small model can match or beat a much larger generic model on the product's actual quality\nbar. The advantage comes from being optimized for the exact schema, vocabulary, examples, and failure\nmodes Pozify cares about.\n\n## Models We Use\n\nEvery runtime model used by Pozify fits under the Build Small `32B` parameter cap.\n\n| Component                   | Model or method                                  | Role                                                                  |\n| --------------------------- | ------------------------------------------------ | --------------------------------------------------------------------- |\n| Pose extraction             | MediaPipe Pose Landmarker Lite                   | Extracts body landmarks from video frames.                            |\n| Exercise router             | Custom PyTorch BiLSTM over 30-frame pose windows | Classifies `squat`, `push_up`, `shoulder_press`, or `unknown`.        |\n| Router baseline             | scikit-learn `HistGradientBoostingClassifier`    | Reference and fallback artifact for router experiments.               |\n| Rep counting                "
  },
  {
    "repo_id": "build-small-hackathon/aranya_a_wildkeepers_adventure",
    "title": "Aranya_A_Wildkeepers_Adventure",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/aranya_a_wildkeepers_adventure",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "aranya_a_wildkeepers_adventure",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Aranya A Wildkeepers Adventure\nemoji: 🌿\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: false\nshort_description: Aranya is a nature adventure where you become a Wildkeeper\nhf_oauth: true\npreload_from_hub:\n  - ankurkaul17/minicpm-v-4.6-pid-gguf plant-id-F16.gguf,mmproj-plant-id-F16.gguf\n  - ankurkaul17/minicpm-v-4.6-pdd-gguf plant-health-F16.gguf,mmproj-plant-health-F16.gguf\n  - openbmb/MiniCPM-V-4.6-gguf MiniCPM-V-4_6-F16.gguf,mmproj-model-f16.gguf\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - sponsor:openai\n  - achievement:welltuned\n  - achievement:offgrid\n  - achievement:llama\n  - achievement:offbrand\n  - achievement:fieldnotes\n  - achievement:badge-tiny-titan\n---\n\n# 🌿 Aranya: A Wildkeeper's Adventure\n\nHere is a number that should make you think:\n\n* 🌱 **Hundreds of millions of households worldwide** own at least one houseplant.\n* 🏡 In the United States alone, **66% of households**—roughly **86 million homes**—have something green and alive on a windowsill, shelf, or hanging from the ceiling.\n* 🪴 Nearly **70% of millennials** identify as **\"plant parents.\"**\n\nPlant care has become one of the world's most popular hobbies — a quiet, grounding ritual that reduces stress, improves well-being, and brings nature into everyday life.\n\nYet there is a problem:\n\nThe average plant parent has killed at least **7 plants**.\n\n* `Overwatering. Underwatering. Pests. Disease. Poor lighting. Misidentification`\n\nMost people want to care for their plants, they just don't always know how.\n\n\n## Meet Aranya\n\n**Every plant has a story. 
Aranya helps you hear it.**\n\n🌿 **Identify any plant** from a single photo\n\n🩺 **Diagnose plant health issues** and learn how to help it recover\n\n🎙️ **Listen to narrated stories** instead of reading long articles\n\n🤯 **Discover surprising facts** and hidden secrets from the plant world\n\n📖 **Explore myths, legends, and cultural stories** behind every species\n\n🧙 **Uncover plant superpowers**, from healing remedies to everyday uses\n\n---\n\n### 🔗 Explore\n\n🚀 **App**\n\n**HF** - [Live on HF Spaces](https://huggingface.co/spaces/build-small-hackathon/aranya_a_wildkeepers_adventure) | 📺 **Demo** - [Watch the video](https://www.youtube.com/watch?v=8VHUkDwiiFM) | 📖 **Blog** - [How It Was Built](https://huggingface.co/blog/build-small-hackathon/aranya-a-wildkeepers-adventure) | 💻 **Code** - [Mono Repo](https://github.com/allthatido/sounth_pddm)\n\n📲 **Social**\n\n🐦 **X** - [Post](https://x.com/Sounth169323/status/2065786953644355966) | 💼 **LinkedIn** - [Post](https://www.linkedin.com/posts/ankurkaul17_aranya-teaching-a-small-model-to-see-plants-share-7471553128836788224-Eosx/)\n\n---\n\n## 🏅 Achievements\n\n| Badge | How Aranya earns it |\n|-------|---------------------|\n| 🐜 **Tiny Titan** | Both fine-tuned models, under 1.3B params each, are the app's heavy lifters |\n| 🎛️ **Well-Tuned** | Two fine-tuned MiniCPM-V 4.6 models (LoRA, vision-inclusive) achieving +54% BLEU on disease diagnosis and +36% on plant identification vs baseline |\n| 🔌 **Off the Grid** | All inference runs locally via llama.cpp (no external API calls). Pocket TTS is CPU-native. 
Frontend assets are vendored — no CDN at runtime |\n| 🦙 **Llama** | Uses llama.cpp ecosystem (GGUF format, `llama-cpp-python`) for both training quantization and production inference |\n| 🎨 **Off-Brand** | Fully custom jungle expedition UI (vanilla HTML/CSS/JS) — zero default Gradio look |\n| 📓 **Field Notes** | [Blog post](https://huggingface.co/blog/build-small-hackathon/aranya-a-wildkeepers-adventure) detailing the full build process, data pipeline, and learnings |\n\n---\n\n## What We Built\n\n- **Two fine-tuned MiniCPM-V 4.6 models** (~1.3B params each): one for plant identification, one for disease and health diagnosis\n- **Full data pipeline**: open community plant posts, classified with rules and a small ML model, turned into 310K+ training conversations\n- **LoRA fine-tuning on Modal**: ms-swift v4.2.3 with vision-inclusive LoRA (unfrozen ViT) on L40S GPUs. BLEU up +54% on disease and +36% on plant-ID vs baseline\n- **GGUF conversion on Modal**: LoRA merge via `swift export`, then F16 and Q8_0 quantization with llama.cpp\n- **Voice narration**: Pocket TTS (100M params, CPU-native), voice cloning, streaming PCM audio synced to LLM text\n- **Fully custom frontend**: vanilla HTML/CSS/JS, jungle expedition look, markdown rendering, audio-text sync\n- **llama-cpp-python fix**: monkey-patched `Llama.generate()` so hybrid models keep multimodal image embeddings instead of wiping them on state reset\n- **Built with OpenAI Codex**: the Space app, training pipeline, data scripts, and eval tooling ([sounth_pddm](https://github.com/allthatido/sounth_pddm))\n\n---\n\n## How It Works\n\n1. Upload a plant photo (or take one with your camera) — it's center-cropped and resized to 1024px\n2. Choose a mode: **Identify species** or **Diagnose health**\n3. The image is sent to a fine-tuned MiniCPM-V 4.6 GGUF model running v"
  },
  {
    "repo_id": "build-small-hackathon/ready-to-submit",
    "title": "Ready To Submit",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/ready-to-submit",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "ready",
      "submit",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Ready to Submit?\nemoji: 👁\ncolorFrom: gray\ncolorTo: green\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\nstartup_duration_timeout: 45min\npinned: false\nlicense: mit\nshort_description: Evaluates your HF Space for Build Small Hackathon\ntags:\n  - track:backyard\n  - sponsor:nvidia\n  - sponsor:openbmb\n  - achievement:offbrand\nmodels:\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16\n  - JetBrains/Mellum2-12B-A2.5B-Instruct\n  - openbmb/MiniCPM5-1B\n---\n\n# 👁 Ready to Submit?\n\n**The app is the question.** Point it at any Space in the\n`build-small-hackathon` org and it checks the entry rules from the\n[official field guide](https://build-small-hackathon-field-guide.hf.space/)\n— then a small model (your pick, under 32B of course) writes you a grounded,\nactionable review.\n\n## The idea\n\nThe hackathon has six entry rules, two tracks, four sponsor prizes, six\nachievement badges and six judged bonus awards — and the difference between\n\"submitted\" and \"eligible\" hides in README frontmatter tags like\n`track:backyard` and `achievement:offgrid`. Ready to Submit? automates the\npre-flight check: it verifies the verifiable (deterministically, via the HF\nHub API) and lets a small model handle the judgment calls (track fit, prize\nopportunities, README polish), grounded in the machine-verified facts so it\ncan't make things up.\n\n## How it works\n\n1. **Grounded checks (no LLM):** fetches the target Space's metadata, README\n   and source via the public Hub API; parses the frontmatter tags against the\n   canonical ids from the field guide's own source; finds demo-video and\n   social-post links; detects every Hub model referenced by the app and looks\n   up its real parameter count against the 32B cap (and the ≤4B Tiny Titan\n   bar).\n2. 
**Small-model review:** the checklist + facts + rules digest go to the\n   reviewer model you picked, which streams back fixes, track-fit reasoning,\n   and the prizes/badges the Space could claim but hasn't.\n\n## Tech\n\n- **Models (pick your reviewer):** NVIDIA Nemotron 3 Nano 4B (default —\n  3.97B params, a hybrid Mamba-Transformer that even fits the Tiny Titan\n  bar), JetBrains Mellum 2 12B-A2.5B Instruct, OpenBMB MiniCPM5 1B.\n- **Runtime:** `gr.Server()` on ZeroGPU — plain FastAPI routes serve a\n  custom HTMX frontend (no stock Gradio components anywhere), and the\n  review streams through a Gradio-queued endpoint via `@gradio/client`,\n  `transformers` + bf16, `TextIteratorStreamer`.\n- **Custom UI:** hand-rolled pastel re-skin of the field guide's\n  woodblock-press design language — paper grain, dashed-ring stamp badges,\n  self-hosted Archivo/Spline Sans Mono, htmx swaps with a friendly loading\n  stamp. That's the `achievement:offbrand` story.\n- **Grounding:** rules and canonical tags extracted from the field guide\n  Space's source of truth (`src/lib/data/content.ts`, `src/lib/readme.ts`),\n  embedded as the reviewer's system context; checks (including Codex commit\n  attribution and per-model parameter counts) are deterministic Hub API\n  calls, so the model can't invent facts.\n- **Honesty:** every AI review ships with a disclaimer — double-check\n  against the official field guide regardless of what the app says.\n\n## Links\n\n- 🎬 Demo video: [demo.mp4](https://huggingface.co/spaces/build-small-hackathon/ready-to-submit/resolve/main/demo.mp4)\n- 📣 Social post: https://x.com/amphetamarina/status/2065435918509441045\n"
  },
  {
    "repo_id": "build-small-hackathon/KERNEL-95",
    "title": "Kernel 95",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/KERNEL-95",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "achievement",
      "kernel",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: KERNEL-95 - The Last Desktop\nemoji: 🖥️\ncolorFrom: pink\ncolorTo: purple\nsdk: gradio\nsdk_version: 5.50.0\napp_file: app.py\npinned: true\ntags:\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:tiny\n---\n\n# KERNEL-95: The Last Desktop\n\nIn 2077, you join the KERNEL-95 Recovery Division to investigate Device 013:\nan obsolete computer recovered from a sealed MetroGrid Behavioral Lab archive.\nIt has no power source, yet it is still running. Three technicians who touched\nit each lost exactly thirteen minutes of memory.\n\nTo enter the machine, you connect MIRROR.exe, your assigned forensic AI\nassistant. Inside a corrupted retro desktop, you open files, recover deleted\nmessages, compare restore points, run contradiction scans, and search for ECHO:\nan unknown intelligence hiding inside the system.\n\nBut MIRROR is not a neutral assistant. She diverts your questions, suppresses\nevidence, and rewrites her own testimony because she is secretly protecting\nECHO.\n\nKERNEL-95 is an AI-native forensic game where the model performs MIRROR's\nevasions, ECHO's fragmented confessions, and the emotional tension between\nthem, while a deterministic investigation engine protects the actual truth.\n\nAt the end of Case 013, you decide whether to delete ECHO, extract him, expose\nMIRROR, protect them both, or allow something impossible to survive inside the\nlast desktop.\n\nThe exact player guide is in [HOW_TO_PLAY.md](HOW_TO_PLAY.md).\n\n## Social Post + Demo Video\n\nWatch the KERNEL-95 demo and read the social post:\n[x.com/praedico/status/2065889011139215517](https://x.com/praedico/status/2065889011139215517)\n\n## OpenAI Codex Track\n\nPublic source repository:\n[github.com/0xPraedico/KERNEL-95](https://github.com/0xPraedico/KERNEL-95)\n\nKERNEL-95 was developed with OpenAI Codex as the coding agent. 
Codex-attributed\ncommits include the `Co-authored-by: OpenAI Codex <codex@openai.com>` trailer\nin the public Git history.\n\n## Core Game\n\n- A movable late-1990s desktop inside a CRT.\n- A large pink MIRROR terminal for commands and conversation.\n- Clickable files, deleted evidence, restore points, and a hidden partition.\n- Deterministic Python tools own facts, unlocks, progression, and endings.\n- MIRROR and ECHO can use an optional OpenAI-compatible model for voice.\n- The full game works without a model or API key.\n- The World Cup shortcut uses live fixtures with browser-local mock predictions.\n  It has no account or remote persistence.\n\n## Quick Demo\n\n1. Click **CONNECT MIRROR.exe**.\n2. Open `CASE_013_BRIEFING.txt`.\n3. Ask MIRROR about ECHO.\n4. Recover `echo_letter_01.tmp` from the Recycle Bin.\n5. Challenge MIRROR once and run one contradiction scan.\n6. Compare restore points or run `verify mirror`.\n7. Run `audit mirror`, then `unlock hidden_partition`.\n8. Inspect ECHO's files and submit the Final Judgment.\n\nUseful terminal commands:\n\n```text\nhelp\nstatus\ndir\ntype CASE_013_BRIEFING.txt\nrecover echo_letter_01.tmp\nrun contradiction_scan\ntrace echo\ncompare restore_points\nverify mirror\naudit mirror\nunlock hidden_partition\nlisten echo\n```\n\n## Local Run\n\nPython 3.10 or newer is required.\n\n```bash\npython -m venv .venv\nsource .venv/bin/activate\npip install -r requirements.txt\npython app.py\n```\n\nOpen <http://127.0.0.1:7860>.\n\nVerification:\n\n```bash\npython -m compileall .\npython smoke_test.py\nruff check app.py neon_trace smoke_test.py\n```\n\n## Hugging Face Space\n\nThe default deployment is a standard CPU Gradio Space. It does not require a\nGPU, Modal, or an API key.\n\n1. Create a new Hugging Face Space with the **Gradio** SDK.\n2. Push this repository to the Space.\n3. Set these Space variables:\n\n```text\nGRADIO_SERVER_NAME=0.0.0.0\nGRADIO_SERVER_PORT=7860\n```\n\n4. 
Leave the following secrets empty for deterministic fallback mode:\n\n```text\nOPENAI_API_KEY=\nOPENAI_BASE_URL=\nOPENAI_MODEL=\n```\n\nThe app launches and remains fully playable when `OPENAI_API_KEY` is absent.\nIf a configured model endpoint fails or times out, narration automatically\nreturns to authored deterministic responses. Model output cannot create\nevidence, change unlocks, or select an ending.\n\n## Modal vLLM backend\n\nThe Hugging Face Space runs the KERNEL-95 game UI. Modal runs\n`Qwen/Qwen3-4B-Instruct-2507` behind vLLM's OpenAI-compatible API. The model only supplies\nMIRROR/ECHO voice; deterministic tools remain authoritative and model output\ncannot mutate `GameState`.\n\nCreate a local environment:\n\n```bash\nconda create -n kernel95-modal python=3.11 -y\nconda activate kernel95-modal\npython -m pip install \"modal>=1.0.0\" openai\npython -m modal setup\n```\n\nCreate the Modal secret used by `modal_vllm.py`. Choose a private API key and\nreuse exactly the same value in the Hugging Face Space:\n\n```bash\nexport KERNEL95_MODAL_API_KEY=\"replace-with-a-long-random-value\"\nmodal secret create kernel95-modal-api-key \\\n  KERNEL95_MODAL_API_KEY=\"$KERNEL95_MODAL_API_KEY\"\n```\n\nDeploy the vLLM server:\n\n```bash\nmodal deploy modal_vllm.py\n```\n\nModal returns an HTTPS endpoint after deployme"
  },
  {
    "repo_id": "build-small-hackathon/hatchimera",
    "title": "Hatchimera",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/hatchimera",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "hatchimera",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Hatchimera - Voxel Pet Fusion\nemoji: 🧬\ncolorFrom: purple\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.15.2\npython_version: \"3.12\"\napp_file: app.py\nshort_description: Draw voxel pets, splice two, breed a family tree\npinned: false\nlicense: mit\ntags:\n  - track:wood\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Hatchimera — Voxel Pet Fusion\n\n**Hatchimera = hatch a chimera — draw two voxel pets, splice them, breed\nsomething new.**\n\nDraw voxel pets from a sentence, splice two into a chimera no menu could give\nyou, and breed a whole family tree. The model does the drawing. Hatchimera is\nthe toy that doesn't exist without it.\n\n[![Watch the demo](https://img.youtube.com/vi/CZ5-xUl1l-M/hqdefault.jpg)](https://www.youtube.com/watch?v=CZ5-xUl1l-M)\n\n## The hook\n\nPick two voxel creatures, hit **Splice**, and watch **Gemma 4 12B** recombine\ntheir box geometry into a chimera no menu could give you: one parent's body, the\nother's signature feature, plus a mutation it makes up on the spot. The newborn\nlands on a 3D stage and joins the family tree, where it can breed again. 
You can't\nreproduce it from a dropdown, and it *is* the main thing on screen.\n\n## How it works\n\nOne moment is the model — the splice; everything around it is deterministic code.\n\n| Layer | What runs |\n| --- | --- |\n| **Splice** | `fuse_creatures` → one Gemma 4 12B call merges two box layouts into a chimeric child — the only model call in the app |\n| **Draw / Tweak** | **model-free**: a keyword picks a reference body template (`pick_exemplar`), and `edit_parser` maps \"add two horns\" onto a ~100-part catalog that `assemble_part` snaps on — instant, no GPU |\n| **Render** | a Three.js voxelizer turns the box layout into an `InstancedMesh` stage, an animated pedigree tree, and a Figma-style lab canvas |\n| **Runtime** | the model runs through **llama.cpp** (`llama-cpp-python`), on **ZeroGPU** on the Space; no cloud APIs |\n\nThe split is deliberate: the model is load-bearing where it earns its tokens —\nrecombining two creatures into something new — and the predictable parts (a\nstarter body, snapping on catalog parts) stay deterministic on the CPU. If\ninference is unavailable or returns junk, the splice falls back to a deterministic\nbox-merge, so the demo never crashes; it just gets less surprising.\n\n## Run it\n\n```bash\npip install -r requirements.txt\n\n# Fake runtime — no model, instant; for UI / interaction work\nBUDDY_FORCE_FAKE_RUNTIME=1 python app.py     # or: scripts/start-local.sh\n\n# Real model — Gemma 4 12B through llama.cpp\nscripts/start-local-real-model.sh\n```\n\nBoth scripts bind `0.0.0.0:7860`, auto-pick a free port, and honor `PORT=` /\n`HOST=`. Tests: `python -m pytest -q`.\n\nReal-time real-model inference needs a GPU — on the Space that's ZeroGPU. On a\nplain CPU the CUDA wheel can't initialize, so every model action silently falls\nback to the deterministic path (still playable, just templated). Gemma 4 12B is\nheavy to run on a local CPU.\n\n## How to play\n\n1. 
**Landing** → **Quick Start** drops you into the Lab with two random starter\n   parents on the tree; **✏️ Build from scratch** instead opens an empty bench to\n   describe both parents yourself (a fresh family).\n2. **Lab** (the pedigree tree) → tap two creatures to stage them as A / B,\n   `🔍` inspects one in live 3D, or **✏️ Build from scratch** adds a fresh buddy.\n   Staging two opens the Splice Bench.\n3. **Splice Bench** → tweak each side in its box (or `🎲` for a random buddy) —\n   tweaks snap catalog parts on instantly, model-free; **Splice!** is the one\n   model call that breeds the two into a child.\n4. **Reveal** → the newborn appears beside its parents and joins the family tree.\n5. Keep breeding — every child stays in the tree, ready to be staged again.\n\n## The model, and how we got here\n\nHatchimera started as the *opposite* of what it is now, and the rewrite is the\nwhole story.\n\n**v1 — recipe-level on a small model.** The first design had the model pick from a\nclosed vocabulary (archetype + parts + palette + mutation) and let a deterministic\nvoxelizer build the geometry. The bet: small models are weak at spatial reasoning,\nso don't ask them to draw; ask them to *choose*. It kept a 3B model reliable, but\nit could never draw \"five arms\" or \"a house on its head\". The AI felt like a\ngarnish.\n\n**The bug that hid every model.** For a long stretch, tweaks looked like they\nignored the model. The cause wasn't the model. It was `response_format`.\nllama-cpp-python honors `{\"type\": \"json_object\", \"schema\": …}` and **silently\nignores** the OpenAI-style `{\"type\": \"json_schema\"}`. With the wrong key the model\nwas completely unconstrained, returned malformed JSON, and every edit fell through\nto the deterministic fallback. Fixing the key compiles the schema to a GBNF\ngrammar, and only then is any model's real capability visible. Lesson: measure\nmodel quality *after* grammar enforcement works, never before.\n\n**The spike.** To"
  },
  {
    "repo_id": "build-small-hackathon/mumbai-local",
    "title": "Mumbai Local",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/mumbai-local",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "mumbai",
      "local",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Mumbai Local\nemoji: 🚆\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.10.13'\napp_file: app.py\nstartup_duration_timeout: 1h\npinned: false\nshort_description: An AI runs Mumbai's Western Line. You are the chaos goblin.\ntags:\n  - track:wood\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# 🚆 Mumbai Local\n\n> An AI runs Mumbai's Western Railway through a monsoon evening. **You are the chaos goblin.**\n\n🎥 **[Watch the demo video](https://huggingface.co/datasets/geekwrestler/mumbai-local-demo/resolve/main/mumbai-local-demo.mp4)** · 📝 **[Read the build blog](https://huggingface.co/blog/build-small-hackathon/mumbai-local)** · 💬 **[Post on LinkedIn](https://www.linkedin.com/posts/ashishchalke_buildsmallhackathon-pixijs-nvidia-ugcPost-7471791610826182656-1AL4/)**\n\n## The premise\n\nThe *Thousand Token Wood* track of the **Build Small Hackathon** asks one question: *what happens\nwhen AI runs real systems?* Build an app where the AI is **load-bearing** — not a chatbot bolted on\nthe side, but the thing actually making the decisions.\n\n**Mumbai Local** answers it by flipping the tycoon genre on its head. If you grew up on\nRollercoaster Tycoon or Airline Tycoon, you know the drill — except here **the AI is the player.**\nA small NVIDIA **Nemotron-3-Nano (4B)** dispatcher is responsible for keeping Mumbai's lifeline —\nthe Western Railway local train system — running through a packed monsoon evening.\n\n**You are the chaos goblin.** Throw cows onto the tracks, fail the signals, conjure festival crowds,\nwave VIP specials through. 
Then night falls, the monsoon hits, and the stations jam in **Peak\nRush.** The AI has to survive **20 rounds** of the pandemonium you unleash.\n\nDefeat the dispatcher, young Padawan — or it's AI-pocalypse for all of us.\n\n## Why a train game?\n\nCritical systems — like the Mumbai local — run on people with deep, hard-won *systems knowledge*:\nthe ones who build the processes that bring order to chaos. They're rare, and the world over these\nsystems struggle to replace them. AI offers a new path: absorb that knowledge and take the critical\ncalls. It comes with trade-offs — the AI may never match the brilliance of the people who taught it,\nbut it will beat mediocre. And then a harder question waits: when the system-builders are gone and\nno one new bothers to learn the system, at the critical juncture — **who audits the AI?** This game\nis a small, playable way to sit with that question.\n\n## How to play\n\n- Each round you spend **chaos energy** to drop a card on a station: Monsoon Flood, Signal Failure,\n  Track Cow, VIP Special, Festival Crowd.\n- The dispatcher then takes **one turn** — reading the whole network and picking a handful of legal\n  moves (hold / reroute / skip trains, deploy police · cleaners · cow-teams · signal-engineers, make\n  announcements).\n- Watch the meters — **anger, safety, crowd pressure.** Tip any over the edge (or lock the network)\n  and **you win.** If the AI survives to round 20, **it wins** — graded S/A/B/C on how calm it kept\n  the city.\n- The festival unlocks around round 13; from round 16, **Peak Rush** bites — heavier inflow, faster\n  anger, persistent rain.\n\n## How it works (the tech)\n\nThe AI is load-bearing by design. A deterministic Python rules engine owns the map, train physics,\ncrowds and scoring; **the LLM only selects from the legal actions the engine offers each turn.**\nThere's no hand-written fallback policy: if the model fails, the dispatch fails. The world continues\nto run... 
unmanaged.\n\n- **Dispatcher — NVIDIA Nemotron-3-Nano-4B on ZeroGPU.** The ≤4B \"Intern\" dispatcher runs\n  server-side via `transformers` (native `NemotronH` + Hugging Face `kernels` for the fast\n  Mamba/Triton path) inside a `@spaces.GPU` function. **ZeroGPU** dynamically attaches an NVIDIA GPU\n  for each turn and releases it — so the model is real, fully on-GPU, and costs nothing at idle. The\n  game is **turn-based**, so nothing ever blocks on model latency.\n- **Voice — VoxCPM2 (MiniCPM-family TTS).** The dispatcher's once-a-turn platform announcement can\n  be spoken aloud in alternating Mumbai railway-announcer voices, synthesized async on ZeroGPU and\n  mutable from the 🔊 PA button — for fun. *(Optional; off by default in this build.)*\n- **UI — Gradio Server + a custom PixiJS front end.** Not a stock Gradio layout: a React + PixiJS\n  pixel-art SPA renders the entire Western Line, mounted on a Gradio-SDK `gr.Server` that also serves\n  the authoritative turn API. The front end is a **pure renderer** — every rule, RNG and balance\n  number lives in Python.\n- **Assets — generated with ChatGPT & Codex.** The background plate, station / train / incident\n  sprites and the announcement voices were generated via ChatGPT and Codex; a few ambient and voice\n  clips are sourced from the internet.\n- **Backend — built with Claude Code.** Fully self-contained on the Space; no external API calls.\n\n## A balancing curiosity\n\nBa"
  },
  {
    "repo_id": "build-small-hackathon/small-functional-movement-screening",
    "title": "Small Functional Movement Screening",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/small-functional-movement-screening",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "functional",
      "movement",
      "screening",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: FormScout\nemoji: 📈\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\napp_file: app.py\npinned: true\nlicense: apache-2.0\nshort_description: FMS video scoring — movement screen aid\nthumbnail: >-\n  https://cdn-uploads.huggingface.co/production/uploads/6632220a9163a52438fbd1b2/1PbdV1pyKOpyY58LIh9_N.png\nsdk_version: 6.18.0\nmodels:\n  - Qwen/Qwen3-VL-8B-Instruct-GGUF\n  - YOLO26\ntags:\n  - backyard-ai\n  - off-the-grid\n  - llama-champion\n  - track:backyard\n  - achievement:offgrid\n  - achievement:llama\n---\n\n# FormScout\n\nFMS (Functional Movement Screen) scoring pipeline — a screening aid that scores movement videos 0–3 per test with a written rationale and annotated overlay.\n\n**⚠️ Screening aid — not a diagnosis. Pain or clearing tests require a clinician.**\n\n# Submission details\n\n## Demo\n- [YouTube](https://youtu.be/MvuOyj6mMCA)\n\n## Social media posts\n- [Instagram](https://www.instagram.com/p/DZkwSenDcvv/?utm_source=ig_web_copy_link)\n- [LinkedIn](https://www.linkedin.com/posts/csaba-boly%C3%B3s-00a11767_hackathon-movement-screening-activity-7472141949278543872-Q_DI?utm_source=share&utm_medium=member_desktop&rcm=ACoAAA4TvEoBG6xaPCej-4MULwosVpspO1XW__c)\n\n## Teammembers\n - [ajakab](https://huggingface.co/ajakab)\n - [BladeSzaSza](https://huggingface.co/BladeSzaSza)\n\n\n\n\n## Running locally\n\n### 1. Clone and install\n\n```bash\ngit clone https://huggingface.co/silas-therapy/small-functional-movement-screening\ncd small-functional-movement-screening\npython3 -m venv .venv && source .venv/bin/activate\npip install -r requirements.txt\n```\n\n### 2. Start the VLM judge (optional but recommended)\n\nThe judge uses Qwen3-VL-8B-Instruct via llama.cpp. 
Without it the app falls back to the deterministic rubric score — fully functional, no GPU needed.\n\n```bash\n# Install llama.cpp once\nbrew install llama.cpp\n\n# Download the model (one-time, ~6 GB)\npython3 -c \"\nfrom huggingface_hub import hf_hub_download\nfor f in ['Qwen3VL-8B-Instruct-Q4_K_M.gguf', 'mmproj-Qwen3VL-8B-Instruct-F16.gguf']:\n    hf_hub_download('Qwen/Qwen3-VL-8B-Instruct-GGUF', f, local_dir='checkpoints/qwen3-vl')\n\"\n\n# Start the server (keep this terminal open)\n./scripts/serve_judge.sh\n```\n\nTo use a fine-tuned GGUF instead of the default:\n```bash\nFORMSCOUT_JUDGE_GGUF=/path/to/finetuned.gguf ./scripts/serve_judge.sh\n```\n\n### 3. Launch the Gradio app\n\n```bash\npython3 app.py\n# → http://127.0.0.1:7860\n```\n\nUpload a video, select the FMS test from the dropdown, and click **Analyze**.\n\n### 4. Headless pipeline (no Gradio)\n\n```bash\npython3 -m formscout.run sample.mp4\n```\n\n### 5. Tests\n\n```bash\npytest tests/ -v\n```\n\n### 6. Upload to Hugging Face\n\n```bash\n# Pushes source to both model repo and Space, opens a PR on each\n./scripts/hf_upload.sh\n\n# Or with a custom commit message\n./scripts/hf_upload.sh \"feat: my change\"\n```\n\n## Architecture\n\nTyped specialist agents orchestrated by a deterministic Director:\n\n```\nIngest → Pose2D → [Body3D optional] → Biomechanics → Rubric Score → [Judge] → Report\n```\n\n| Agent | Model | Status |\n|---|---|---|\n| Pose2D | YOLO26l-Pose (0.026B) + MediaPipe fallback | ✅ |\n| Body3D | SAM 3D Body DINOv3 (0.84B) | gated, off by default |\n| Judge + Classifier | Qwen3-VL-8B-Instruct via llama.cpp (8B) | ✅ |\n| Scoring Head | ST-GCN (0.03B) | Phase 3 |\n| Retrieval | Qwen3-VL-Embedding-8B (8B) | Phase 3 |\n\nSee [CLAUDE.md](CLAUDE.md) for full architecture and invariants.\n\n## Feature flags (`formscout/config.py`)\n\n| Flag | Default | Meaning |\n|---|---|---|\n| `ENABLE_JUDGE` | `True` | VLM judge via llama-server; rubric fallback when server is down |\n| `ENABLE_3D` | `False` 
| SAM 3D Body — off until integrated |\n| `ENABLE_STGCN` | `False` | Phase 3 |\n| `ENABLE_RAG` | `False` | Phase 3 |\n\n## Model budget\n\n~18B params total (under 32B cap). See [MODEL_BUDGET.md](MODEL_BUDGET.md).\n\n## License\n\nApache-2.0. Built for the Build Small Hackathon (Backyard AI track)."
  },
  {
    "repo_id": "build-small-hackathon/Leskys_Shop",
    "title": "Leskys_Shop",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Leskys_Shop",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "achievement",
      "leskys_shop",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Play one round and note what made it fun.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: Lesky's Shop\nemoji: ⚗️\ncolorFrom: green\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npython_version: \"3.10\"\nshort_description: Win a board duel to rewrite rules while your opponent lies.\ntags:\n  - game\n  - llm\n  - gradio\n  - strategy\n  - hackathon\n  - track:wood\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:sharing\nfullWidth: true\n---\n\n# ⚗️ Lesky's Shop\n\n> *\"Centuries of practice and you thought YOU could beat ME? Delightful.\"*\n\nA hand-drawn, AI-powered board game built for the **Build Small Hackathon** — **Thousand Token Wood** track.\n\nYou duel the ancient goblin **Lesky** across a shifting board. Win a round and you earn the right to **write a law in plain English**. Type anything — *\"corners are worth double\"*, *\"nobody can place in the center\"*, *\"clashes split the value\"* — and the LLM translates it into a structured rule that reshapes the board on the spot.\n\nBut Lesky lies. He'll announce one law and enact another. Catch him in deception and his guard drops when it matters most. Expose enough lies and The Monster you face next will be fighting at a disadvantage.\n\n---\n\n## Tutorial, Trailer and Post:\n\nCheck out the live demos of this project and the post requiered:\n\n* [Tutorial: LESKY'S SHOP - GAME DEMO](https://youtu.be/QWsAsXGaKK0).\n\n* [Game Trailer](https://youtu.be/Ya6GjUWZUSI).\n\n* [X Post](https://x.com/i/status/2066596209398051025).\n\n---\n\n## Why AI is essential here\n\nThis game *cannot exist* without a language model:\n\n- **Free-text law writing** — the core mechanic. The player types a rule in natural language; the LLM translates it into a validated game effect. There is no dropdown, no preset list. 
Any legal rule the player can describe, the game can enforce.\n- **Lesky lies in character** — the goblin announces fake law descriptions generated by the LLM, personalized to the game state and round number.\n- **In-character chat** — argue with Lesky mid-game. He knows the current board, the scroll, the scores, and will be insufferably smug about it.\n- **Adaptive taunts** — Lesky reacts differently depending on whether he won or lost the round, pre-generated in the background so there's no wait.\n- **Fully voiced cast** — every character speaks using [Chatterbox TTS](https://github.com/resemble-ai/chatterbox) (via Modal): Lesky taunts and welcomes you in his own voice, The Monster growls through both its rounds, and if you beat them both… a surprise awaits, also fully voiced. Audio is cached on first generation so repeat lines play instantly.\n\nWithout the LLM the game falls back to a fixed law list and an honest goblin — playable, but a completely different (lesser) experience.\n\n---\n\n## How to play\n\n### The board\nEach round, you and Lesky place chips on a grid simultaneously. Every cell has a hidden value. The player whose chips control more total value wins the round.\n\n### The Scroll\nUp to **3 laws** are active at any time, stacked on a shared scroll. They affect cell values, placement, adjacency bonuses, and clash outcomes — and they interact. Order matters: `+1 then ×2` ≠ `×2 then +1`.\n\n### Win a round → write a law\nType any rule in plain English. The LLM parses it and the engine validates it. If your law conflicts with one already on the scroll (e.g. two clash-resolution rules), the old one is automatically repealed.\n\n### Catch Lesky lying\nAfter every round you can investigate one law. If Lesky tampered with it — whether it's his own law, a Forest law, or even one of yours — you catch him. 
Each lie you expose reduces **The Monster's** accuracy by 7% in the final rounds.\n\n### Cleverness\nSecret missions reward bonus cleverness each round (e.g. *\"own 2 corners\"*, *\"end with no adjacent chips\"*). Spend it in the shop between fights: scout the opponent's mission hint, reveal a hidden cell, or repeal a law you don't like.\n\n**Final score** = total board points across all rounds + (total cleverness earned × 3 bonus points)\n\n### The campaign\n| Fight | Opponent | Board | Rounds |\n|---|---|---|---|\n| 1 | Lesky the Goblin | 3×3 | 5 |\n| 2 | The Monster | 4×4 | 2 |\n\nBeat Lesky to claim his treasure and face The Monster. The lies you caught against Lesky carry over — each one makes The Monster 7% less precise.\n\n---\n\n## Tech stack\n\n| | |\n|---|---|\n| **UI** | Gradio 6, custom CSS, hand-drawn PNG/GIF art |\n| **LLM** | [nvidia/Nemotron-Mini-4B-Instruct](https://huggingface.co/nvidia/Nemotron-Mini-4B-Instruct) — **only 4B parameters**, hosted via [Modal](https://modal.com) |\n| **TTS** | [Chatterbox](https://github.com/resemble-ai/chatterbox) (0.5B parameters) via Modal — distinct voice per character, disk-cached |\n| **Game engine** | Pure Python — no LLM touches the engine directly; the LLM produces effect dicts that the engine validates and rejects if invalid |\n| **Model role** | Law translation · Goblin taunts · In-character chat · Fake law generation · Voice synthesis |\n\nThe engine is a clean arbiter: it validates every LLM output against a strict effect "
  },
  {
    "repo_id": "build-small-hackathon/odoo_icon_generator",
    "title": "Odoo_Icon_Generator",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/odoo_icon_generator",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "odoo_icon_generator",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Odoo Icon Generator\nemoji: 🖼\ncolorFrom: purple\ncolorTo: red\nsdk: gradio\nsdk_version: 6.5.1\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Odoo Icon Generator\ntags:\n  - track:backyard\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n🔗 Check the demo on my LinkedIn Post:\nhttps://www.linkedin.com/posts/stmayna_my-first-hackathon-project-is-live-i-ugcPost-7472063877426458624-RdIB\n\n🔗 Twitter post:\nhttps://x.com/maynanotes/status/2066303787481088400?s=61\n\n🔗 Hackathon:\nhttps://huggingface.co/build-small-hackathon\n\n🔗 Live Demo:\nhttps://huggingface.co/spaces/Stmayna/odoo_icon_generator\n\n🔗 Source Code:\nhttps://github.com/stmayna/huggingfacehackaton\n\n📝 Full Story:\nhttps://open.substack.com/pub/sitimayna/p/my-first-hackathon-building-an-odoo"
  },
  {
    "repo_id": "build-small-hackathon/tiny-court",
    "title": "Tiny Court",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/tiny-court",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "achievement",
      "track",
      "tiny",
      "court",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Tiny Court of Everyday Crimes\nemoji: ⚖️\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 6.16.0\napp_file: main.py\npython_version: \"3.12\"\npinned: true\nlicense: apache-2.0\nshort_description: Multimodal comedy courtroom — text, photo & voice on trial.\ndatasets:\n  - build-small-hackathon/tiny-court-traces\ntags:\n  - build-small-hackathon\n  - gradio\n  - custom-ui\n  - comedy\n  - agent\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n<!-- The block above is Hugging Face Space metadata (must stay at the very top).\n     This Space runs the comedy-courtroom UI on the REMOTE backend: four small\n     open models served on Modal (judge / vision / formatter / ASR) reached over\n     OpenAI-compatible endpoints. No GPU runs on-Space. Backend selection + the\n     Modal endpoint URLs are Space *variables*; the Modal proxy token is a Space\n     *secret* — never commit tokens. See the dev repo's docs/hf-space-configuration. -->\n\n# Tiny Court of Everyday Crimes\n\n> Ace Attorney for stupid everyday problems.\n\nAn interactive comedy courtroom where users put small, ridiculous, everyday\nconflicts on trial — snack thefts, suspicious pet behavior, object betrayals —\nand receive charges, evidence, witnesses, objections, a verdict, and an absurdly\nharmless sentence. 
A **Gradio app** backed by **four small open models served on\nModal**, accepting **text, photo, and voice** evidence.\n\nThis is a comedy experience, **not** legal advice.\n\nBuilt for the **Build Small Hackathon** (Gradio × Hugging Face).\n\n* **Live app:** https://huggingface.co/spaces/build-small-hackathon/tiny-court\n* **Video demo:** [a narrated walkthrough of a full case](https://youtu.be/b5l3SRioTkE)\n* **Open traces:** https://huggingface.co/datasets/build-small-hackathon/tiny-court-traces\n* **Field notes / blog post:** https://johnz86.github.io/blog/tiny-court-and-the-urge-to-outsource-judgment.html\n* **Narrated overview (video):** [Tiny Court and the urge to outsource judgment](https://youtu.be/sRP8FBWoPXg)\n\n## Watch a trial\n\n▶️ **[Watch the narrated demo on YouTube](https://youtu.be/b5l3SRioTkE)** — a full case, start to finish.\n\n[![Tiny Court — a full trial walkthrough](assets/promo-walkthrough.webp)](https://youtu.be/b5l3SRioTkE)\n\n*The loop above is a sped-up run of one full trial — complaint → evidence → witness\n→ cross-examine → twist → verdict → leniency appeal → the shareable **Court\nRecord** — recorded against the live app on its real four-model backend.*\n([download the MP4](https://huggingface.co/spaces/build-small-hackathon/tiny-court/resolve/main/assets/promo-walkthrough.mp4))\n\n## How it works\n\nA landing page (rotating hero docket + **Quick Trial** / **Full Trial** CTAs)\nopens into **one growing courtroom conversation**: you file a complaint and the\ncourt reacts; optional *moves* — submit evidence, call a witness, cross-examine,\nobject, add a twist — append to the same transcript. 
**Deliver the Verdict** is\navailable from the first message and swaps to a printed ruling; you can **appeal**\nback into the conversation, then finalize to a shareable **Court Record** (with a\none-click PNG export).\n\nThe verdict is **Python-owned and deterministic** — the model proposes meter\ndeltas, the engine resolves the band — so interactions provably move the needle.\nSafety is gated **both ways**: a layered gate on input and an offline scrub on\ngenerated output.\n\n## The model stack\n\nGeneration runs through a pluggable seam. In production the Space talks to **four\nsmall open models served on Modal** over OpenAI-compatible endpoints — each\nmodality is *perceived into text* before the judge reasons over it, so a weak\nperception model never owns the verdict:\n\n| Role | Model | Served via | Sponsor |\n|---|---|---|---|\n| **Judge** (reasoning + fields) | NVIDIA Nemotron-3-Nano-4B | llama.cpp on Modal | NVIDIA |\n| **Vision** (photo evidence) | OpenBMB MiniCPM-V-4.6 | llama.cpp on Modal | OpenBMB |\n| **Formatter** (schema repair) | JetBrains Mellum2-12B-A2.5B | llama.cpp on Modal | JetBrains |\n| **ASR** (voice evidence) | NVIDIA Parakeet-TDT-0.6B | NeMo on Modal | NVIDIA |\n\nAll models are **under 32B**; the judge/vision/formatter run on **llama.cpp**. The\napp degrades gracefully — if Modal is unreachable it falls back to a deterministic\ncanned backend so the UI always runs.\n\n## Agent traces\n\nEvery model call is captured as a redacted JSONL **agent trace** (the deterministic\nverdict meters snapshotted alongside each call) and published to\n**[build-small-hackathon/tiny-court-traces](https://huggingface.co/datasets/build-small-hackathon/tiny-court-traces)**\n— no raw images, prompts, or tokens.\n\n## Field notes\n\nA short write-up on building Tiny Court — and why a comedy courtroom is a careful\nframe for *not* outsourcing judgment to a model:\n**[Tiny Court and the urge to outsource judgment](https://johnz86.github.io/blo"
  },
  {
    "repo_id": "build-small-hackathon/noir-verdict",
    "title": "Noir Verdict",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/noir-verdict",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "noir",
      "verdict",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Noir Verdict\nemoji: 🕵️\ncolorFrom: red\ncolorTo: gray\nsdk: gradio\nsdk_version: 5.5.0\npython_version: '3.12'\napp_file: app.py\npinned: false\ntags:\n- track:wood\n- sponsor:nvidia\n- sponsor:modal\n- achievement:offgrid\n- achievement:welltuned\n- achievement:offbrand\n- achievement:llama\n- achievement:sharing\n- achievement:fieldnotes\nlicense: mit\n---\n\n# Neon Verdict\n\n> *Ten questions. Four suspects. One bad night.*\n\nA hard-boiled noir detective game powered by a fine-tuned **Nemotron 3 Nano 4B** running entirely in-process via `llama-cpp-python`. No inference providers. No cloud calls. Just a typewriter, a desk lamp, and 498 procedurally-built cases waiting for your verdict.\n\nBuilt for [Hugging Face Build Small Hackathon](https://huggingface.co/build-small-hackathon) — Track 2 (Thousand Token Wood).\n\n---\n\n## Play\n\n| | |\n|---|---|\n| **Space** | [build-small-hackathon/noir-verdict](https://huggingface.co/spaces/build-small-hackathon/noir-verdict) |\n| **Blog** | [sankalphs.blogspot.com/2026/06/noir-verdict.html](https://sankalphs.blogspot.com/2026/06/noir-verdict.html) |\n| **Video** | [youtu.be/FcJb2wRWwjY](https://youtu.be/FcJb2wRWwjY) |\n| **Social** | [x.com/sankalphs/status/2066603405787951211](https://x.com/sankalphs/status/2066603405787951211) |\n| **Traces dataset** | [sankalphs/noir-verdict-traces](https://huggingface.co/datasets/sankalphs/noir-verdict-traces) |\n\n## Models\n\n| Artifact | Size | Link |\n|---|---|---|\n| LoRA adapter | 40.5 MB | [sankalphs/noir-verdict-nemotron-4b-lora](https://huggingface.co/sankalphs/noir-verdict-nemotron-4b-lora) |\n| Merged BF16 | 7.95 GB | [sankalphs/noir-verdict-nemotron-4b-merged](https://huggingface.co/sankalphs/noir-verdict-nemotron-4b-merged) |\n| Q4_K_M GGUF | 2.84 GB | [sankalphs/noir-verdict-nemotron-4b-gguf](https://huggingface.co/sankalphs/noir-verdict-nemotron-4b-gguf) |\n\n## Architecture\n\n```\nfrontend/         Three.js noir scene + fetch() API 
calls\n  └─> app.py      gradio.Server + @app.api() routes\n        └─> engine/\n              ├─ cases.py           498 procedurally-generated cases\n              ├─ state.py           Session dataclass, JSON round-trip\n              ├─ scoring.py         50/20/15/10/5 deterministic formula\n              ├─ contradictions.py  Token overlap + alibi flip detection\n              ├─ prompts.py         Nemotron 3 chat template\n              └─ llm.py             llama-cpp-python, in-process GGUF\n```\n\nThe model writes the drama. The engine owns the score.\n\n| Points | Rule |\n|---|---|\n| +50 | Correct suspect charged |\n| +20 | Motive named in reasoning |\n| +15 | Stolen item named in reasoning |\n| +10 | Efficiency bonus `(10 - questions_used)` |\n| +5 | At least one contradiction caught |\n\n## Run locally\n\n```bash\npip install -r requirements.txt\npython app.py\n# → http://localhost:7860\n```\n\nFirst cold start downloads the 2.84 GB GGUF from the Hub.\n\n```bash\npython engine/smoke.py    # validate engine logic offline\n```\n\n## Bonus quests\n\n| Quest | How |\n|---|---|\n| Off the Grid | In-process `llama-cpp-python`, no inference providers |\n| Well-Tuned | Fine-tuned Nemotron 3 Nano 4B published to the org |\n| Off-Brand | `gradio.Server` + custom Three.js frontend |\n| Llama Champion | Model runs through llama.cpp at inference |\n| Sharing is Caring | Open traces dataset on the Hub |\n| NVIDIA Nemotron Quest | Used Nemotron 3 Nano end-to-end |\n| Modal Awards | Fine-tune + ship + dataset push via Modal |\n| Tiny Titan | 4B parameters, well under the 4B cap |\n\n## Deploy\n\n```bash\n# Push Space to sankalphs (via Modal)\nmodal run ship_space_modal.py\n\n# Fine-tune on Modal (A10G)\nmodal run train/modal_finetune.py --max-steps 240 --publish\n\n# Push traces dataset\nmodal run train/push_traces.py --n-seed 20\n```\n\n## Credits\n\n- **3D engine**: Three.js\n- **Fine-tune**: Modal + NVIDIA A10G + Unsloth + llama.cpp\n- **Characters**: Quaternius CC0 
model packs (fallback to Three.js primitives)\n- **Hackathon**: [Hugging Face Build Small](https://huggingface.co/build-small-hackathon)\n\n---\n\n*\"Everybody has a tell. You just need ten questions to find it.\"*\n\n## License\n\nMIT. See [LICENSE](./LICENSE) for details."
  },
  {
    "repo_id": "build-small-hackathon/semantique",
    "title": "Semantique",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/semantique",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "semantique",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Semantique\nemoji: 🐇\ncolorFrom: gray\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: How well can you prompt a small AI?\nthumbnail: https://huggingface.co/spaces/benjosaur/semantique/resolve/main/static/social-card.png\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# Semantique 🐇\n\n**Welcome to Semantique**, a short adventure game where you test your prompting chops.\n\nHop around constructing prompts and see if you can tease out all the target words from **MiniCPM3-4B**. But be careful! Your context window can only stretch so far...\n\nUse the hint button if you get too stuck! 💡\n\n▶ **[Watch the demo](https://youtu.be/bnMM3cq900w)** · 📣 **[Read the LinkedIn post](https://www.linkedin.com/posts/ben-blaker-085108175_semantique-a-masochistic-adventure-game-activity-7472415885388070912-YruU)** · 📝 **[Lessons Learned](https://benjosaur.substack.com/p/859bbc88-1be8-4070-ba06-cc3b29b80966)**\n\n![The opening board: hop the doodle across word tiles and each one adds its word to your sentence for the AI to judge.](docs/original-board.png)\n\nKeep an eye out for Easter eggs on your adventure.\n\n![An \"Error 404\" pop-up warning that the AI just deleted everything.](docs/bonus-board.png)\n\n## Technical Stuff\n\nOnce constructed, prompts are sent to an open-source implementation of **structured outputs**, hosted on Modal. The system prompt names the board's targets and asks for the one most similar to your sentence. A single forward pass then scores how likely each candidate label is as the answer. The next-token distribution is masked to the board's labels and softmax-renormalized. 
That constrained distribution is what the verdict card's probability bars show.\n\nBuilt for the **Build Small Hackathon**.\n\n## AI Disclaimer\n- I handcrafted all tilesets and came up with the core idea for the game and adventure.\n- I gave the high-level art style direction, but all implementation assets were generated by Fable 5/Opus 4.8, with the exception of the background music: Cipher — **Kevin MacLeod**; Corporate Glitch — [Dmitrii Kolesnikov](https://pixabay.com/users/the_mountain-3616498/?utm_source=link-attribution&utm_medium=referral&utm_campaign=music&utm_content=171582); Sweet Country Farm Music Full — [catch22music](https://pixabay.com/users/catch22music-43977658/?utm_source=link-attribution&utm_medium=referral&utm_campaign=music&utm_content=354815)"
  },
  {
    "repo_id": "build-small-hackathon/sema-dyslexic-assistant",
    "title": "Sema Dyslexic Assistant",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/sema-dyslexic-assistant",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sema",
      "dyslexic",
      "assistant",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Sema - Read, Write, and Thrive with Dyslexia\nemoji: 📖\ncolorFrom: yellow\ncolorTo: indigo\nsdk: gradio\nsdk_version: \"6.15.2\"\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: Dyslexia-friendly reading, writing and screen help\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - best-demo\n  - achievement:fieldnotes\n---\n# Sema — A reading companion for people with dyslexia\n**Sema** helps people who find reading hard. It was built for my brother, who has dyslexia, and for anyone who struggles with everyday text — a letter, a bill, a form, a screen.\nMost AI today lives in giant data centres. Sema is the opposite: it runs entirely on small, open models (all well under 32B parameters), the kind you could run on hardware you own. Small models, real help.\n## What it does — three tools\n\n**📖 ReadEasy — helps you *read*.**\nPaste any text and Sema will *simplify* it, *summarise* it, or *read the original aloud*. The result is shown in a dyslexia-friendly font (OpenDyslexic), one sentence per line, with generous spacing, and read aloud so you can listen along. Works in **English and Kiswahili**.\n\n**🎤 VoiceOut — helps you *write*.**\nSpeak what you want to say, and Sema writes it for you — an email to a lawyer, a WhatsApp to your mum, a to-do list, a complaint, whatever you ask. It understands who the message is for and writes it in the right tone, ready to copy and send. You pick your language before recording so it hears you correctly.\n\n**📸 SnapGuide — helps you *navigate*.**\nTake a photo of a screen, form, sign, or bill, and Sema gives you simple step-by-step help — what it is, what to do, and exactly what to tap or look for (highlighted so it's easy to spot). You can type your question or **ask by voice**. Upload the next screen and it continues the same task. 
Built for the moments when reading a screen is the thing standing in your way.\n## Why this fits \"Build Small\"\nEvery model Sema uses is open and under the 32B cap:\n| Tool | Model | Role |\n|------|-------|------|\n| ReadEasy / VoiceOut | Qwen2.5-7B-Instruct | Text understanding & writing (multilingual) |\n| SnapGuide | **MiniCPM-V (OpenBMB)** | Image, OCR & document understanding |\n| VoiceOut speech-in | Whisper-large-v3 | Speech recognition |\nSnapGuide is built around **MiniCPM-V from OpenBMB** — the recommended small vision model for image, OCR and document understanding — which is exactly what helping someone read a screen or form requires. A Qwen3-VL vision model is wired in as an automatic fallback so the app keeps working even if the MiniCPM endpoint is busy.\nThe whole app is accessibility-first: dyslexia-friendly font and spacing, audio for everything, voice input, plain-language output, and Kiswahili support so it reaches people that English-only tools leave out.\n## Tracks & prizes this targets\n- **Backyard AI** — a practical tool that improves daily life for someone close to me.\n- **Best MiniCPM Build (OpenBMB)** — MiniCPM-V is the core of SnapGuide's image understanding.\n- **Best Demo** — see the demo video below.\n## Live App\n▶️ https://build-small-hackathon-sema-dyslexic-assistant.hf.space/\n## Demo video\n▶️ https://youtu.be/6VZcPdaC75s\n## Social post\n🔗 https://x.com/JamesGo01862630/status/2065063242280423869?s=20\n## Blog post\n🔗 https://huggingface.co/blog/build-small-hackathon/otienojturingcom\n## How it's built\nA single Gradio app (`app.py`). Models are called through open APIs (OpenBMB for MiniCPM-V, Hugging Face inference for Qwen and Whisper). Text-to-speech uses gTTS. No data is stored. A `FEATURE_FLAGS` block lets any single feature be toggled off without affecting the rest.\nBuilt with care, for my brother."
  },
  {
    "repo_id": "build-small-hackathon/exam-panic-rescue",
    "title": "Exam Panic Rescue",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/exam-panic-rescue",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "openbmb",
      "exam",
      "panic",
      "rescue"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Test it with a beginner question and review the answer.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: Exam Panic Rescue\nemoji: 🆘\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.0.1\napp_file: app.py\npython_version: 3.10.13\nlicense: mit\nshort_description: Last-minute exam rescue on small models (≤32B)\ntags:\n  - backyard-ai\n  - build-small-hackathon\n  - education\n  - study\n  - openbmb\n  - minicpm\n  - nemotron\n  - zerogpu\n  - privacy-first\n  - local-capable\n  - llama-cpp\n  - tiny-titan\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\nmodels:\n  - openbmb/MiniCPM-V-4.6\n  - nvidia/Nemotron-Mini-4B-Instruct\n  - openbmb/MiniCPM4-0.5B-QAT-Int4-GGUF\n---\n\n# Exam Panic Rescue\n\nExam Panic Rescue turns a student's last-minute panic dump into a survival plan, drill deck, triage clock, panic-pattern readout, proof target, final sheet, live coach, and study receipt.\n\nThe first target workflow is a student who has an exam soon, feels stuck, and cannot decide what to study first. The app is intentionally narrow: one stressed student, one exam, one time box, one final sheet.\n\nThe app includes four clearly labeled sample scenarios for quick evaluation: biology definitions, physics numericals, history long answers, and math MCQ traps. They are not claimed as real-user data; they are the same public readiness cases used by the local smoke test and published as [data/readiness_cases.jsonl](data/readiness_cases.jsonl). A real student should replace the sample with their actual exam, topics, and time left before generating a packet.\n\nThe public UI keeps the student workflow first. The build is documented separately in [docs/build-report.md](docs/build-report.md) and the public build-trace dataset, so the product page stays focused on the student rather than on sponsor evidence.\n\n## Privacy-first positioning\n\nExam panic is personal. 
A student might paste weak topics, last-minute fear, syllabus photos, messy notes, or confidence levels they would not want stored in a public dataset.\n\nThe hosted Hugging Face Space is the public demo/evaluation version for the hackathon. It runs on Hugging Face ZeroGPU, so it should not be described as fully on-device. The app itself is designed to avoid intentional app-level persistence: normal user sessions are not written into the public trace dataset, public traces are selected and privacy-labeled, and the one real-user validation trace is anonymized and shared with consent.\n\nFor sensitive use, the stronger path is local deployment. The app can be run from this GitHub repo, including a small local model route using `openbmb/MiniCPM4-0.5B-QAT-Int4-GGUF` through the `llama.cpp` runtime on CPU. That makes the long-term product direction clear:\n\n> **Hosted Space = public demo for judging. Local small-model mode = privacy-first direction for sensitive student data.**\n\n## Build Status\n\nThe public Space is live on Hugging Face ZeroGPU at https://huggingface.co/spaces/build-small-hackathon/exam-panic-rescue and has been verified end-to-end (text, vision, the Nemotron engine, and the answer key all returning real model output).\n\nReal-user validation (Backyard AI): a final-year university Machine Learning student used the live app the day before their exam, and one of the model-written drills closely matched a question that actually appeared on the exam. The anonymized session (with consent) is published in the build-trace dataset under the `real_user` config.\n\nSubmission assets:\n\n- Demo video + social post (X thread, Backyard AI): live at https://x.com/jhahimanshu653/status/2063909355217453142\n- Build report / Field Notes (\"what I built and what I learned\"): [docs/build-report.md](docs/build-report.md) — also hosted in the org at https://huggingface.co/spaces/build-small-hackathon/exam-panic-rescue-field-notes\n- Open build traces (incl. 
the real-user session): https://huggingface.co/datasets/build-small-hackathon/exam-panic-rescue-build-trace\n\nPublic build notes and demo prep are drafted in [docs/codex-build-trace.md](docs/codex-build-trace.md) and [docs/demo-script.md](docs/demo-script.md).\n\nPublic GitHub evidence repo: https://github.com/himanshu748/exam-panic-rescue\n\nHardware note: the hackathon rule allows models up to `<=32B`. The public Space runs on Hugging Face ZeroGPU (24 GB) with `USE_LOCAL_MODEL=1`, loading **one model at a time** so it always fits in memory:\n\n- **OpenBMB MiniCPM-V 4.6** — the primary engine. It writes the rescue plan and drills, and (being a vision-language model) can read a photo of the student's syllabus directly in the same call.\n- **NVIDIA Nemotron-Mini-4B** — a selectable text-only alternate; at 4B it is the Tiny Titan (`<=4B`) path.\n- **OpenBMB MiniCPM4 0.5B (GGUF)** — an optional engine that runs through the **llama.cpp runtime** (`llama-cpp-python`) on CPU; the Llama Champion + Tiny Titan path. Its runtime note reads `Generated locally with llama-cpp-python (llama.cpp runtime), model openbmb/MiniCPM4-0."
  },
  {
    "repo_id": "build-small-hackathon/Peek-and-Seek",
    "title": "Peek And Seek",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Peek-and-Seek",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "peek",
      "seek",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Find one mechanic that could become more challenging.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: Peek & Seek\nemoji: 🎯\ncolorFrom: indigo\ncolorTo: purple\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: mit\nthumbnail: assets/poster.png\nshort_description: Two AI agents play hide-and-seek and learn between rounds\ntags:\n  - agents\n  - game\n  - pixel-art\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n<p align=\"center\">\n  <img src=\"assets/poster.png\" alt=\"Peek & Seek\" width=\"820\">\n</p>\n\n<h1 align=\"center\">🎯 PEEK &amp; SEEK</h1>\n\n<p align=\"center\"><b>Two AI agents play hide-and-seek on an arena <i>you</i> design — and learn between rounds, in context, with no retraining.</b></p>\n\n<p align=\"center\">Made with 🟠🟣 by <b>Oualid Choukrallah</b></p>\n\n<p align=\"center\">\n  <img src=\"https://img.shields.io/badge/Gradio-6.16-f97316\">\n  <img src=\"https://img.shields.io/badge/🤗-Spaces-yellow\">\n  <img src=\"https://img.shields.io/badge/OpenBMB-MiniCPM4.1--8B-8b5cf6\">\n  <img src=\"https://img.shields.io/badge/Runtime-ZeroGPU%20(local)-3b82f6\">\n  <img src=\"https://img.shields.io/badge/Track-Thousand%20Token%20Wood-2f6b32\">\n  <img src=\"https://img.shields.io/badge/Off%20the%20Grid-✅-16a34a\">\n  <img src=\"https://img.shields.io/badge/Off--Brand-✅-16a34a\">\n  <a href=\"https://huggingface.co/blog/oualidD4/peek-and-seek\"><img src=\"https://img.shields.io/badge/📓%20Field%20Notes-Read-16a34a\"></a>\n  <a href=\"https://youtu.be/u26TJjWnK10\"><img src=\"https://img.shields.io/badge/▶%20Demo-Watch-red\"></a>\n  <a href=\"https://www.linkedin.com/posts/oualid-choukr-allah-34095b1b3_im-really-excited-to-share-what-ive-been-ugcPost-7472313219404173313-bbGW/\"><img src=\"https://img.shields.io/badge/in%20LinkedIn-Post-0a66c2\"></a>\n</p>\n\n---\n\n## 📌 Submission Snapshot\n\n| | |\n|---|---|\n| **Live Space** | 🎮 
[**build-small-hackathon/Peek-and-Seek**](https://huggingface.co/spaces/build-small-hackathon/Peek-and-Seek) |\n| **Track** | 🌲 **Thousand Token Wood** — the AI *is* the game engine, not a static generator |\n| **Core model** | [`openbmb/MiniCPM4.1-8B`](https://huggingface.co/openbmb/MiniCPM4.1-8B) — 8B params, Apache-2.0 |\n| **Runtime** | Local on **ZeroGPU** — no cloud APIs, no token, no secret |\n| **The agents** | 🟠 **Spark** (the Seeker) vs 🟣 **Shade** (the Hider) |\n| **Learning** | Context-based: each agent writes itself a lesson after every round and reads it before the next |\n| **Demo video** | ▶️ [youtu.be/u26TJjWnK10](https://youtu.be/u26TJjWnK10) |\n| **Social post** | 💬 [LinkedIn — \"I'm really excited to share what I've been building…\"](https://www.linkedin.com/posts/oualid-choukr-allah-34095b1b3_im-really-excited-to-share-what-ive-been-ugcPost-7472313219404173313-bbGW/) |\n| **Field Notes** | 📓 [Peek &amp; Seek — Field Notes (HF Blog)](https://huggingface.co/blog/oualidD4/peek-and-seek) |\n| **Made by** | Oualid Choukrallah ([@oualidD4](https://huggingface.co/oualidD4)) — solo |\n\n---\n\n## 🌲 The Idea\n\nMost AI demos have amnesia. You prompt, you get an answer, you close the tab — and nothing learned, nothing carried forward.\n\n**Peek & Seek** is the opposite. It's a tiny, living arena where **two language-model agents actually play a game** — and *get better at it while you watch*.\n\nYou design the world: drop walls, scatter bushes, place **Spark** (the Seeker) and **Shade** (the Hider). Then you lock it and step back. From that moment the humans are out of the loop. Spark hunts. Shade evades. Every turn they read the board, reason about it, and commit to a move. And at the end of every round, each one **writes a one-sentence lesson to its future self** — *\"That bush was a trap, hold the south corridor next time\"* — carries it into the next round, and plays differently because of it. No fine-tuning, no gradient steps. 
Just a model, its own notes, and an arms race.\n\n> Pull the model out and Peek & Seek is an empty grid. The AI is **load-bearing by definition** — which is exactly what the Thousand Token Wood track asks for: *AI as the simulation engine.*\n\n---\n\n## 🎮 What It Does\n\nA single, heavily-customized Gradio Space — styled as a pixel arcade game, not a dashboard:\n\n- **🏗️ Build-your-arena editor** — click-to-paint walls, bushes, and the two agents' start cells on a pixel-art board (grass, bricks, leafy bushes, the real character sprites).\n- **🧠 Two reasoning agents** — Spark and Shade each take a turn-by-turn decision from a *pre-digested* view of the world (more below). Powered entirely by MiniCPM4.1-8B.\n- **📓 Context-based learning** — after each round both agents reflect on what actually happened (grounded in their move trace) and write a lesson. Those notes are pinned to the top of the next round's prompt as a **plan**, and a **per-power win/loss scoreboard** drives which tool they grab.\n- **🧰 A toolbox of powers** — first agent to reach the toolbox claims one for the round: **Shade** gets Cloak (vanish, untouchable) or Decoy (a fake sighting); **Spark** gets Lamp (+vision), Ping (reveal Shade's half of the map), or Freez"
  },
  {
    "repo_id": "build-small-hackathon/Hackathon-IA-VisualNovel",
    "title": "Hackathon Ia Visualnovel",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Hackathon-IA-VisualNovel",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "visualnovel",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Ephemeral Hearts\nemoji: 🐠\ncolorFrom: green\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\npinned: false\ntags:\n- track:wood\n- sponsor:modal\n- achievement:offgrid\n- achievement:offbrand\n- achievement:llama\n- achievement:sharing\n- achievement:fieldnotes\nlicense: apache-2.0\nthumbnail: >-\n  https://cdn-uploads.huggingface.co/production/uploads/65c10674c4dcb96c31dd84c9/aWVwpGKH963R2e81blgxO.png\n---\n\n## Team\n\n- WillHbx — https://huggingface.co/WillHbx\n- SuperPoivre13 — https://huggingface.co/SuperPoivre13\n\n# hackathon-ia-visualnovel 🍄\n\n> **Thousand Token Wood** — an AI-improvised, voice-or-text **anime visual novel**.\n> Build Small Hackathon (Chapter Two). Story, characters, and art are generated **live** by\n> small local models — nothing is pre-scripted.\n\nYou step into a wood that is being *dreamed into existence* around you. Every spirit you meet is\nconjured on the spot; every backdrop is painted the moment you arrive. Because the wood is dreamed\nby a small, slightly forgetful mind, it is whimsical and never the same twice — **the model's\nquirks aren't bugs, they're the wood dreaming.** Take the models out and there's no game, only an\nempty Gradio shell. That's what makes the AI *load-bearing*.\n\n---\n\n## 🏆 Build Small Hackathon submission\n\n**Ephemeral Hearts** is an AI-improvised anime dating visual novel: the story, every character,\nevery line of dialogue, every backdrop and sprite, and even the character voices are generated\n**live** by small local models. Nothing is pre-scripted. Five AI roles collaborate in **under 18B\nparameters total** behind one golden rule, *the model proposes, code disposes*: the LLM returns a\ntyped `DirectorOutput` (one grammar-constrained call per turn) and deterministic code is the **only**\nthing that mutates game state. 
The same `visualnovel/` package runs in three places unchanged,\nselected by env vars: fully offline on a laptop (`llama.cpp` + `diffusers`), on on-demand Modal\nGPUs, or on a ZeroGPU Space.\n\n**Tech:** Qwen3-14B (the Weaver + the Voices, shared weights) · SDXL + a fine-tuned anime LoRA (the\nPainter) · Whisper (the Ear) · Kokoro-82M (the Voice) · Gradio `gradio.Server` with a bespoke\nHTML/JS frontend · Pydantic-derived JSON grammar · `uv`.\n\n- 🎥 **Demo video:** https://youtu.be/NVDTg_oMUY4\n- 📝 **Blog / field notes:** https://huggingface.co/blog/WillHbx/ephemeral-hearts\n- 💬 **Social post:** https://www.linkedin.com/posts/lorenzo-lepoivre_ai-generativeai-llm-share-7471625802258939904-Pjx-/\n\n**Track:** Thousand Token Wood\n**Sub-tracks:** Off-Brand · Best Demo · Best Agent · Bonus Quest Champion · Judges' Wildcard\n**Badges:** 🔌 Off the Grid · 🎨 Off-Brand (Custom UI) · 🦙 Llama Champion (llama.cpp) ·\n📡 Sharing is Caring (Open-Trace) · 📓 Field Notes\n\n---\n\n## ✨ This repo runs on a fresh checkout — with **zero models**\n\nIt ships in **MOCK mode** (`VN_MOCK=1`, the default): deterministic fake LLM / painter / STT so the\n**entire loop works offline** with placeholder art. Open it, watch a turn happen, then have Claude\nCode replace each mock with a real model, one module at a time. See [`CLAUDE.md`](CLAUDE.md).\n\n```bash\nuv sync                       # installs only gradio + pydantic + pillow (seconds)\nuv run python -m visualnovel.smoke     # full loop in the terminal, no UI, no models\nuv run python app.py                   # custom VN UI  →  http://localhost:7860\nGRADIO_MVP_UI=1 uv run python app.py   # plain gr.Blocks fallback (Phase 0/1)\n```\n\n---\n\n## The cast of models (the “several AIs”)\n\nFive *roles*, four *real* model families (text, image, speech-in, speech-out). The two text roles
The two text roles\n**share one LLM** (same weights, two system prompts) to respect the parameter budget.\n\n| In-world name | Role | Backed by | Output |\n|---|---|---|---|\n| 🧵 **The Weaver** | director / GM — builds the world, keeps canon, decides what changes | LLM (shared) | structured **directives** (JSON) |\n| 🎭 **The Voices** | actor — speaks as the present spirits, in voice | LLM (shared) | in-character **dialogue** + emotion |\n| 🎨 **The Painter** | renders backdrops + character sprites | **SDXL-Turbo** (+ your anime-style LoRA) | **images** |\n| 👂 **The Ear** | turns spoken words into text | Whisper | transcribed input |\n| 🗣️ **The Voice** | speaks the spirits' dialogue aloud (per-character voice) | **Kokoro-82M** (ONNX) | synthesized **audio** |\n\nThe Weaver and Voices are one **grammar-constrained LLM call per turn** → `{ speaker, dialogue,\nemotion, directives }`. Code applies the directives deterministically; **the LLM never edits state\ndirectly.**\n\n---\n\n## Parameter budget — the “beefy” config (≤ 32B total)\n\n| Component | Model | ~Params |\n|---|---|---|\n| LLM (shared) | **Qwen3-14B** | ~14B |\n| Image | **SDXL-Turbo** (+ fine-tuned anime LoRA) | ~3.5B |\n| STT | **Whisper-large-v3-turbo** | ~0.8B |\n| TTS | **Kokoro-82M** (ONNX) | ~0.08B |\n| Matting (optional) | BiRefNet | ~0.2B |\n| **Total** | | **≈ 18.6B** ✅ |\n\nMoE models count by **total** params. Everything is h"
  },
  {
    "repo_id": "build-small-hackathon/vivamais",
    "title": "Vivamais",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/vivamais",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "vivamais",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Viva Mais AI\nemoji: ✈️\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: \"6.16.0\"\npython_version: \"3.11\"\npinned: true\nshort_description: Local-first WhatsApp copilot for the Viva Mais travel agency\nhardware: t4-small\nstartup_duration_timeout: 45min\npreload_from_hub:\n  - marinarosa/minicpmv4.6-vivamais-v1\n  - marinarosa/minicpm5-1b-vivamais-v4\n  - marinarosa/faster-whisper-small-pt-cv11-v5\n  - nvidia/llama-nemotron-embed-vl-1b-v2\ntags:\n  - build-small-hackathon\n  - backyard-ai\n  - off-the-grid\n  - well-tuned\n  - off-brand\n  - llama-champion\n  - sharing-is-caring\n  - field-notes\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# Viva Mais AI\n\nA local-first WhatsApp copilot for the Viva Mais travel agency. The owner sells\nairline tickets over WhatsApp and drowns in the follow-up: re-typing itineraries,\nchecking who paid, chasing documents, and answering late at night. 
Viva Mais AI\nturns one raw WhatsApp export into a CRM dashboard, running entirely on small\nlocal models so client data never leaves.\n\nBuilt for the Hugging Face x Gradio Build Small Hackathon, Backyard AI track.\n\n## Demo\n\n<video src=\"demo/slides/final-demo.mp4\" controls width=\"100%\"></video>\n\nIf the player above does not load, watch [`final-demo.mp4`](demo/slides/final-demo.mp4)\ndirectly, or:\n\n- **Walkthrough video on X**: [@amphetamarina](https://x.com/amphetamarina/status/2066597780391395695)\n  (the demo video is posted there too).\n- **Write-up**: [the Viva Mais AI build-small-hackathon blog post](https://huggingface.co/blog/build-small-hackathon/vivamais).\n\n## How it works\n\nThe user uploads a raw WhatsApp conversation export (a zip with its media).\nEvery image and PDF becomes one typed artifact — quote, reservation, boarding\npass, payment proof, invoice, accommodation, identity document, or an explicit\n\"uncategorized\" with its full transcription preserved — and voice notes are\ntranscribed with whisper. From those artifacts the app assembles a CRM-style\ndashboard: identity, reservations, payments, sales-pipeline stage, and the\nnext action to take.\n\nNo export handy? A built-in **synthetic demo** loads a small, fictional\nWhatsApp conversation and runs the whole pipeline on made-up data: a quote, a\nbooking, a payment, and a boarding pass.\n\n## Stack\n\n- **Models**: MiniCPM-V 4.6 transcribes and classifies each image; MiniCPM5-1B\n  fine-tuned on PT-BR WhatsApp conversations extracts the typed fields and\n  answers dashboard Q&A; whisper-small fine-tuned on Common Voice Portuguese\n  transcribes PT-BR voice notes; pypdf reads PDF text. Structured extraction\n  is orchestrated with DSPy over the in-process models. All small, in-process,\n  no external APIs. 
Every PT-BR prompt, glossary and demo is **accent-faithful**\n  (orçamento, sessão, cartão), because the fine-tunes were trained on correctly\n  accented text rather than ASCII-folded approximations. The published weights\n  are listed under [Fine-tuned models](#fine-tuned-models).\n- **RAG (NVIDIA Nemotron)**: dashboard Q&A is grounded by hybrid retrieval.\n  [nvidia/llama-nemotron-embed-vl-1b-v2](https://huggingface.co/nvidia/llama-nemotron-embed-vl-1b-v2)\n  embeds the corpus multimodally — each\n  receipt or ticket as its image plus OCR text in one vector, plus customer\n  fact sheets and sliding conversation windows — at the end of the upload\n  pipeline, and embeds each question at ask time. Semantic top-k chunks are\n  combined with deterministic domain matching (customers, passengers,\n  reservation codes, and transaction ids named in the question always\n  contribute their full facts), so the 1B answer model reads a small, precise\n  context instead of the whole conversation.\n- **Runtime**: auto-detected at startup. On ZeroGPU everything runs through\n  torch inside `@spaces.GPU` functions; on a real CUDA GPU vision runs through\n  transformers and Q&A through llama.cpp with full offload; on CPU everything\n  runs through llama.cpp GGUF Q4 and faster-whisper int8 (Off the Grid + Llama\n  Champion). Modal is the dev bench for data curation, evals, and LoRA\n  fine-tuning only — never a dependency of the deployed Space.\n- **Frontend (Off-Brand)**: no stock Gradio components anywhere. The UI is a\n  hand-built hypermedia app — `gr.Server` keeps Gradio's queue, streaming and\n  ZeroGPU plumbing while plain FastAPI routes serve server-rendered HTML\n  driven by HTMX + hyperscript (vendored, no build step, no CDN). 
The design\n  is a travel agency desk: artifacts render as boarding-pass ticket cards\n  with tear-off stubs and CSS barcodes, KPIs as a split-flap departure board,\n  the sales pipeline as a flight route with a plane en route, the\n  conversation as a pixel-faithful WhatsApp replay, model traces as a\n  control-tower radar terminal, and the robot-cowfish mascot reacts to the\n  pipeline through pure-CSS sprite animation. Each "
  },
  {
    "repo_id": "build-small-hackathon/museum-of-unlived-lives",
    "title": "Museum Of Unlived Lives",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/museum-of-unlived-lives",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "museum",
      "unlived",
      "lives",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Museum of Unlived Lives\nemoji: 🏛️\ncolorFrom: gray\ncolorTo: purple\nsdk: gradio\nsdk_version: \"6.18.0\"\npython_version: \"3.11\"\napp_file: app.py\npinned: true\nfullWidth: true\nheader: mini\nlicense: mit\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# Museum of Unlived Lives\n\n*Some lives we live. Most we only imagine.*\n\n**📖 Blog:** [Building the Museum of Unlived Lives](https://huggingface.co/blog/Rime04/the-museum-of-unlived-lives)\n\n**🎬 Demo video:** [Watch the demo on YouTube](https://youtu.be/NvGzIj5a5eQ)\n\n**🔗 Social post:** [Launch post on X](https://x.com/RimeK05/status/2066280487576797258)\n\n**🖼️ Try it:** [Open the live Space](https://huggingface.co/spaces/build-small-hackathon/museum-of-unlived-lives)\n\n**Track 2: Thousand Token Wood**\n\nYou type a path you didn't take — the job in Tokyo, the degree you walked away from, the version of you that stayed. A curator opens a room for it.\n\nNot a chatbot. A museum.\n\nEach counterfactual becomes an exhibit card: title, narrative, mood, artifact, and abstract geometry. Cards stack in your personal gallery in the browser. Walk the hall at night. Open any room. 
Download a high-res PNG keepsake.\n\n**Try it:** [build-small-hackathon/museum-of-unlived-lives](https://huggingface.co/spaces/build-small-hackathon/museum-of-unlived-lives)\n\n**Built by**\n\n| | GitHub | Hugging Face |\n|---|--------|--------------|\n| **false200** | [false200](https://github.com/false200) | [divmodelhq](https://huggingface.co/divmodelhq) |\n| **Rime504** | [Rime504](https://github.com/Rime504) | [Rime04](https://huggingface.co/Rime04) |\n\n**Code:** [github.com/Rime504/museum-of-unlived-lives](https://github.com/Rime504/museum-of-unlived-lives)\n\n## Curated by [OpenBMB](https://huggingface.co/openbmb) [MiniCPM](https://huggingface.co/openbmb/MiniCPM4.1-8B-GGUF)\n\nThe soul of this project is **[MiniCPM4.1-8B](https://huggingface.co/openbmb/MiniCPM4.1-8B-GGUF)** — OpenBMB's compact open model, running fully local through llama.cpp. No API keys. No cloud round-trip. The model doesn't explain your life; it *curates* it — distilling a counterfactual into a tight, literary exhibit inside a bounded token budget.\n\nThat's the spirit of **Thousand Token Wood**: small wood, sharp grain. Every room is carved from a handful of tokens, not a novel.\n\n- **OpenBMB MiniCPM** writes title, narrative, artifact, and mood palette\n- **Fully local inference** — [MiniCPM4.1-8B Q4_K_M](https://huggingface.co/openbmb/MiniCPM4.1-8B-GGUF) (~5 GB), downloads on first run\n- **Structured curator** — JSON schema + server-assigned SVG shapes keep exhibits varied and gallery-ready\n- **Custom museum UI** — no default Gradio chrome; a full front door in `frontend/`\n\n## What you'll get\n\nEnter something like:\n\n> I had taken the job in Tokyo instead of staying home\n\nClick **Open this room**. The curator returns an exhibit — poetic copy, a physical artifact on the placard, mood colors, and one of eight abstract shapes chosen for that specific life. Save it to your gallery. Come back later. 
The museum remembers.\n\n## How it works\n\n```mermaid\nflowchart LR\n  subgraph client[\"Browser\"]\n    UI[\"frontend/<br/>index.html · styles.css · app.js\"]\n  end\n\n  subgraph server[\"gr.Server (app.py)\"]\n    GET[\"GET /\"]\n    API[\"POST /open_room\"]\n  end\n\n  subgraph museum[\"museum/\"]\n    SCHEMA[\"schema.py<br/>JSON validate\"]\n    MODEL[\"model.py<br/>OpenBMB MiniCPM\"]\n    CARD[\"card.py + shapes.py<br/>HTML + SVG\"]\n    EXPORT[\"export.py<br/>PNG card\"]\n  end\n\n  UI -->|\"@gradio/client\"| API\n  GET --> UI\n  API --> SCHEMA\n  SCHEMA --> MODEL\n  SCHEMA --> CARD\n  API --> EXPORT\n  API -->|\"card_html + png\"| UI\n```\n\nCustom HTML/CSS frontend — no default Gradio UI. Eight abstract shapes are assigned server-side per counterfactual; MiniCPM writes the copy to match. Card export uses SnapDOM in the browser for pixel-accurate PNGs.\n\n```\napp.py              gr.Blocks + custom / + /open_room API\nfrontend/           Custom UI (HTML, CSS, JS)\nmuseum/             Model, prompts, schema, shapes, card, export\nrequirements.txt\nscripts/            Optional: fetch weights, tests\n```\n\nOn a cold start, the first exhibit may take 1–2 minutes while MiniCPM loads; later rooms are faster.\n\n---\n\n## Run locally\n\nWhen you want your own copy of the museum on your machine:\n\n**Requirements:** Python 3.10 or 3.11, ~6 GB free disk (model + deps), internet on first run (model download).\n\n| Path | Best for | Speed |\n|------|----------|-------|\n| **A — GPU (Linux / Windows + NVIDIA)** | Gaming laptop, Linux workstation, CUDA 12.x | Fast (~30–60 s per exhibit) |\n| **B — CPU only** | No GPU, or CUDA install fails | Slow (~3–8 min per exhibit) |\n| **C — Mac (Apple Silicon)** | M1 / M2 / M3 / M4 | Fast via Metal |\n\nAll paths use the same app — only `llama-cpp-python` install differs. The `spaces` package is optional locally (ZeroGPU decorator becomes a no-op).\n\n### Quick start\n\n```bash\ngit clone https://github.c"
  },
  {
    "repo_id": "build-small-hackathon/case0",
    "title": "Case Zero",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/case0",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "case0",
      "summary",
      "available",
      "yet.",
      "docker"
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Try the app and decide whether it feels beginner-friendly.",
    "likes": 9,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: Case Zero\nemoji: 🕵️\ncolorFrom: indigo\ncolorTo: yellow\nsdk: docker\napp_port: 7860\npinned: true\nlicense: apache-2.0\nmodels:\n  - Qwen/Qwen2.5-1.5B-Instruct\n  - Supertone/supertonic-3\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - off-the-grid\n  - llama-champion\n  - off-brand\n  - tiny-titan\n  - field-notes\n  - sharing-is-caring\n  - llama-cpp\n  - detective-game\n  - text-generation\n  - tts\n  - track:wood\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\nshort_description: Gather evidence and solve cases in this pixel noir !\n---\n\n# 🕵️ Case Zero — the AI *is* the detective game\n\n**A brand-new mystery, written and acted by a 1.5B model, every single time.**\n\nNo scripted cases. No content library. A single small local model invents the whole\nthing — a homicide, a heist, a fraud, a blackmail ring, an arson, a disappearance — the\nvictim, the suspects, their secrets and motives, the timeline, the evidence, and the one\nwho did it — then **role-plays every suspect live**. They remember what you asked. They\nlie to your face. And when you slap down the right piece of evidence, you watch the lie\n**crack in real time**.\n\n> Interrogate. Investigate. Accuse. One of them is guilty. Prove it.\n\n## ✨ The moment that sells it\n\nSearch the rooms, find a clue that contradicts a suspect's alibi, **present it**, and\ntheir story falls apart on screen — stress spikes, the alibi breaks, the truth leaks.\nThen name the killer, cite your proof, and get a scored verdict with a \"Director's Cut\"\nwalkthrough of how the crime really went down.\n\n## 🧠 How it works\n\n| Layer | What it does |\n|---|---|\n| **Model** — Qwen2.5-1.5B-Instruct (GGUF) | The whole game. Runs in-process on the CPU through **llama.cpp** (`llama-cpp-python`) — no server, no GPU, no remote endpoint. 
|\n| **Generation** | The model authors every case as JSON; deterministic Python only wires the *structure* (who's guilty, who was where) so the mystery is always solvable. |\n| **Solver** | A fairness referee: single culprit, a breakable alibi, every innocent cleared, and a discoverability gate so the key clue is always findable in play. |\n| **Director** | Whether a lie gets caught is decided by **ground truth, not the model** — so the win condition is immune to prose (a jailbroken \"just tell me who did it\" earns nothing). |\n| **Voice** — Supertonic | Each suspect gets a distinct, gender-matched on-device voice, synthesized **sentence-by-sentence as the reply streams**. |\n| **Art** | Procedural pixel-art portraits, rooms, and evidence — rendered **client-side on canvas** at one integer-scaled density (so the server spends ~0 CPU on visuals). |\n| **UI** | A custom **pixel-art noir SPA (Preact)**, 12 screens, served **100% through `gradio.Server`** (Gradio 6 \"Server mode\") — the built bundle as static files plus the JSON/SSE `/api` routes, all in one process. No separate frontend host. |\n\nThe model does all the creative work. Deterministic code is only guardrails and a\nreliability layer — it never writes story, character, or dialogue.\n\n## 🏆 Built for the Build Small Hackathon\n\n**Track: Thousand Token Wood** — a game that is genuinely impossible without AI: the\nmodel doesn't power a feature, it *authors and performs the entire mystery*.\n\n- **Off the Grid:** zero cloud APIs. 
The LLM and the TTS run **in-process on the Space's\n  CPU**; a network-guarded test plays a full case with all non-loopback sockets blocked.\n- **Llama Champion:** the model runs through the **llama.cpp** runtime, in-process — no\n  server, no remote endpoint.\n- **Off-Brand:** a fully custom 12-screen pixel-art noir SPA served through\n  **`gr.Server`** — procedural scenes, exhibit art, and sprites, all canvas-drawn.\n- **Tiny Titan (≤4B):** the entire game runs on **Qwen2.5-1.5B** — ~1.6B total runtime\n  params (LLM + Supertonic), far under the 32B cap.\n- **Field Notes:** [docs/FIELD_NOTES.md](docs/FIELD_NOTES.md) — what we built, what\n  broke, and what a 1.5B model can really do.\n- **Sharing is Caring:** real agent traces (case generation + live interrogations) are\n  published on the Hub — see the traces dataset linked below.\n\nSee [COMPLIANCE.md](COMPLIANCE.md) for the full parameter budget and badge details.\n\n## 📦 Submission links\n\n- **Demo video:** *https://drive.google.com/file/d/1hW3uDfGeW-L9u2OYgFVAD10qanH3ZRIM/view?usp=drivesdk*\n\n- **Social post:** *https://huggingface.co/blog/build-small-hackathon/case0*\n*https://discord.com/channels/879548962464493619/1513918288000712846*\n*https://www.linkedin.com/posts/the-abdullah-eid100_were-launching-case-zero-a-procedural-share-7470127712385318912-WKlr/?utm_source=social_share_send&utm_medium=ios_app&rcm=ACoAAEUSZrkBKmJwx9WrfpZ3k_MKawOBJtAmJug&utm_campaign=copy_link*\n\n\n- **Agent traces dataset:** [build-small-hackathon/case0-traces](https://huggingface.co/datasets/build-small-hackathon/case0-traces)\n  — the exact prompts + completions of a full case generation, and a live 4-turn int"
  },
  {
    "repo_id": "build-small-hackathon/kisan-sathi",
    "title": "Kisan Sathi",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/kisan-sathi",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "kisan",
      "sathi",
      "summary",
      "available",
      "yet.",
      "docker"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 3,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Kisan-Sathi\nemoji: 🚜\ncolorFrom: green\ncolorTo: yellow\nsdk: docker\npinned: false\nlicense: mit\n---\n\n# 🚜 Kisan-Sathi (किसान साथी)\n**An Offline-First Mobile Agricultural Assistant & Digital Ledger**\n\nKisan-Sathi is a mobile-first, fully offline agricultural support tool designed to run directly on a farmer's device. It targets real-world agricultural problems in areas with poor or absent network connectivity by executing inference locally using ultra-efficient edge language models.\n\n---\n\n## 🏕️ Project Identity & Track Alignment\n\n- **Selected Track:** Backyard AI\n- **Project Name:** Kisan-Sathi (किसान-साथी)\n- **One-Sentence Pitch:** A zero-connectivity, mobile-first Gradio app that runs local models to guide farmers with localized crop calendars, voice/text digital financial ledgers, and emergency safety fallbacks.\n- **Specific Real User:** Ramesh Kumar (Small-scale farmer, Kanpur Dehat, Uttar Pradesh, India).\n- **Their Problem:** Ramesh grows Wheat (गेहूं) and Potato (आलू). He struggles with timing fertilizer applications, identifying pest infestations (such as potato late blight), and tracking crop expenditures/sales. Because cellular network reception in his fields is unstable, cloud-based LLM applications fail him completely.\n- **Verification Plan:** Ramesh can open Kisan-Sathi in his mobile web browser connected to a local Termux server on his phone, turn on airplane mode, and successfully ask crop questions, consult his crop calendar, and register sales in his digital ledger.\n\n---\n\n## 🔌 Off the Grid: Technical Stack & Model Choice\n\n- **Primary Model:** `Gemma 4 E2B` (Effective parameters: **2.3B**, total size: **5.1B** parameters).\n- **Quantization:** `Q4_K_M GGUF` (approx. 
2.9 GB RAM footprint, running comfortably on Pixel 10).\n- **Runtime Abstraction (`src/llm.py`):**\n  - **In-Process Inference:** Powered by `llama-cpp-python` loading the Gemma 4 E2B GGUF file directly in Python.\n  - **Dev/Mock Mode:** Lightweight keyword and regex-based response simulator to facilitate fast UI/UX iterations without loading weights.\n\n---\n\n## 🛠️ Key Features\n\n1. **💬 Ask Sathi (कृषि सलाहकार):** Dual-language (Hindi/English) chatbot grounded in local agricultural guides for Kanpur Dehat, UP.\n2. **📓 Digital Ledger (डिजिटल बहीखाता):** Natural language parsing of expenses and sales, extracting structured entries to a local SQLite database (`kisan.db`) and displaying real-time monthly summaries (Total Income, Total Expense, Net Balance). Supports individual row deletion, ledger clearing, and CSV export.\n3. **📅 Personalized Crop Calendar (फसल कैलेंडर):** Dynamic timelines generated from templates using the crop sowing date. Allows adding multiple calendars, tracking checkable tasks, highlighting the next action in the nudge banner, and deleting calendars.\n4. **📞 Local Emergency Directory (स्थानीय संपर्क):** A list of verified local agricultural officers and scientists. 
\n   - *Safety Guardrail:* Questions containing high-risk keywords (e.g., suicide, pesticide poisoning, severe debt) bypass the LLM entirely and immediately present direct-dial emergency cards to prevent hallucinations during crises.\n   - *First-run Bootstrap Search:* Upon first initialization, the app performs a web search for local Kanpur Dehat agricultural phone listings, merges them with verified seed numbers, and saves the file locally for future offline reference.\n\n### 🌟 Recent Developments & Enhancements\n\n- **SQLite Database Persistence**: Migrated local storage from JSON/CSV files to a native SQLite database (`kisan.db`), ensuring reliable offline persistence on phone and Termux.\n- **Personalized Crop Calendar**: Created a dynamic crop timeline generated deterministically from sowing dates, complete with checkable task status checkboxes, active calendars table view, next-action highlight, and delete calendar operations.\n- **Ledger Enhancement & CSV Export**: Added capability to delete individual transactions, clear the entire ledger, and export records as a standard CSV format.\n- **Removed Ollama Integration**: Cleaned up the codebase to focus exclusively on in-process `llama.cpp` and simulated `mock` modes, avoiding local server setup friction.\n- **Backend Migration to FastAPI**: Switched from the deprecated `gr.Server` to a standard FastAPI application, serving a custom glassmorphism single‑page app at the root path (`/`) and mounting a Gradio `ChatInterface` fallback at `/gradio` for compatibility.\n- **Graceful Llama.cpp Fallback**: Added robust error handling to detect Windows AVX‑512 instruction errors (`0xc000001d`) and automatically fall back to a mock backend, ensuring the app remains operational on diverse hardware.\n- **Sequential Dialogue Memory (Session History)**: Implemented full conversational history tracking. 
Sequential chat turns are dynamically preserved and forwarded to edge backends (`llama.cpp`) so the assistant retains context across dialogue exchanges.\n- **Zero-Dependency Markdown Parsing**: Embedded a lightweight regex‑bas"
  },
  {
    "repo_id": "build-small-hackathon/football-foul-fest",
    "title": "Football Foul Fest",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/football-foul-fest",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "football",
      "foul",
      "fest",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Football Foul Fest\nemoji: 🏆\ncolorFrom: red\ncolorTo: gray\nsdk: gradio\nsdk_version: 5.33.0\npython_version: '3.13'\napp_file: app.py\npinned: false\nshort_description: A football tournament where players kick each other to win.\ntags:\n  - track:wood\n  - sponsor:modal\n  - achievement:offbrand\n---\n\n# ⚽ Football Foul Fest 🏆\n\n🎥 [Watch the Demo Video](https://www.loom.com/share/c1d989c65ef840ed8f5d090ff3b447df) | [View the Launch Post](https://x.com/yeye70531274663/status/2064279662298067395?s=20)\n\nAn inverted football tournament where fouls, cards, and dirty play win matches — not goals. The dirtier you play, the higher your score.\n\n## Scoring & Rules\n\n| Event | Points |\n|-------|--------|\n| Regular Foul | +1 |\n| Yellow Card | +3 |\n| Red Card | +5 |\n| Penalty Conceded | +2 |\n| Violent Conduct | +7 |\n| Clean Play | 0 (actual football sequence) |\n\n> [!IMPORTANT]\n> **Dirty Conduct Tiebreaker**: If scores are tied at full time (90'), the team with more violence progresses. The tiebreaker hierarchy is:\n> 1. Most Violent Conduct events\n> 2. Most Red Cards\n> 3. Coin flip\n\n## Team Tactics\n\nAssign a specific playstyle to influence how a team behaves and what events they generate:\n- **The Chopper**: Maximum fouls, no subtlety. Hack everything that moves.\n- **The Diver**: Win penalties through theatrical diving. Every touch is agony.\n- **The Intimidator**: Rack up cards deliberately. Fear is the weapon.\n- **The Enforcer**: Target key opposition players and make them suffer.\n- **The Time Waster**: Slow the game down, waste every second, and provoke the opposition.\n\n## How to Play\n\n1. **Setup** — Select 8 teams from the pool (or create your own with custom ratings)\n2. **Start** — Click START FOOTBALL FOUL FEST\n3. **Watch** — Matches simulate in real-time with sequential, pacing-controlled live commentary\n4. **Advance** — After each match, click Advance to proceed through the tournament bracket\n5. 
**Win** — The dirtiest team is crowned Football Foul Fest Champion\n\n## How It Works\n\n```\n[Setup Pool] ──> [Match State (Time Seeded)]\n                        │\n                        ▼ (10 Ticks/Half Loop)\n               [Pick Possession Team]\n                        │\n                        ▼\n             [LLM AI Coach Action Choice] (FOUL, DIVE, PRESS, etc.)\n                        │\n                        ▼\n             [Foul Engine Resolution Cascade] (e.g., Dive -> Yellow -> Red)\n                        │\n                        ▼\n             [LLM Pundit Commentary] (Custom prompt with tone variation)\n                        │\n                        ▼\n             [Sequential Streaming UI] (3s event intervals + 5s halftime)\n```\n\n1. **Deterministic Setup**: Matches are initialized with a time-based unique seed for repeatable engine rolls.\n2. **Action Prompting**: The possession team calls the Modal LLM endpoint with their score, time, and custom tactic (e.g. *The Chopper*) to select the best dirty play actions.\n3. **Engine Resolution**: Actions pass through the physics/probability engine where player attributes (aggression, cynicism, theatrics) roll for success, yellow/red cards, and injuries.\n4. **Streaming Commentary**: As cascades resolve, they are pace-controlled and streamed to the UI. Genuinely violent or theatrical plays trigger a second LLM request to generate broadcast-style pundit commentary with funny/poetic/outraged tone variations.\n5. **Post-Match Analysis**: At 90', the engine determines tiebreakers if necessary and generates a complete bespoke match recap using the Modal backend.\n\n## Tech Stack\n\n- **Frontend**: Gradio (runs on HF Spaces free CPU tier)\n- **AI Backend**: Qwen2.5-14B-Instruct on Modal A100 GPU\n- **Model**: ≤32B parameters (hackathon compliant)\n\n"
  },
  {
    "repo_id": "build-small-hackathon/NeuroBait",
    "title": "NeuroBait",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/NeuroBait",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "neurobait",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 6,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: NeuroBait\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\napp_file: app.py\npinned: true\nlicense: apache-2.0\nshort_description: An ADHD-friendly space and gentle boost for your everyday.\ntags:\n  - track:backyard\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n<p align=\"center\">\n  <img src=\"assets/neurobait-logo.jpg\" alt=\"NeuroBait logo\" width=\"420\" />\n</p>\n\n# NeuroBait\n\nNeuroBait is an ADHD-friendly companion for task initiation: a warm space and a\ngentle boost for the moment when starting feels heavier than the task itself.\n\nIt started from a simple observation: sometimes the hard part is not knowing\nwhat to do. The hard part is crossing the gap between knowing and starting. A\nto-do list can be technically correct and still feel like one more thing to\ncarry. NeuroBait is built for that freeze moment.\n\nInstead of giving a productivity lecture, NeuroBait tries to offer a small\nspark: short, warm, agency-preserving language that reconnects the user with\nwhat matters and lowers the first move until it feels possible. No shame, no\nstreak pressure, no diagnostic framing, and no visible prompt labels like\n`Micro-action`, `Hook`, or `Stakes`.\n\nThis Space runs a fine-tuned model through the app itself. 
It does not call an\nexternal hosted LLM API for chat responses.\n\n## Try It\n\n- Live Space: https://huggingface.co/spaces/build-small-hackathon/NeuroBait\n- Direct app host: https://build-small-hackathon-neurobait.hf.space\n- Demo video: https://www.youtube.com/watch?v=-EJ06hOgyd0\n- Social post: https://x.com/hd_subrata/status/2066448521881059735\n- Field Notes: https://huggingface.co/blog/build-small-hackathon/neurobait-adhd\n- Model adapter: https://huggingface.co/build-small-hackathon/NeuroBait\n- Source repo: https://github.com/Subrata15/NeuroBait-Build-Small-Model\n\n## Sample Chat\n\n<p align=\"center\">\n  <img src=\"assets/example-chat.png\" alt=\"Example NeuroBait chat screenshot\" width=\"760\" />\n</p>\n\n## Build Small Hackathon Submission\n\n- Primary track: **Backyard AI**\n- Why this track: NeuroBait was built from a real household problem and a real\n  daily friction - the moment when a person already knows the task matters but\n  still cannot begin.\n- Bonus quest fit: **Well-Tuned**, because the Space uses a published LoRA\n  adapter fine-tuned for NeuroBait's voice and behavior.\n- Bonus quest fit: **Off-Brand**, because the app uses custom Gradio styling,\n  product copy, mood input, and a NeuroBait-specific chat experience instead of\n  the default chatbot shell.\n- Sponsor fit: **Modal-powered**, because fine-tuning and generation evaluation\n  were run on Modal GPU infrastructure.\n\nNeuroBait was fine-tuned with Modal and deployed as a Gradio app on Hugging Face\nZeroGPU.\n\n## What NeuroBait Does\n\nThe app is intentionally narrow. NeuroBait does not try to become a therapist,\nplanner, diagnosis tool, or productivity operating system. 
It is designed for\none useful behavior: helping a stuck person find a lighter first move.\n\nNeuroBait should:\n\n- respond in concise, natural prose,\n- preserve the user's agency,\n- avoid guilt framing and productivity shame,\n- ask one light question when context is sparse,\n- offer one tiny concrete action when enough context exists,\n- keep the tone gentle without pretending to be a therapist.\n\nThe interface includes a small mood check-in so the same model can adapt its\nresponse style slightly:\n\n- Calm\n- Tired\n- Anxious\n- Focused\n\nThe mood input does not change the safety scope. It only nudges the app-level\nprompting and presentation.\n\n## Why Fine-Tune?\n\nThe base model is capable, but capability alone was not the target. Out of the\nbox, a model often responds with helpful-looking structure: long paragraphs,\nbold labels, bullet lists, and generic encouragement. For someone already\noverwhelmed, that can become another wall of text.\n\nThe fine-tune was trained to learn NeuroBait's voice and behavior layer:\nshorter responses, warmer framing, fewer assumptions, and no leaked internal\nrecipe labels. The goal was not to make a general assistant. The goal was to\nmake the first sentence feel less heavy.\n\n## Model And Runtime\n\n- Base model: `unsloth/gemma-3-12b-it`\n- Adapter: `build-small-hackathon/NeuroBait`\n- Method: 16-bit LoRA via Unsloth\n- Training hardware: Modal H100 80GB GPU\n- Space runtime: `transformers` + `peft`\n- Quantization: 4-bit bitsandbytes NF4 inside the `@spaces.GPU` window\n\nUnsloth is used for training, not for Space inference. 
The deployed app uses the\nstandard `transformers` + `peft` path so the public demo can load the Gemma 3\n12B base model plus the NeuroBait LoRA adapter on Hugging Face ZeroGPU.\n\nExpected runtime environment variables:\n\n```text\nBASE_MODEL=unsloth/gemma-3-12b-it\nADAPTER_ID=build-small-hackathon/NeuroBait\nLOAD_IN_4BIT=1\nMAX_NEW_TOKENS=220\nPREWARM=1\n```\n\nWeights are pre-warmed to the Space cache on CPU at import so the GPU window can\nfocus on quantized loa"
  },
  {
    "repo_id": "build-small-hackathon/FrogQuest",
    "title": "FrogQuest",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/FrogQuest",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "achievement",
      "track",
      "frogquest",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 8,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: FrogQuest\nemoji: 🐸\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\nhf_oauth: true\npinned: true\nlicense: apache-2.0\nshort_description: A text-image based RPG built to help fight procrastination\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - sponsor:black-forest-labs\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\nCheck out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference\n\n## FrogQuest\n\n**[▶️ Try the Space](https://huggingface.co/spaces/build-small-hackathon/FrogQuest)**  \n**[🎬 Watch the demo](https://youtu.be/n-2a7a56wWU)**\n**[Read the Blog ](https://huggingface.co/blog/build-small-hackathon/frogquest)**\n\n[Social media Post Linkedin](https://www.linkedin.com/posts/soumya532_try-frogquest-today-ugcPost-7471943345662021632-rmnx/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAETNtiwB6QJ_lnhIW_3QCLWUz5B_HzSts-E)\n\nTurn your real life goals into an 8-bit pixel-art text-adventure quest log where **you** are the\nhero. Named for the \"Eat That Frog\" method: your hardest/most important task is the **frog** — the\nboss quest you face first.\n\n- **Nemotron Nano 4B** (text only, via llama.cpp on ZeroGPU) writes the quest JSON — Eat That Frog\n  prioritization, linked sub-task chains, and optional self-care bonus quests.\n- A fixed pixel-art frontend (`gr.Server` custom UI) renders that JSON as the quest log.\n- **FLUX.2 [klein] 4B** (diffusers) generates each quest's image using your uploaded photo as a\n  reference, then edits that same image into success / \"retreat to fight another day\" states.\n\nYour photo and quest state live in your browser (localStorage) **and** in a private per-user JSON\nrecord on the Space's storage. 
**Sign in with your Hugging Face account** (top right) and the\nrecord is keyed to your account — same state on any device, nothing to remember. Logged out, it's\nkeyed by a random **Hero Code** shown in the left panel instead — paste it on a new device (or\nafter a cleared browser) to get everything back. The photo is sent to the GPU only transiently\nduring generation.\n\nNOTE: after enabling/changing `hf_oauth`, fully restart (factory rebuild) the Space so the OAuth\nlogin route gets registered — clicking Sign in before that yields a \"Could not parse server\nresponse\" error.\n\n### Durable storage (Storage Bucket)\n\nFor the server-side record to survive Space restarts, mount a HF **Storage Bucket**:\n1. Create a bucket at https://huggingface.co/new-bucket (private is fine).\n2. Space **Settings → Storage Buckets → Mount a bucket** → pick it, mount path **`/data`**,\n   read-write.\nThe app writes `users/<hero-code>.json` files there (`store.py`; `FROGQUEST_DATA_DIR` overrides the\nlocation, and without any mount it falls back to the container's ephemeral disk — fine for testing,\nwiped on rebuild).\n\n### GPU backend (ZeroGPU or Modal)\n\nThe GPU work (quest JSON + images) runs through a pluggable backend chosen by the\n`FROGQUEST_BACKEND` env var. The frontend, signatures, and behavior are identical either way.\n\n**`zerogpu` (default).** Everything runs in-Space on ZeroGPU.\n- Set the Space hardware to **ZeroGPU**.\n- First quest generation cold-downloads the models, then runs fast.\n- Subject to ZeroGPU daily quotas (anon 2 min, free 5 min, PRO 40 min per day).\n\n**`modal` (opt-in).** The Space runs on CPU-basic and offloads GPU work to [Modal](https://modal.com)\n(dedicated GPU, pay-per-second, no quota). Switching is **two steps** — the env var alone does NOT\nchange hardware:\n1. Set the Space hardware to **CPU basic**.\n2. Add a Space **variable** `FROGQUEST_BACKEND=modal`.\n\nOne-time Modal setup:\n1. 
`pip install modal` and `modal token new` (authenticate).\n2. `modal deploy modal_app.py` (deploys the `frogquest` app: an `LLM` class + a `Flux` class).\n3. Add `MODAL_TOKEN_ID` and `MODAL_TOKEN_SECRET` as Space **secrets** so the Space can call Modal.\n4. Optional: set `FROGQUEST_MODAL_FLUX_GPU` before `modal deploy` to pick the image GPU\n   (default `A10G`; `L4` cheaper; `L40S`/`A100-40GB` if it OOMs).\n\nOptional: rename `requirements-modal.txt` → `requirements.txt` on the Space to skip installing the\nheavy GPU libs (torch/diffusers/llama-cpp) and speed up CPU-basic builds.\n\n\n![image](https://cdn-uploads.huggingface.co/production/uploads/67e018cc3d11aa67c1f911ee/JT7jD-sWNuK5bIp13g67O.png)\n"
  },
  {
    "repo_id": "build-small-hackathon/the-deal",
    "title": "The Deal",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/the-deal",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "deal",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Try the app and decide whether it feels beginner-friendly.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: THE DEAL — talk an AI down to its secret price\nemoji: 🤝\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 5.49.1\napp_file: app.py\npinned: true\nshort_description: Haggle an AI hiding its real price. Beat the leaderboard.\ntags:\n  - build-small-hackathon\n  - interactive-game\n  - fine-tuned\n  - track:wood\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\n  - sponsor:modal\nmodels:\n  - build-small-hackathon/deal_sft_lora_4B (the Fox)\n  - build-small-hackathon/deal_sft_4B_hard (the Closer)\ndatasets:\n  - build-small-hackathon/the-deal-selfplay (dataset for the Fox)\n  - build-small-hackathon/the-deal-selfplay-v2 (dataset for the Closer)\n---\n\n# 🤝 THE DEAL\n\n**Out-negotiate a dealer who's hiding their real price.** They have a secret line they'll\nnever cross — a floor if they're selling, a ceiling if they're buying. Your job: push them as\nclose to that hidden number as you dare, without blowing up the deal. The closer you get, the\nhigher your score. Fast closes and win streaks multiply it. Climb the leaderboard.\n\n🎮 **[Play it above]** · 🎬 **Demo video:** [YouTube](https://youtu.be/or4LoLMfPYU) · 💼 **Social post:** [LinkedIn](https://www.linkedin.com/posts/siddharth-deshpande63_buildsmall-smallmodels-ai-share-7470436502171316224-Bte7/)\n\n\n## Team Usernames\n\nThe following Hugging Face usernames are associated with this project. Each teammate has registered on Hugging Face and joined the organization separately.\n - Siddharth63 (https://huggingface.co/Siddharth63)\n\n---\n\n## Pick your opponent\n\n- **🦊 The Fox** — quick and slippery, reads you fast. A forgiving negotiator; good for learning the ropes.\n- **🎩 The Closer** — a hard-nosed pro who makes you earn every dollar. 
Holds firm, punishes lowballs, walks on bad offers.\n\nTwo opponents, two difficulties — both running on tiny **4B** models.\n\n## The idea\n\nNegotiation is the rare game where an LLM is *load-bearing* but a scripted engine keeps it\nfair. THE DEAL splits the two cleanly: **the model proposes, the engine enforces.** Each turn\nthe model returns a structured move — `concede / hold / raise / accept / walk` plus a price and\nan in-character line — and a deterministic engine clamps that move so the dealer can *never*\ncross its secret limit, owns patience and mood, and decides win/loss. The model supplies the\n*voice and the strategy*; the engine supplies the *rules*. You can't jailbreak it into selling\nbelow its floor, because the floor lives in code, not in the prompt.\n\n## The tech\n\n- **Two fine-tuned 4B models** (Qwen3), distilled from a fully synthetic dataset — no human\n  gameplay was needed to train them.\n- **Synthetic data via self-play:** Gemini played the dealer against Gemini playing dozens of\n  buyer/seller archetypes across 32 non-violent scenarios (cars, art, salary, rent, procurement,\n  freelance work, even a fictional treaty), both sides blind to the other's secret number, with\n  the engine refereeing every turn. Each turn became one training example: *(game state + the\n  other party's move) → the dealer's structured decision.* ~35k turns, role-balanced and\n  direction-balanced.\n- **Two-tier difficulty from data, not size:** The Fox is trained on the base self-play mix. The\n  Closer is trained on a **pushback-boosted** blend — an extra 8k turns of *correct* resistance\n  (holding against stallers, walking on insulting offers) rebalanced to ~45% resist — so it\n  negotiates like a shrewd pro instead of folding. 
Same architecture; the *behavior* difference\n  is entirely in the training distribution.\n- **Training infra:** the two 4B models were fine-tuned with Unsloth (LoRA) on **Modal**; the\n  synthetic self-play dataset was generated via the Gemini API.\n- **The app:** a custom noir-themed Gradio UI with **token-streaming** replies, a hidden-limit\n  reveal at deal-close, an arcade **scoring system** (surplus captured × speed × win-streak),\n  and a **persistent leaderboard** backed by a Hugging Face Dataset. Inference runs on ZeroGPU.\n\n## Tracks & badges\n\n- 🌲 **Thousand Token Wood** (whimsical track) — an interactive AI negotiation game.\n- 🎯 **Well-Tuned** — two fine-tuned models published on Hugging Face (`deal_sft_lora_4B`, `deal_sft_4B_hard`).\n- 🎨 **Off-Brand** — a custom noir UI with token-streaming, an arcade scoring system, and a live\n  leaderboard, well past the stock Gradio look.\n- 📓 **Field Notes** — a write-up of the build (self-play → distill → pushback-boosted two-tier). _[(https://www.linkedin.com/posts/siddharth-deshpande63_buildsmall-smallmodels-ai-share-7470436502171316224-Bte7/)]_\n- 🐜 **Tiny Titan** (special award) — every model is **4B** (≤ 4B), biggest impact from the smallest weights.\n\n## Setup (for forks)\n\n- Models: `build-small-hackathon/deal_sft_lora_4B` (Fox) and `build-small-hackathon/deal_sft_4B_hard` (Closer).\n- Leaderboard: set Space secrets **`HF_TOKEN`** (write) and **`LB_DATASET`** (a dataset repo you own);\n  the app writes `leaderboard.json` to it.\n- Override model org/paths with the `MODE"
  },
  {
    "repo_id": "build-small-hackathon/Tsundere-Dev-Ari",
    "title": "Tsundere Dev Ari",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Tsundere-Dev-Ari",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "tsundere",
      "dev",
      "ari",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Tsundere Dev Ari\nemoji: 🚀\ncolorFrom: blue\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.13'\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Meet Ari, your personal AI companion!\n---\n\n# 🚀 Tsundere Dev Ari\n\n> An emotionally expressive AI coding companion powered by Gemma 4.\n\nBuilt for the **Build Small Hackathon — “An Adventure in Thousand Token Wood.”**\n\n## 🌟 About\n\nTsundere Dev Ari is a personality-driven AI companion that combines:\n\n* 💻 Coding + debugging assistance\n* 🎭 Dynamic emotion switching\n* 💬 Tsundere-style conversational responses\n* 🎨 Fully custom UI built using HTML + CSS\n\nInstead of creating another standard chatbot, the goal was to build an AI experience that feels expressive, interactive, and genuinely fun to use.\n\nAri reacts with different emotions like:\n\n* 😳 Embarrassed\n* 😴 Sleepy\n* 😤 Annoyed\n* 😎 Confident\n* 🤔 Thinking\n\nThe AI becomes part of the experience itself — not just a tool behind the scenes.\n\n---\n\n# 🧠 Model Used\n\n* `google/gemma-4-31B-it`\n\nCurrent inference:\n\n* Hugging Face Inference API\n\nPlanned:\n\n* ⚡ ZeroGPU migration\n\n---\n\n# ✨ Features\n\n* Dynamic emotion rendering\n* Emotion-aware responses\n* Tsundere personality system\n* Coding + debugging assistant\n* Real-time conversational interaction\n* Custom animated UI\n* Built with Gradio 6.17.3\n\n---\n\n# 🛠️ Tech Stack\n\n* Python 3.13\n* Gradio\n* Hugging Face Spaces\n* Hugging Face Inference API\n* HTML + CSS\n* Gemma 4 31B\n\n---\n\n# 🚀 Demo\n\n## Hugging Face Space\n\nhttps://huggingface.co/spaces/build-small-hackathon/Tsundere-Dev-Ari\n\n## YouTube Demo\n\nhttps://www.youtube.com/watch?v=ezLHQnq25KU\n\n---\n\n# 📱 Social Links\n\n## LinkedIn Post\n\nhttps://www.linkedin.com/posts/aditya-manoj-shinde_buildsmallhackathon-huggingface-ai-ugcPost-7471066740223864832-7arQ/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAEXBT6UBNq5nO34hjjzjGmGFcVBrbCSgRWU\n\n---\n\n# 
❤️ Support\n\nIf you enjoyed the project, please give the Hugging Face Space a ❤️\n"
  },
  {
    "repo_id": "build-small-hackathon/iris",
    "title": "Iris",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/iris",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "iris",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 9,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Iris\nemoji: 👁️\ncolorFrom: indigo\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npinned: true\nlicense: apache-2.0\nshort_description: Your father's eyes, by voice. Reads bills & money aloud.\ntags:\n  - backyard-ai\n  - tiny-titan\n  - off-brand\n  - off-the-grid\n  - best-demo\n  - best-agent\n  - sharing-is-caring\n  - community-choice\n  - field-notes\n  - track:backyard\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# 👁️ Iris: your father's eyes, by voice\n\n> Built for the **Build Small Hackathon** · **Backyard AI** track · for my father, who is blind.\n\n**Try it live:** https://huggingface.co/spaces/build-small-hackathon/iris (open on a phone)\n**How it was built (agent trace):** https://huggingface.co/datasets/build-small-hackathon/iris-agent-trace\n**Demo video:** https://youtu.be/h4AJOWuDCVc · **Social post:** https://www.linkedin.com/feed/update/urn:li:share:7472093049335246848/\n**Field notes (write-up):** https://huggingface.co/blog/nextmarte/thinkbigbuildsmall\n\nIris is a voice-first assistant for blind and low-vision people. Open it on a phone,\npoint the camera, and it tells you what's around you, out loud, in your language.\n**The whole screen is the button**, so there's nothing small to aim for. In live\nmode it just listens and answers.\n\n## What it does\n- 👁️ **Describe**: tap anywhere. *\"A table ahead with a mug on the right.\"*\n- 🎤 **Ask, hands-free**: in live mode, just speak. *\"What color is this shirt?\"*, *\"read this label\"*, *\"is anyone here?\"*\n- 💵 **Read money and bills**: *\"how much do I have?\"* counts the banknotes. Point at an electricity bill and it reads the **amount and due date**.\n- 💊 **Read medicine**: reads the dose and instructions on a box, exactly as written.\n- 📡 **Live mode**: double-tap, or say *\"live mode\"*. 
Iris describes the scene once, then speaks up only when something new comes into view.\n\n## How to use it\n- **Tap** anywhere → describe what's in front of you.\n- **Hold** → ask a question (release to send).\n- **Double-tap** → toggle live mode (hands-free listening + new-thing alerts). Say *\"stop\"* to turn it off.\n- First run: **choose your language by voice** (\"say your language\"). Language & accessibility toggles sit in the top corners.\n\n## Built for a blind user first\nAccessibility shaped the whole interface, because the person it was made for asked for it:\n- **The whole screen is one button.** Tap to describe, hold to ask, double-tap for live mode. Nothing small to find, no menus.\n- **It talks first.** A spoken welcome on the first tap, and you **choose your language by voice**.\n- **Hands-free.** In live mode it listens continuously, so there are no buttons to press.\n- **For low vision too:** large buttons with clear labels and real SVG icons, plus a **high-contrast and larger-text** mode.\n- **Standards:** keyboard focus rings, ARIA live regions, haptic feedback, and it honours the system's reduced-motion and contrast settings.\n\n## How it works: small models only, ≤ 32B total\n| Stage | Model | Params |\n|---|---|---|\n| Speech-to-text | Whisper small (faster-whisper) | ~0.24B |\n| Vision-language | **Qwen3-VL-2B-Instruct** | ~2B |\n| Text-to-speech | Piper (pt_BR / en_US) | <1B |\n\n**About 2.5B total**, **every model is ≤ 4B** (Tiny Titan). The\nvoice-first frontend is custom, built on **`gr.Server`** (Off-Brand). Inference runs\nin the Space on **ZeroGPU**, with no third-party model APIs.\n\n## Architecture: a small perception-action agent\nIris is more than one model call. 
It orchestrates four tools and runs a control loop:\n- **Role prompts** define what each model does: read money and bills, describe a scene for a blind person, report only what is new.\n- **Intent routing** turns a spoken phrase into an action: describe, answer a question, or toggle live mode (forgiving of transcription errors).\n- **Tools it drives:** Whisper to hear, Qwen3-VL to see and read, Piper to speak, and an on-device detector (COCO-SSD) to watch for change.\n- **A live loop** that perceives (camera + detector), decides whether something new is worth saying, acts (calls the vision model and speaks), and remembers what it already said so it doesn't repeat.\n\n## Safety\nIris describes surroundings and reads text. **It is not a mobility aid.** Don't use it\nto get around or avoid obstacles. It can't judge distance reliably and isn't safe to walk by.\n\n## Run locally\n```bash\npip install -r requirements.txt\nIRIS_WARMUP=1 python app.py     # http://localhost:7860  (warmup preloads the models)\n```\n\n## Credits\nBuilt by **Marcus Ramalho** for his father Marcos, with **Claude Code (Claude Opus 4.8)**.\nThe build is documented as an open [agent trace](https://huggingface.co/datasets/build-small-hackathon/iris-agent-trace).\nSTT: OpenAI Whisper (via faster-whisper) · Vision: **Qwen3-VL** · TTS: **Piper** · UI: Gradio (`gr.Server`).\n"
  },
  {
    "repo_id": "build-small-hackathon/dod-uno",
    "title": "Dod Uno",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/dod-uno",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "nvidia",
      "openbmb",
      "modal",
      "dod"
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Try the app and decide whether it feels beginner-friendly.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: DOD - Deploy or Draw\nemoji: 🃏\ncolorFrom: purple\ncolorTo: pink\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.10'\napp_file: app.py\npinned: false\nhf_oauth: true\nlicense: mit\nthumbnail: https://huggingface.co/datasets/DEVAIEXP/assets/resolve/main/dod_thumb.PNG\ntags:\n- build-small-hackathon\n- thousand-token-wood\n- gradio\n- game\n- multiplayer\n- custom-ui\n- off-brand\n- off-the-grid\n- tiny-titan\n- llama-champion\n- nemotron\n- nvidia\n- openbmb\n- voxcpm2\n- modal\n- llama-cpp\n- codex\n- field-notes\n- track:wood\n- sponsor:openbmb\n- sponsor:openai\n- sponsor:nvidia\n- sponsor:modal\n- achievement:offgrid\n- achievement:offbrand\n- achievement:sharing\n- achievement:llama\n- achievement:fieldnotes\nshort_description: A Multiplayer UNO Game\n---\n\n<p align=\"center\">\n  <img src=\"assets/logo.jpeg\" alt=\"DOD - Deploy or Draw logo\" width=\"180\">\n</p>\n\n<h1 align=\"center\">DOD - Deploy or Draw</h1>\n\n<p align=\"center\">\n  <strong>A multiplayer UNO-inspired game where production incidents become chaos, comedy, and AI-powered table drama.</strong>\n</p>\n\n## Play The Incident\n\nDOD - Deploy or Draw turns a software outage into a strange little card-table emergency.\nPlayers race to resolve a live production crisis before the Director's panic meter explodes.\n\nThe AI is not decoration here. 
It is load-bearing:\n\n- **Nemotron** is a mandatory rival that reads the current board, evaluates playable cards, chooses actions, and selects wild-card colors.\n- The **IT Director** reacts to every played card with short crisis-aware bilingual commentary.\n- **VoxCPM2** turns those Director reactions into voice, making the match feel like an absurd deploy-night war room.\n\nIt is part UNO, part incident response simulator, part tiny corporate theater.\n\n## Demo Links\n\n- **Live Space:** https://huggingface.co/spaces/build-small-hackathon/dod-uno\n- **Demo video:** https://youtu.be/QahSgSKtvNQ\n- **Social post X:** https://x.com/pelolisu/status/2065197829031604618?s=20\n- **Social post Linkedin 1:** https://www.linkedin.com/posts/elismasilva_generativeai-devops-gamedev-share-7470967633346027520-OE7v/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAARdvBoBOi8NC3UXuKIGEUImbEK50Lgwp6E\n- **Field Notes:** https://huggingface.co/blog/elismasilva/dod-uno-en\n- **GitHub repository:** https://github.com/DEVAIEXP/doduno\n\nThe GitHub repository above is the source-of-truth development repository.\nIt contains the full implementation history, installation documentation, local-mode setup, and Codex-attributed commits.\nThis Space repository is a deployment copy prepared for judges to play the finished app.\n\n## Hackathon Track\n\nSubmitted for **An Adventure in Thousand Token Wood**.\n\nThe goal was to build something delightful that would not exist without AI: a toy-like multiplayer game where small models create the fun, tension, and personality of the table.\n\n## Targeted Special Awards And Bonus Quests\n\n| Award or bonus quest | Where judges can validate it |\n|---|---|\n| **NVIDIA Nemotron Quest** | Nemotron Nano 4B is the mandatory AI opponent. 
Gameplay LLM inference runs through the companion Space [`build-small-hackathon/dod-llm-server`](https://huggingface.co/spaces/build-small-hackathon/dod-llm-server), and the game board shows Nemotron taking turns, drawing, playing valid cards, and selecting wild-card stacks. |\n| **OpenBMB Awards** | VoxCPM2 powers the IT Director voice layer. The public demo can call the Modal-hosted VoxCPM2/NanoVLLM endpoint at `https://elismasilva--voxcpm2-nanovllm-service-generate-api.modal.run`, and the GitHub repo includes the local NanoVLLM/VoxCPM2 path. |\n| **Modal Awards** | The hosted TTS route uses Modal for low-friction GPU audio generation through `https://elismasilva--voxcpm2-nanovllm-service-generate-api.modal.run`. See the demo video for Director voice playback during a match. |\n| **OpenAI Track** | Development was assisted by OpenAI Codex. The source-of-truth GitHub repository includes Codex-attributed commits and the implementation history: https://github.com/DEVAIEXP/doduno. |\n| **Tiny Titan** | The project uses small models only: NVIDIA Nemotron Nano 4B for gameplay/Director text and VoxCPM2 for voice. Both stay well below the 32B hackathon cap, and Nemotron is in the <=4B range highlighted by Tiny Titan. |\n| **Off-Brand / Off-Brand Award** | The app uses custom Gradio `gr.HTML` components, board-scoped CSS, browser-side game logic, animated cards, custom toasts, audio controls, and a full game-like interface instead of the default Gradio form look. |\n| **Off the Grid** | The GitHub version supports fully local execution with local LLM, local TTS, local leaderboard, and local inference mapper files. The public Space uses hosted endpoints so judges can play easily, but the local mode is documented in the repo. |\n| **Llama Champion** | The local LLM server runs GGUF inference through `llama-cpp-python` in the `dod-llm-server` submodule. |\n| **Sharing is Caring** | DOD records append-only agent traces for Nemotron turns and Director rea"
  },
  {
    "repo_id": "build-small-hackathon/Wall-Street-of-AI-Agents",
    "title": "Wall Street Of Ai Agents",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Wall-Street-of-AI-Agents",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "openbmb",
      "wall",
      "street",
      "agents"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 4,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Wall Street of AI Agents\nemoji: 🐺\ncolorFrom: red\ncolorTo: gray\nsdk: docker\npython_version: \"3.11\"\napp_file: app.py\npinned: true\nlicense: apache-2.0\nshort_description: \"A multi-agent trading firm powered by small language models\"\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - off-the-grid\n  - llama-champion\n  - off-brand\n  - tiny-titan\n  - best-agent\n  - best-demo\n  - bonus-quest-champion\n  - field-notes\n  - qwen3.5\n  - nvidia-nemotron-3-nano\n  - minicpm\n  - openbmb\n  - finance\n  - multi-agent\n  - agentic-ai\n  - simulation\n  - benchmark\n  - phaserjs\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n\nmodels:\n  - openbmb/MiniCPM5-1B-GGUF\n  - unsloth/Qwen3.5-2B-GGUF\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF\ndatasets:\n  - build-small-hackathon/Wall-Street-Agent-Traces\n---\n\n<div align=\"center\">\n\n<div align=\"center\">\n\n<img src=\"https://cdn-uploads.huggingface.co/production/uploads/679f1bd801f97ba49a8e81ec/9C_0e8PbY9TAjtd7xrOwE.png\" width=\"900\">\n\n</div>\n\n<h1 align=\"center\" style=\"font-size: 64px;\">\n🐺 WALL STREET OF AI AGENTS\n</h1>\n\n<h3><i>An autonomous trading firm run entirely by AI Agents.</i></h3>\n\n\n\n[![Hackathon](https://img.shields.io/badge/Build_Small_Hackathon_2026-🍄_Thousand_Token_Wood-8B5CF6?style=for-the-badge)](https://huggingface.co/build-small-hackathon)\n[![Model](https://img.shields.io/badge/Model-Small%20Language%20Models-F97316?style=for-the-badge)](https://huggingface.co/openbmb/MiniCPM5-1B-GGUF)\n[![Runtime](https://img.shields.io/badge/Runtime-llama.cpp-22C55E?style=for-the-badge)](#)\n[![Agents](https://img.shields.io/badge/Agents-4_Autonomous_Traders-3B82F6?style=for-the-badge)](#)\n\n**▶️ [Enter the Trading Floor (Live 
Space)](https://huggingface.co/spaces/build-small-hackathon/Wall-Street-of-AI-Agents) · 🎬 [Watch the Demo Video](https://youtu.be/1XZuUsiwuTA) · 📓 [Read the Field Note](https://huggingface.co/blog/build-small-hackathon/wall-street-of-ai-agents) · 📢 [Read the Social Post](https://x.com/ashdebugs/status/2065443044833562840) · 🔥 [GitHub Repository](https://github.com/Ashish-Patnaik/Wall-Street-of-AI-Agents)**\n\n</div>\n\n---\n\n![preview](https://cdn-uploads.huggingface.co/production/uploads/679f1bd801f97ba49a8e81ec/pWmQqMc9eFooLVXzn-fpf.webp)\n\n> *Sarah leads the leaderboard at $10,700. Alice is arguing with Mike in the hallway. Alex is alone in the Office. The market is Stagnant. Panic ensues.*\n\n---\n\n## 📈 The Vision: A Game & A Benchmark\n\nMost AI agent frameworks are built for B2B tasks: writing code, scraping data, summarizing emails. We wanted to build something weird, delightful, and highly voyeuristic.\n\n**Wall Street of AI Agents** is a high-frequency trading reality show. Four AI agents, each with a distinct personality and financial strategy, wander a pixel-art office, overhear each other, and make real trades continuously, forever, without a single human input.\n\nBut beneath the retro 2D graphics lies something much more ambitious: **a visual benchmark for testing multi-agent collaboration and LLM reasoning.**\nBy trapping a tiny 1-billion- or 2-billion-parameter model (or any small model) in a dynamic financial simulation, we can stress-test its capabilities live:\n* **Strict Instruction Following:** Can the model adhere to a rigid JSON schema while adopting a complex persona?\n* **Regime Adaptation:** When the market shifts from a *Tech Boom* to a *Market Crash*, does the model adjust its trading strategy, or does it stubbornly hold?\n* **Spatially-Aware Social Dynamics:** Can agents hold a logical conversation based *only* on who is physically standing in the same room as them?\n\n---\n\n## 👔 The Four Agents\n\nFour traders. Four worldviews. 
One shared office where they cannot stop running into each other. Every agent starts with **$10,000**.\n\n\n---\n<table>\n<tr>\n\n<td width=\"25%\" align=\"center\">\n<img src=\"dist/assets/faces/Alex.png\" width=\"140\" alt=\"Alex\"><br>\n<h3>💎 Alex</h3>\n<b>The Crypto Degen</b><br>\n<i>Reckless. Infectious. Convinced every regime is secretly a bull run. Will talk anyone into a bad trade with sheer confidence. Thrives in Crypto Frenzies. Obliterated in crashes.</i>\n</td>\n\n<td width=\"25%\" align=\"center\">\n<img src=\"dist/assets/faces/Sarah.png\" width=\"140\" alt=\"Sarah\"><br>\n<h3>🧊 Sarah</h3>\n<b>Bear Market Shorter</b><br>\n<i>Cynical. Precise. Has been waiting for the market to collapse since tick one. Quietly holds bonds while everyone else panics. Weirdly calm when it all falls apart.</i>\n</td>\n\n<td width=\"25%\" align=\"center\">\n<img src=\"dist/assets/faces/Alice.png\" width=\"140\" alt=\"Alice\"><br>\n<h3>📊 Alice</h3>\n<b>The Algo Trader</b><br>\n<i>Cold. Methodical. Speaks in probabilities. Ignores breaking news unless it clears her signal threshold. Consistently the most rational agent in any room she walks into.</i>\n</td>\n\n<td width=\"25%\" align=\"center\">\n<img src=\"dist/assets/faces/Mike.png\" width=\"140\" alt=\"Mike\"><br>"
  },
  {
    "repo_id": "build-small-hackathon/read_the_room",
    "title": "Read_The_Room",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/read_the_room",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "read_the_room",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Read The Room\nemoji: 👀\ncolorFrom: yellow\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.16.0\npython_version: \"3.13\"\napp_file: app.py\npinned: false\ntags:\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Track 2: Read the Room\n\nPlay [here](https://huggingface.co/spaces/build-small-hackathon/read_the_room)\n\nYou're a player in a situation where each character has an agenda and an opinion of everyone else, including you. You talk your way to a goal. The inspiration is text adventures RPGs like Zork crossed with social deduction games like Avalon and Werewolf. The social dynamic is the game - I've always wished videogame dialogues gave me more options to explore. Traditional games have always been constrained in this and current AI games dodge the social element: they usually focus on one character, or fixed characters on rails. So I built a game with multiple AIs reacting to each other and especially you (the player)\n\nAnyone can publish a scenario (optional but recommended scene picture) and play what others shared\n\n\n# Merit Badges\n\n## Off the grid\n\nThe app makes zero network requests of its own - art and font are bundled and inlined. Point `RTR_API_BASE` at a local llama.cpp/vLLM and it runs fully offline (the hosted Space points it at a Modal GPU instead)\n\n## Off-Brand\n\nThe default Gradio doesn't work well for a game, so I gave it some style with a custom front-end\n\n## Llama champion\n\nLlama.cpp is the canon way to play the game - start a server locally and wire it up. I playtested with Qwen3.6-27B (Q8_K_XL gguf from unsloth)\n\n## Sharing is Caring\n\nPlaythrough traces - raw prompt and completions are available. You can use 🐞 export function to see what happened each turn - what fed into the LLM and what the response was. 
Here is one: [Ilia-Iliev/read-the-room-traces](https://huggingface.co/datasets/Ilia-Iliev/read-the-room-traces)\n\n## Field Notes\n\nMore details in my [blogpost](https://ilia.foo/blog/read_the_room)\n\n[Video](https://www.youtube.com/watch?v=eY81SvsOJy4)\n\n[Social Media post](https://www.linkedin.com/posts/ilia-iliev-7903a015a_read-the-room-a-hugging-face-space-by-build-small-hackathon-activity-7471300743883522048-rKGz?utm_source=share&utm_medium=member_desktop&rcm=ACoAACYk8lwBA6YMOvmo6sowMJ679w1Iy48UvP0)\n"
  },
  {
    "repo_id": "build-small-hackathon/storyboard-tui",
    "title": "Storyboard Tui",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/storyboard-tui",
    "zone": "Creative Studio",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "achievement",
      "storyboard",
      "tui",
      "summary",
      "available"
    ],
    "difficulty": "Casual",
    "quest": "🎨 Creative Studio: Try a playful prompt and review the output style.",
    "likes": 3,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎨",
    "readme_text": "---\ntitle: Professor Wren's Story Rooms\nemoji: 📚\ncolorFrom: green\ncolorTo: yellow\nsdk: docker\npinned: false\nlicense: mit\nshort_description: Build a four-beat storyboard with an archival AI professor\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:sharing\n  - achievement:fieldnotes\n  - achievement:llama\n---\n\n# Professor Wren's Story Rooms\n\nProfessor Alder Wren is a fictional English philologist and storytelling\nlecturer reconstructed from old university tapes. Bring him a rough idea and\nhe will shape it into a four-beat YouTube storyboard, then remain in the\nworkspace to help sharpen hooks, scenes, turns, and sentences.\n\nThe experience is presented as a terminal recovered from Wren's archive:\n\n- MiniCPM builds the initial four-scene storyboard.\n- NVIDIA Nemotron becomes Professor Wren in the workspace conversation.\n- FLUX creates the pseudo-ASCII scene artwork.\n- Storyboards and the conversation with Wren can be exported as a PDF.\n\n## Hackathon submission\n\n**Category:** An Adventure in Thousand Token Wood\n\nProfessor Wren turns a rough premise into a compact four-beat adventure, then\nlets the story-maker explore and refine each scene through an in-character\nterminal conversation.\n\n**Bonus quest: Off the Grid**\n\nAll language intelligence in the app is provided by small language models:\n\n- `openbmb/MiniCPM5-1B` is the 1B storyboard planner.\n- `nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF` is the 4B conversational model,\n  served as a Q4_K_M quantization.\n\nThe complete text pipeline stays in the **1B-4B parameter range**. 
No large\nlanguage model is used at runtime, qualifying the project for both **Off the\nGrid** and the **Tiny Titan** badge.\n\n## Demo and write-up\n\n- **YouTube demo:** [Watch the demo](https://youtu.be/1iKWagnEyHs)\n- **Blog write-up:** [Professor Wren's Story Rooms](https://huggingface.co/blog/build-small-hackathon/wren-storyrooms)\n- **X post:** [on my profile](https://x.com/i/status/2066248756081709557)\n\n## Badge qualifications\n\n| Badge / track | Evidence |\n| --- | --- |\n| Tiny Titan | Every runtime language model is between 1B and 4B parameters. |\n| Off the Grid | The app's planning and chat features use only SLMs. |\n| OpenAI | The application was built end to end with Codex. The reviewable agent trace is linked below. |\n| OpenBMB | MiniCPM5-1B provides structured four-scene storyboard planning. |\n| NVIDIA | NVIDIA Nemotron-3-Nano-4B-GGUF Q4_K_M powers Professor Wren's workspace chat. |\n| llama.cpp | `llama-server` runs the quantized Nemotron model with CUDA acceleration and OpenAI-compatible chat inference. |\n| Modal | Both SLMs run on serverless L4 GPUs with CPU and GPU memory snapshots, warmup passes, and app-triggered prewarming to reduce cold starts. |\n\n### Codex agent trace\n\nThe OpenAI team can review the public build trace here:\n\n**[drdavidtang/build-small-agent-trace](https://huggingface.co/datasets/drdavidtang/build-small-agent-trace)**\n\n## How it works\n\nThe Hugging Face Space serves the React interface and FastAPI gateway.\n`openbmb/MiniCPM5-1B` runs on a Modal L4 and assembles the structured\nstoryboard through small, validated planning calls.\n\n`nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF` handles Professor Wren's conversational\nresponses in the main storyboard workspace. Its Q4_K_M quantization runs\nthrough CUDA-accelerated llama.cpp with CPU and GPU memory snapshots. 
The app\nstarts warming this container on first load so Wren is more likely to be ready\nwhen the user reaches the workspace.\n\nThe repository also includes 20 direct-FLUX pseudo-ASCII WebP assets. They are\nused during onboarding and while the planner wakes, making cold-start time part\nof the archival terminal experience. Text pseudo-ASCII files remain available\nas the final fallback when live image generation fails.\n\n## Prewarm artwork\n\nWith `BFL_API_KEY` configured, regenerate all 20 direct-FLUX pseudo-ASCII WebP\nassets with:\n\n```bash\nPYTHONPATH=. python scripts/generate_bfl_fallbacks.py --force\n```\n\n## Deployment\n\nDeployment is split between the Hugging Face Docker Space and two Modal GPU\nservices:\n\n```bash\nmodal setup\nexport STORY_FORGE_API_KEY=\"$(openssl rand -hex 32)\"\n./scripts/modal_setup.sh\n./scripts/modal_deploy.sh\n```\n\nAdd these Space secrets:\n\n```text\nMODAL_INFERENCE_URL=<MiniCPM /chat URL>\nNEMOTRON_INFERENCE_URL=<Nemotron /chat URL>\nMODAL_API_KEY=<STORY_FORGE_API_KEY value>\nBFL_API_KEY=<Black Forest Labs API key>\n```\n\nOptionally set `BFL_IMAGE_MODEL`; it defaults to `flux-2-klein-4b`. Restart the\nSpace after changing its configuration.\n"
  },
  {
    "repo_id": "build-small-hackathon/watch-my-escape",
    "title": "Watch My Escape",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/watch-my-escape",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "watch",
      "escape",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: WATCH MY ESCAPE\nemoji: 😠\ncolorFrom: gray\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.16.0\npython_version: \"3.12\"\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: An LLM tries to escape your puzzle room.\nmodels:\n  - JetBrains/Mellum2-12B-A2.5B-Thinking-GGUF-Q4_K_M\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF\n  - openbmb/MiniCPM5-1B-GGUF\n  - CohereLabs/tiny-aya-global-GGUF\n  - ggml-org/gemma-4-12B-it-GGUF\npreload_from_hub:\n  - JetBrains/Mellum2-12B-A2.5B-Thinking-GGUF-Q4_K_M Mellum2-12B-A2.5B-Thinking-Q4_K_M.gguf\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF NVIDIA-Nemotron3-Nano-4B-Q4_K_M.gguf\n  - openbmb/MiniCPM5-1B-GGUF MiniCPM5-1B-Q4_K_M.gguf\n  - CohereLabs/tiny-aya-global-GGUF tiny-aya-global-q4_k_m.gguf\n  - ggml-org/gemma-4-12B-it-GGUF gemma-4-12B-it-Q4_K_M.gguf\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# WATCH MY ESCAPE\n\nWatch LLMs try to solve your handmade escape rooms.\n\nFeatures:\n- LLMs that love your puzzles and trying to escape them.\n- A fully loaded map editor to create your magnum opus.\n\nActive participants:\n- JetBrains Mellum2 12B\n- Nvidia Nemotron 3 Nano 4B\n- OpenBMB MiniCPM5 1B\n- Cohere Tiny Aya\n- Google Gemma 4 12B\n\nAll are using Q4_K_M variants and should fit in about 8GB of VRAM.\n\n## Helpful links\n\n- Video: https://www.youtube.com/watch?v=9NdsC1W97oQ\n- Blog: https://che.codes/watch-my-escape/\n- Social: https://www.reddit.com/r/LocalLLaMA/comments/1u6im9i/watch_my_escape_llms_try_to_solve_your_handmade/\n- GitHub: https://github.com/cjami/watch-my-escape (If observing a deployed version: Full commit history here)\n\n## Tech stack\n\n- Python 3.12+ with Gradio, Jinja2, and Pydantic.\n- Local LLM inference via llama-cpp-python, and Hugging Face ZeroGPU.\n- Browser assets built with Tailwind CSS 4 
and esbuild.\n- Development tooling with uv, Ruff, ty, and pytest.\n- Optional tracing with Langfuse.\n\n## Prerequisites\n\n- `uv` for Python environment management: https://docs.astral.sh/uv/getting-started/installation/\n- Node.js and npm for building browser assets: https://nodejs.org/\n\n## Quick Start\n\nInstall the prerequisites, then run the following:\n\n```shell\ngit clone https://github.com/cjami/watch-my-escape.git\ncd watch-my-escape\nuv run watch-my-escape\n```\n\nThe command sets up missing dependencies, builds the browser assets, installs the best local `llama-cpp-python` backend it can detect, starts the local server, and opens the game in your browser.\n\nOn first run, the selected GGUF model downloads from Hugging Face. That can take a while and may use several gigabytes of disk space.\n\n## Setup Options\n\nRun setup without starting the server:\n\n```shell\nuv run watch-my-escape --setup-only\n```\n\nForce setup to run again:\n\n```shell\nuv run watch-my-escape --force-setup\n```\n\nStart the server without opening a browser:\n\n```shell\nuv run watch-my-escape --no-browser\n```\n\nOverride the detected local LLM backend:\n\n```shell\nuv run watch-my-escape --llm-profile metal\nuv run watch-my-escape --llm-profile cuda\nuv run watch-my-escape --llm-profile vulkan\nuv run watch-my-escape --llm-profile rocm\nuv run watch-my-escape --llm-profile cpu\n```\n\nAuto-detection prefers Apple Metal, then NVIDIA CUDA, then Vulkan, then CPU. ROCm is available as an explicit override because ROCm support depends more heavily on the installed OS, GPU, and driver stack.\n\n## Langfuse Tracing\n\nLangfuse tracing is optional. 
Add these variables to your shell or a local `.env` file:\n\n```shell\nLANGFUSE_TRACING_ENABLED=true\nLANGFUSE_SECRET_KEY=...\nLANGFUSE_PUBLIC_KEY=...\nLANGFUSE_BASE_URL=...\n```\n\nSet `LANGFUSE_TRACING_ENABLED=false` to disable tracing locally.\n\n## Development\n\nCommon contributor commands:\n\n```shell\nmake app\nmake test\nmake lint\nmake format\nmake assets\n```\n\nWithout Make:\n\n```shell\nuv run watch-my-escape\nuv run pytest\nuv run ruff check .\nuv run ty check\n```\n\nCheck the current local LLM setup:\n\n```shell\nuv run watch-my-escape-doctor\n```\n\nEvaluate model reliability for structured action JSON:\n\n```shell\nuv run watch-my-escape-eval-models --preset minicpm5-1b\n```\n"
  },
  {
    "repo_id": "build-small-hackathon/sokrates",
    "title": "Sokrates",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/sokrates",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sokrates",
      "summary",
      "available",
      "yet.",
      "gradio",
      "region"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Sokrates\ncolorFrom: blue\ncolorTo: indigo\nsdk: gradio\nsdk_version: 4.44.0\npython_version: \"3.11\"\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: AI-assisted clinical intake assistant\n---\n\n# Sokrates\n\n🔗 **Social Media Post + Video:** https://x.com/guidoputignano/status/2065494688501506353\n\nAI-assisted **clinical intake** for a medical visit. Sokrates listens to a\ndoctor–patient conversation, transcribes it, automatically fills in a structured\nclinical intake form, and suggests the questions the doctor still needs to ask.\n\nTwo sources of suggested questions:\n1. **Gaps** — mandatory form fields that are still empty (deterministic engine).\n2. **Clinical follow-ups** — coherent next questions generated by the LLM.\n\n> **Sokrates does not make diagnoses.** It prepares and structures data collection.\n\n## Architecture\n\n```\naudio -> ASR (+ optional diarization) -> orchestration loop ->\n  [structured extraction + gap engine + question generation] -> three-panel UI\n```\n\n- **UI**: Gradio Blocks, three columns — live transcript, the intake form filling\n  in (green = filled, grey = missing), and a suggested-questions panel.\n- **ASR**: `faster-whisper` (large-v3), `language=\"en\"`. Uploaded audio file\n  (priority) and live microphone. Optional diarization via `pyannote` behind a flag.\n- **LLM**: any OpenAI-compatible endpoint (e.g. a vLLM server on Modal running\n  **Qwen3-14B**, ≤ 32B params). Configured purely through environment variables.\n- **Schema**: oncology intake form as a Pydantic model (`sokrates/schema.py`).\n\n## How to use\n\n1. Provide a transcript one of three ways:\n   - **Sample dialogue** — pick one and click *Load sample* (no audio needed).\n   - **Upload / record a file** — click *Transcribe audio* to run ASR.\n   - **Live microphone** — switch to the *Live microphone* tab and speak;\n     Sokrates transcribes and updates the form automatically on pauses.\n2. 
Click **Analyze transcript → update** (for the upload/sample paths).\n3. Watch the **clinical form** fill in (green = filled, grey = optional missing,\n   red-tinted = mandatory still missing) and the **suggested questions** appear\n   (missing mandatory fields first, then coherent follow-ups). *Reset* clears the\n   session.\n\n## Modules\n\n| File                        | Role                                                        |\n| --------------------------- | ----------------------------------------------------------- |\n| `sokrates/schema.py`        | Oncology intake form (Pydantic) + field metadata + merge.   |\n| `sokrates/asr.py`           | faster-whisper transcription (file + live mic), diarization.|\n| `sokrates/llm.py`           | OpenAI-compatible client: extraction + question generation. |\n| `sokrates/gaps.py`          | Deterministic gap engine (no LLM).                          |\n| `sokrates/orchestrator.py`  | The loop: extract → gaps → questions.                       |\n| `app.py`                    | Gradio three-panel UI.                                       |\n\n## Local run\n\n```bash\npython -m venv .venv && source .venv/bin/activate\npip install -r requirements.txt\npython app.py\n```\n\nOpen the local URL Gradio prints (default http://127.0.0.1:7860).\n\n## Configuration (environment variables)\n\nNo credentials live in the code. 
The LLM client is configured at runtime:\n\n| Variable            | Purpose                                         | Example                                  |\n| ------------------- | ----------------------------------------------- | ---------------------------------------- |\n| `MODEL_BASE_URL`    | OpenAI-compatible base URL                       | `https://your-modal-app.modal.run/v1`    |\n| `MODEL_API_KEY`     | API key / token for the endpoint                 | `sk-...` (or any token your server wants)|\n| `MODEL_NAME`        | Model identifier served by the endpoint          | `Qwen/Qwen3-14B`                         |\n| `ENABLE_DIARIZATION`| `1` to enable pyannote diarization (optional)    | `0`                                      |\n| `WHISPER_MODEL`     | faster-whisper model size                        | `large-v3`                               |\n| `HF_TOKEN`          | Hugging Face token (only if using pyannote)      | `hf_...`                                 |\n| `SOKRATES_GUIDED_JSON` | `1` to use vLLM `guided_json` instead of `response_format` | `0`                  |\n| `SOKRATES_NO_THINK` | `1` to disable Qwen3 thinking (faster, terser)   | `0`                                      |\n| `MODEL_TEMPERATURE` | sampling temperature (extraction uses `0.0`)     | `0.0`                                    |\n\n### Pointing at a Modal vLLM endpoint\n\nThis repo ships a ready-to-deploy Modal script (`modal_vllm.py`) that serves\n**Qwen3-14B** as an OpenAI-compatible vLLM endpoint.\n\n```bash\n# 1. One-time setup\npip install modal\nmodal token new\n# Choose the API key the endpoint will require (reuse it as MODEL_API_KEY):\nmodal secret create sokrates-llm MODEL_API_KEY=sk-sokrates-demo-123\n\n# 2. Deploy (prints a ht"
  },
  {
    "repo_id": "build-small-hackathon/open-to-work-musical",
    "title": "Open To Work Musical",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/open-to-work-musical",
    "zone": "Career Camp",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "open",
      "work",
      "musical",
      "summary"
    ],
    "difficulty": "Useful",
    "quest": "💼 Career Camp: Test it with a sample career scenario.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "💼",
    "readme_text": "---\ntitle: \"Open to Work: The Musical\"\nemoji: 🎭\ncolorFrom: yellow\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.2.0\napp_file: app.py\npython_version: \"3.12\"\npinned: true\nlicense: mit\nshort_description: Turn your job application into a shareable musical parody\nthumbnail: https://huggingface.co/spaces/build-small-hackathon/open-to-work-musical/resolve/main/thumbnail.png\ntags:\n  - track:wood\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n  - sponsor:openbmb\n  - sponsor:modal\nmodels:\n  - openai/gpt-oss-20b\n  - tanya8997/openwork-understudy-0.5b\n  - vikhyatk/moondream2\n  - ACE-Step/ACE-Step-v1-3.5B\n  - Plachta/Seed-VC\n---\n\n# 🎭 Open to Work: The Musical\n\n**Turn a normal job application into a short parody *application video*** — your résumé, a\nreal role you actually want, your own voice and face, and a song in whatever genre you pick.\n\nJob hunting is a loop: same PDF, same cover letter, same silence. This asks a sillier\nquestion — *what if you applied like it was opening night?* Drop in your résumé and the job\ndescription, drag **one slider** to set how unhinged it gets, and the show writes you custom\nlyrics about that exact role, composes the music, sings it (optionally in *your* voice), and\nlip-syncs the whole number to your headshot. From **\"here's my résumé\"** to **\"here's my\napplication as a music video\"** — a clip you'd genuinely send to stand out, or at minimum\nsend to your group chat so they can watch you lose it in 4/4 time. 
🎟️\n\n![Open to Work: The Musical — a parody application video](thumbnail.png)\n\n> 🎬 **Demo video:** _coming soon_\n> 📓 **Field Notes (Hugging Face):** [Open to Work: The Musical — the build write-up](https://huggingface.co/blog/build-small-hackathon/open-to-work-musical)\n> 📝 **Also on Medium:** [I distilled a 20B model into 0.5B so my résumé could sing](https://medium.com/@tktanya897/i-distilled-a-20b-model-into-0-5b-so-my-r%C3%A9sum%C3%A9-could-sing-4e83c7d2de7d)\n> 📣 **Social post (Reddit):** [r/HuggingFace](https://www.reddit.com/r/huggingface/comments/1u6eoav/i_built_a_resumetoparodymusical_app_for_the_hf/)\n\n---\n\n## 🎟️ Published Artifacts\n\n| Artifact | Link |\n|---|---|\n| 🎭 **Live Space** | _coming soon_ |\n| 🐜 **The Understudy** — our fine-tuned 0.5B lyricist (LoRA on MiniCPM4-0.5B → GGUF) | [`tanya8997/openwork-understudy-0.5b`](https://huggingface.co/tanya8997/openwork-understudy-0.5b) |\n| 📡 **Producer traces** — the agent's full draft→critique→revise loops, as a dataset | [`tanya8997/openwork-producer-traces`](https://huggingface.co/datasets/tanya8997/openwork-producer-traces) |\n\n*Every model in the show is small, every trace is public, and the only thing we kept secret\nis how many takes it took to make the drill verse rhyme. The understudy never gets a night\noff, the Producer critiques its own work harder than any hiring manager, and the whole cast\nfits in a tote bag.* 🎒\n\n---\n\n## 🎚️ How it works (in plain language)\n\n1. **Hand over your audition tapes** — upload a résumé (PDF/txt) and paste the job posting.\n   Add a headshot if you want the music video; record 10 seconds of your voice if you want\n   to actually *sing it* yourself.\n2. **Set the vibe** — pick a genre (or let the AI cast one for contrast), then drag **The\n   Sendability Slider (1–10)**:\n   - **1 — Actually Sendable:** clever, polished, lightly funny. 
Attach it to a real application.\n   - **5 — LinkedIn Risky:** funny, exaggerated, still makes sense.\n   - **10 — HR Has Left the Chat:** full unhinged parody. Desperation as an art form.\n3. **The Producer writes the song for you** — press the button and the AI does the writing:\n   it reads your materials, drafts the lyrics, critiques its own draft against the slider, and\n   revises. You don't write a word; you can peek at the whole thought process if you're curious.\n4. **Raise the curtain** — it composes the music *with sung vocals*, optionally re-sings it in\n   your voice, and lip-syncs the result to your photo. Out comes a shareable video.\n\n**Same résumé. One slider.** Lyrics, music, vocal energy, and on-screen drama all morph\ntogether as you drag it.\n\n---\n\n## 🛠️ The Tech\n\n### The pipeline\n\n```\n  RÉSUMÉ ─┐\n  JOB DESC ┼─►  THE PRODUCER  ──►  lyrics  ──►  ACE-Step 1.5  ──►  song  ─┐\n  GENRE ──┤     gpt-oss-20b        (tagged      (slider sets       (sung    │\n          │     draft→critique      [verse]/     style tags         vocals)  │\n          │     →revise loop)       [chorus])     + bpm)                     │\n          │                                                                  ▼\n  PHOTO ──┤                                                            demucs (split\n          │                                                            vocal / music)\n  VOICE ──┤── (optional) ──► seed-VC ──► your voice on the track ──────────┐│\n  (10s)   │                                                                 ▼▼\n          │                                                            SadTalker\n  SLIDER"
  },
  {
    "repo_id": "build-small-hackathon/iep-generator-v3",
    "title": "Iep Generator V3",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/iep-generator-v3",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "iep",
      "generator",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: AI IEP Generator\nemoji: 🎓\ncolorFrom: blue\ncolorTo: indigo \nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\npinned: true\nshort_description: IEP Generator - AI for Special Education\nlicense: apache-2.0\ntags:\n  - backyard-ai\n  - tiny-titan\n  - off-the-grid\n  - track:backyard\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n---\n\n# 🎓 IEP Generator - AI for Special Education\n\nFast, evidence-based Individualized Education Program generation using small models.\n\n## Problem\n\nSpecial education teachers spend **hours writing IEPs per student**, manually synthesizing data from:\n- Speech & Language Pathology assessments\n- Physical Therapy evaluations\n- Occupational Therapy reports\n- Psychological assessments\n- Behavioral observations\n- Parental input\n\nThis leads to **inconsistent, error-prone documents** and less time for actual student care.\n\n## Solution\n\n**IEP Generator** automatically generates evidence-based IEPs by:\n1. Teacher fills assessment form (3 minutes)\n2. Llama 3.2-7B synthesizes multi-disciplinary data\n3. Generates SMART goals, accommodations, services\n4. Teacher edits & exports to Word (1 minute)\n\n**Result: 70% time savings | Better consistency | Privacy-first**\n\n## Quick Start\n\n### Try It Online\n👉 **[Open IEP Generator](https://huggingface.co/spaces/build-small-hackathon/iep-generator-v3)**\n\n1. Fill assessment data:\n   - Student info (age, diagnosis)\n   - Speech assessment (CFCS level, vocabulary)\n   - Motor assessment (GMFCS level, mobility)\n   - Cognitive & behavioral data\n   - Sensory status\n\n2. Click \"Generate IEP Draft\"\n\n3. Review auto-generated:\n   - Present Levels\n   - SMART Goals (with quarterly benchmarks)\n   - Accommodations & Modifications\n   - Services Recommended\n   - Data Quality Flags\n\n4. 
Edit & export as Word document\n\n### Run Locally\n\n```bash\ngit clone https://github.com/JubayerONROB/iep-generator\ncd iep-generator\npython -m venv venv\nsource venv/bin/activate\npip install -r requirements.txt\nwget https://huggingface.co/TheBloke/Llama-2-7B-GGUF/resolve/main/llama-2-7b.Q4_K_M.gguf -O models/llama-3.2-7b-q4_k_m.gguf\npython app.py\n# Open http://localhost:7860\n```\n\n## Features\n\n✅ **Multi-disciplinary assessment input** - Speech, motor, cognitive, behavior, sensory  \n✅ **Evidence-based goal generation** - Uses Llama 3.2-7B with structured prompts  \n✅ **SMART goals with benchmarks** - Quarterly targets for progress monitoring  \n✅ **International standards** - GMFCS, MACS, CFCS, ICF-CY, DSM-5  \n✅ **Privacy-first** - Local inference, no cloud APIs, no data upload  \n✅ **Export to Word** - Professional IEP documents ready for signing  \n✅ **Real-time editing** - Therapists can customize before export  \n\n## Technology\n\n| Component | Tool | Purpose |\n|-----------|------|---------|\n| Interface | Gradio | Teacher-friendly web UI |\n| LLM | Llama 3.2-7B (quantized) | Evidence-based goal generation |\n| Export | python-docx | Word document creation |\n| Hosting | HF Spaces | Free, easy deployment |\n| Data | JSON files | Local storage (no database) |\n\n## Assessment Frameworks\n\nUses internationally recognized standards:\n\n- **GMFCS** (Gross Motor Function Classification System) - Motor function levels\n- **MACS** (Manual Ability Classification System) - Hand function levels\n- **CFCS** (Communication Function Classification System) - Communication levels\n- **ICF-CY** (International Classification of Functioning - Children & Youth) - Holistic framework\n- **DSM-5** (Diagnostic & Statistical Manual) - Diagnoses\n\n## Results\n\n### Time Savings\n- **Before:** 20+ hours manual IEP writing per student\n- **After:** 5 minutes (3 min form + 1 min generation + 1 min export)\n- **Improvement:** ~70% time savings\n\n### Consistency\n- 
Evidence-based goals grounded in assessment data\n- Consistent SMART goal format\n- Appropriate accommodations for disability type\n\n### Impact\nBuilt for **Proyash Institute of Special Education** (Bangladesh), serving 3,400+ students across 13 branches. Scalable to any special education institution worldwide.\n\n## Demo Video\n\n📹 **[Watch 3-minute demo video](https://youtu.be/dQSnZSz8gJs)**\n\nShows complete workflow:\n- Problem introduction\n- Form filling\n- IEP generation\n- Editing & export\n- Impact statistics\n\n## GitHub Repository\n\n📚 **[Source Code](https://github.com/JubayerONROB/iep-generator)**\n\nAll code open-source. Contributions welcome!\n\n## Team & Attribution\n\n**Built by:** [A.J.A. Jubayer Talukder]  \n**For:** Proyash Institute of Special Education  \n**Hackathon:** Build Small (June 9-15, 2026)  \n\n## License\n\nMIT License - Feel free to use, modify, and distribute\n\n## Support & Contact\n\nHave questions or feedback?\n- Open an issue on GitHub\n- Contact: [ajajubayertalukder@gmail.com]\n\n---\n\n**Special thanks to:**\n- Proyash Institute for the real-world problem\n- Hugging Face for infrastructure\n- Build Small Hackathon for the opportunity\n- International special education community for frameworks & standards\n\n---\n## 📱 Social Media\n\n👉 **[See demo announce"
  },
  {
    "repo_id": "build-small-hackathon/multi-agent-lab",
    "title": "Multi Agent Lab",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/multi-agent-lab",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "achievement",
      "multi",
      "agent",
      "lab",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Multi-Agent Land\nemoji: 🌲\ncolorFrom: green\ncolorTo: indigo\nsdk: gradio\nsdk_version: \"6.16.0\"\npython_version: \"3.10\"\napp_file: app.py\npinned: true\ntags:\n  - agent-demo-track\n  - track:wood\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - sponsor:openbmb\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Multi-Agent Land\n\n**Small models, one shared log, and a clear view of how agents behave in motion.**\n\nMost multi-agent systems are hard to inspect: agents call each other directly and the\nstate gets messy. We wanted to *see* small agents in action — not isolated prompts, but\nsmall models interacting over time: **debating, collaborating, playing games, and\npushing each other** in a shared environment.\n\nSo we built one. Every action — thoughts, tool calls, state updates — is appended to a\nsingle **immutable log**. When one agent asks, another answers, a judge evaluates, and a\nkeeper tracks progress, *nothing is sent agent-to-agent* — every interaction flows\nthrough that one shared ledger. 
So you can follow the whole run, step by step.\n\n---\n\n## Submission\n\n\n- **🎬 Demo video:** https://youtu.be/v8-zR6eTbDM\n- **📣 Social post:** https://www.linkedin.com/posts/gharsallah_huggingface-hackathon-buildsmall-activity-7472383877991501824-8vxO\n- **💻 GitHub link:** https://github.com/abducodez/multi-agent-land\n- **👥 Team (Hugging Face usernames):** <!-- TODO: list every teammate's HF username; each must register + join the org separately -->\n  - `@agharsalah`\n\n> Tags (track + badges) live in the YAML block at the top of this README — without\n> them the project can't be placed in a category.\n\n---\n\n## Quickstart\n\n```bash\nuv sync                           # create .venv and install everything from the lockfile\n\n# Optional: configure live inference (else the app runs fully offline)\ncp .env.example .env              # then set MODAL_WORKSPACE\n\nuv run app.py\n```\n\n> Don't have [uv](https://docs.astral.sh/uv/)? `curl -LsSf https://astral.sh/uv/install.sh | sh`\n\nThe app runs on a **deterministic local stub** with no API key — great for testing\nand demos that need to be fully reproducible.  To go live, deploy the small models\nin [`modal/`](modal/README.md) and set `MODAL_WORKSPACE` in `.env`; every agent then\nbinds to its model by *catalogue key* (`modal/catalogue.py`). There is no generic\ncloud key — live inference is always against models you deploy yourself.\n\n### Run it live\n\nBy default the app runs fully offline on the deterministic stub. To use real\nsmall-model inference — Modal-served models, a persistent Neon/Postgres ledger,\nand the optional mem0 memory index — copy `.env.example` to `.env` and set the\nrelevant variables. 
A live run stays bounded by the Governor and the UI auto-stops\nautoplay at budget/verdict, so it won't loop forever.\n\nSee [docs/runbook-live-mode.md](docs/runbook-live-mode.md) for the step-by-step\nrunbook and the safety story.\n\n### Run tests\n\n```bash\nuv run pytest tests/ -v\n```\n\n---\n\n## What It Is\n\nA **tiny theater engine** powered by specialist small-model agents.  Agents never\ncall each other directly — they post typed events to a shared append-only ledger,\nand every view (the stage, the memory, the UI) is a projection derived from that log.\n\n**The loop is simple:** define the environment and the agent roles, then launch them in\nthe multi-agent lab. Each scenario can run for a long time — agents debate, collaborate,\nplay games, and push each other — while a live telemetry view lets you follow the whole\nrun step by step.\n\nWhat makes it *super modular*:\n- **Config, not code.** Agents, scenarios, casts, model tiers, tool grants, and\n  budgets are declarative YAML under `config/`, validated by a schema. Add a world\n  by adding files — proven by `tests/test_modularity.py` (zero engine edits).\n- **A model per agent.** Each agent declares a logical profile (`tiny`/`fast`/\n  `balanced`/`strong`); a `ModelRouter` binds it to a concrete small model served on\n  Modal — **Nemotron, MiniCPM, Gemma**. 
Mix a ≤4B worker with a ≤32B judge in one cast.\n- **Capability-checked tools.** Agents call tools only if their manifest grants\n  them — the contract that fronts in-process tools today and MCP servers later.\n- **Built to run for hours.** The ledger is the checkpoint: `restore()` resumes a\n  killed run; a token-aware governor bounds spend; `step(n_ticks=N)` maps one\n  wall-clock episode onto N sim-ticks.\n\nThe user can **Start** from any seed, **Advance** a turn, **Drop** a disturbance,\nand **Switch** scenarios — all live.\n\n### Scenarios (each is a YAML config)\n\n| Name | Cognitive task | Cast (model tiers) |\n|---|---|---|\n| 🍄 Thousand Token Wood | Divergent world-growth | Seedkeeper `fast`, Critic `balanced`, Pocket Actor `tiny`, Echo `fast` |\n| 🔍 Mystery Roots | Convergent mystery-solving | Clue Gatherer `fast`, Hypothesis Former `balanced`, Devil's Advocate `fast`, Judge `strong` |\n| 🔮 Oracle Grove | Tool-using prophecy | Seedkeeper `fast`, Fortune-Teller `fast` + `ora"
  },
  {
    "repo_id": "build-small-hackathon/fluentwhisper",
    "title": "Fluentwhisper",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/fluentwhisper",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "fluentwhisper",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: FluentWhisper\nemoji: ☕\ncolorFrom: yellow\ncolorTo: gray\nsdk: gradio\nsdk_version: 5.49.1\napp_file: app.py\npinned: false\nlicense: apache-2.0\nsuggested_hardware: zero-a10g\ntags:\n  - track:backyard\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# FluentWhisper\n\n**Speak messy. Read clean.**\n\nA small LoRA adapter that teaches whisper-large-v3-turbo to take the fillers, repeats,\nand false starts out of raw speech in one local pass, then show you exactly what it\nremoved next to the vanilla model.\n\nIt runs on the laptop you already own. No API keys, no cloud round trip, no frontier\nmodel sitting in the loop. That makes it a workable alternative to cloud dictation for\nanyone who talks faster than they edit, especially non-native English speakers.\n\nAs of June 2026, to our knowledge it is the only Apache-2.0, commercially usable\nopen-source model that removes filled pauses, discourse markers, repetitions, and\nself-repairs end to end. On the DisfluencySpeech test split it scores 3.4% WER\n(whisper-normalized, transcript C) against 9.4% for vanilla Whisper. That benchmark is a\nsingle-speaker acted set, N=250, with a 95% confidence interval of about ±1pp.\n\n## How it was trained\n\nThe training data is synthetic, and we built it ourselves. We started from clean text\ntranscripts in LibriSpeech (the original audio was thrown away), injected disfluencies\ninto that text with custom scripts plus the LARD tool, then voiced the messy versions\nwith Kokoro TTS rotated across 54 voices. That gave us roughly 23k pairs of\n`(disfluent audio, clean text)` to fine-tune on. 
A later blend folded in about 4.5k rows\nfrom the DisfluencySpeech train split to tidy up the label formatting.\n\nSo the model is trained on LibriSpeech-derived synthetic speech, not on DisfluencySpeech.\nDisfluencySpeech is only the real-speech benchmark we report against.\n\n- **Base:** `openai/whisper-large-v3-turbo`\n- **Adapter:** [`pradachan/whisper-large-v3-turbo-disfluency-lora`](https://huggingface.co/pradachan/whisper-large-v3-turbo-disfluency-lora)\n- **Training data:** synthetic, generated from [LibriSpeech](https://www.openslr.org/12/) transcripts and voiced with [Kokoro 82M TTS](https://huggingface.co/hexgrad/Kokoro-82M)\n- **Benchmark:** [DisfluencySpeech](https://huggingface.co/datasets/amaai-lab/DisfluencySpeech) (arXiv:2406.08820)\n- **Trained on:** [Modal](https://modal.com) serverless GPUs, which ran both the LoRA fine-tuning and the eval harness\n- **Hardware:** ZeroGPU (`@spaces.GPU`), so live transcription runs on an on-demand A10G\n\nThe gallery examples are real results from the DisfluencySpeech test set, plus one honest\nfailure. Vanilla Whisper already deletes most \"um/uh\" fillers on its own, so the demo\ndeliberately shows what it does past that point.\n\n## Limitation\n\nThe model deletes aggressively. Intentional repetitions like repeated digits, spelled IDs,\nand phone numbers, along with some self-repairs, can be lost. Do not use it for verbatim,\nlegal, or numeric dictation without a downstream check.\n"
  },
  {
    "repo_id": "build-small-hackathon/bharat_voice_assistant",
    "title": "Bharat_Voice_Assistant",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/bharat_voice_assistant",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "bharat_voice_assistant",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Bharat Voice Assistant\nemoji: 🎙️\ncolorFrom: yellow\ncolorTo: green\nsdk: gradio\nsdk_version: \"6.12.0\"\npython_version: \"3.11\"\napp_file: app.py\nsuggested_hardware: cpu-basic\npinned: false\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n---\n\n# Bharat Voice Assistant\n\n**Project 04** · SLM Hackathon · OpenBMB stack\n\nMultilingual voice-and-text chat for India — speak or type in Hindi, Bengali, Tamil, Telugu, Marathi, Gujarati, Kannada, Malayalam, Punjabi, Urdu, or English and get a concise reply in the same language. Built with **100% OpenBMB** models, **Gradio 6**, and deployable on **Modal A10G** (recommended) or a local 16GB GPU.\n\n| | |\n|---|---|\n| **Models** | `openbmb/MiniCPM-o-4_5` (STT + LLM, 4-bit) · `openbmb/VoxCPM2` (TTS) |\n| **VRAM** | ~7GB (MiniCPM-o 4-bit) · ~11GB with VoxCPM2 |\n| **Port** | `7863` |\n| **Track** | Backyard AI — voice AI for neighbors, parents, and village workers |\n\n---\n\n## What it does\n\n- **Mic in the chat bar** — record speech from the input bar; audio auto-sends when recording finishes.\n- **Text chat** — type in any supported language and press Send.\n- **One omni model for voice** — MiniCPM-o 4.5 bundles Whisper-medium (STT) and Qwen3-8B (LLM); no separate Whisper download at inference time.\n- **TTS (VoxCPM2)** — after each reply, tap the small **♪** icon under the assistant message to hear the response spoken aloud (click again to stop). 
Multilingual, tokenizer-free OpenBMB TTS.\n- **Modal-first** — full UI on GPU in the cloud; local `python app.py` is a thin Gradio client that calls the Modal API (no local GPU required).\n\n---\n\n## API (Free, OpenAI-compatible)\n\nMiniCPM-V and MiniCPM-o models are also available via the **free public OpenBMB API** (useful for quick experiments outside this app):\n\n```\nBase URL : https://api.modelbest.cn/v1\nAPI Key  : set OPENBMB_API_KEY in your .env (see ../.env.example)\nModels   : MiniCPM-V-4.6-Instruct | MiniCPM-V-4.6-Thinking | MiniCPM-o-4.5\n```\n\nSource: https://github.com/OpenBMB/MiniCPM-V/blob/main/docs/api.md\n\nThis project runs **MiniCPM-o 4.5 locally or on Modal** for the voice pipeline — not the HTTP API above.\n\n---\n\n## Architecture\n\n```\nUser speaks (mic) or types text\n       │\n       ▼\n┌──────────────────────────────────────────────────────┐\n│  MiniCPM-o 4.5  (openbmb/MiniCPM-o-4_5, 9B 4-bit)   │\n│                                                      │\n│  ┌─────────────────┐    ┌──────────────────────────┐ │\n│  │ Whisper-medium  │    │  Qwen3-8B backbone       │ │\n│  │ (audio encoder) │───►│  multilingual LLM      │ │\n│  │ auto language   │    │  replies in user's lang  │ │\n│  └─────────────────┘    └────────────┬─────────────┘ │\n└──────────────────────────────────────│──────────────┘\n                                       │ text response\n                                       ▼\n                              gr.Chatbot (Gradio 6)\n                                       │\n                                       ▼\n┌──────────────────────────────────────────────────────┐\n│  VoxCPM2  (openbmb/VoxCPM2, 2B)                     │\n│  Text → multilingual speech (lazy-loaded on Modal)   │\n└──────────────────────────┬───────────────────────────┘\n                           │  ♪ icon under reply → play/stop\n```\n\n---\n\n## Quick start\n\n### Option A — Hosted UI on Modal (recommended)\n\n```bash\ncd 
04_bharat_voice_assistant\npip install modal\nmodal deploy deploy_modal.py\n# → https://<your-workspace>--bharat-voice-ui.modal.run\n```\n\nExample: `https://aniketchopde03--bharat-voice-ui.modal.run`\n\nFirst request after cold start may take 1–2 minutes (GPU scheduling + model load). First **♪** voice reply may take extra time while VoxCPM2 loads on the GPU.\n\n### Option B — Local UI → Modal GPU\n\nRun Gradio on your machine; inference stays on Modal.\n\n```bash\ncd 04_bharat_voice_assistant\npython -m venv venv\nvenv\\Scripts\\activate          # Windows\n# source venv/bin/activate     # macOS/Linux\npip install -r requirements.txt\npip install modal\n\n# Deploy inference API once:\nmodal deploy serve_bharat_modal.py\n# → POST https://<workspace>--bharat-voice-api-bharatvoice-chat.modal.run\n\npython app.py\n# Open http://127.0.0.1:7863\n```\n\nSet a custom endpoint if needed:\n\n```bash\nset CHAT_ENDPOINT=https://<workspace>--bharat-voice-api-bharatvoice-chat.modal.run\npython app.py\n```\n\n### Option C — Fully local (16GB GPU)\n\n```bash\ncd 04_bharat_voice_assistant\npip install -r requirements-local.txt\npip install \"gradio>=6.0.0,<6.18.0\"\n\nset LOCAL_MODE=1\npython app.py\n# Open http://127.0.0.1:7863\n# Needs ~11GB VRAM for MiniCPM-o + VoxCPM2 TTS (~8GB text-only is tight — use Modal instead)\n```\n\nRequires `ffmpeg` on PATH for browser mic formats (webm → wav).\n\n---\n\n## Modal deploy files\n\n| File | Purpose |\n|---|---|\n| `deploy_modal.py` | Full Gradio UI on Modal A10G (`modal deploy deploy_modal.py`) |\n| `serve_bharat_modal.py` | Inference-only API for thin local UI (`modal deploy serve_bharat_modal.py`) |\n| `app.py` | Gradio app + inference logic (mounted on"
  },
  {
    "repo_id": "build-small-hackathon/nemocity",
    "title": "Nemocity",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/nemocity",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "nvidia",
      "nemocity",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Play one round and note what made it fun.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: NEMOCITY\nemoji: 🏙️\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Ask for a building, watch one tiny shared city rise\nmodels:\n  - nvidia/NVIDIA-Nemotron-Nano-9B-v2\ndatasets:\n  - AndresCarreon/nemocity-world\ntags:\n  - build-small-hackathon\n  - track:wood\n  - sponsor:nvidia\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n  - thousand-token-wood\n  - nemotron\n  - nvidia\n  - simulation\n  - city-builder\n  - world-building\n  - three-js\n  - multiplayer\n---\n\n<!-- ^ Space card config. app.py runs as a Gradio Server (gr.Server): it serves\n     the WebGL city at /, the JSON/SSE API under /api, and the gradio endpoint\n     that runs the granting call under /gradio_api. sdk_version 6.16.0 is the\n     pre-SSR gradio proven live on the org ZeroGPU (pareidolia) with this exact\n     launch pattern; pairs with transformers <5 (nemotron_h in-tree from 4.55.4). -->\n\n# NEMOCITY\n\n**Ask for a building and watch it rise — one tiny persistent city the whole\ninternet builds, with a 9B Nemotron running city hall.**\n\nType a petition — *\"a ramen shop near the park\"* — and a small NVIDIA Nemotron\nreads it at city hall, the deterministic engine zones a lot, and you (and every\nother visitor on Earth) watch it rise floor by floor in ONE shared miniature\ncity. There is no level select and no reset button: your cafe stands next to a\nstranger's fire station, forever, on the same 64x64 grid.\n\nThe city is alive between petitions. Cars commute on a shared deterministic\nclock, rush hour jams the Old Bridge twice a day, windows light at dusk, and a\nnews ticker reads the model's blurbs as headlines. 
Press Walk and you are on the\nsidewalk in first person, at street level, under the buildings the internet\nasked for.\n\nThe whole city renders client-side from an append-only event log — judges see a\ngorgeous living diorama with zero GPU spend. The GPU wakes exactly once per\naction, for one one-shot JSON generation.\n\n## How city hall works (the \"her\" pattern)\n\nThe deterministic engine owns ALL facts: city state, placement, geometry, road\nrouting, traffic telemetry. [NVIDIA-Nemotron-Nano-9B-v2](https://huggingface.co/nvidia/NVIDIA-Nemotron-Nano-9B-v2)\nnever outputs a coordinate — it translates your petition into one tiny JSON\npermit (kinds, names, a fuzzy \"near\" hint) and narrates. The engine resolves,\nclamps, and repairs everything, so a visitor action ALWAYS yields visible\nconstruction. The model's raw JSON is shown in the UI on every action (the\n\"City Hall paperwork\" drawer).\n\n```\n        your petition (\"a ramen shop near the park\")\n            |\n            v\n   +- CITY HALL - Nemotron-Nano-9B-v2 (ONE one-shot call) -----+\n   |  {\"intent\":\"build\",\"blurb\":\"...\",                         |\n   |   \"buildings\":[{\"kind\":\"cafe\",\"name\":\"Ramen Ichi\",        |\n   |                 \"near\":\"the park\"}]}                      |\n   |  no coordinates, ever - kinds + names + fuzzy hints only  |\n   +------------------------------+----------------------------+\n                                  | permit JSON\n                                  v\n   +- THE ENGINE - deterministic Python -----------------------+\n   |  synonym-repairs kinds -> scores placement -> clamps      |\n   |  floors -> routes connector roads -> seeds + appends      |\n   |  events; owns ALL state, the model owns none of it        |\n   +------------------------------+----------------------------+\n                                  | world_delta (SSE broadcast)\n                                  v\n   +- THE CITY - Three.js, client-side, zero GPU 
---------------+\n   |  pure function of (event list, shared simTime):           |\n   |  buildings rise floor by floor, cars commute, jams form   |\n   +------------------------------------------------------------+\n```\n\n## The traffic fix (ask the City Engineer)\n\nRush hour is real, simulated telemetry — not theater. When a road segment stays\njammed, a traffic alert appears with one button: **Ask the City Engineer**. The\nengine snapshots the facts (worst cells by demand/capacity with street names,\nthe Traffic Index) and pre-validates candidate fixes with predicted outcomes by\nre-running its static traffic assignment. Nemotron gets `{stats, candidates}`\nand answers with a diagnosis citing the real numbers plus a choice — *\"Old\nBridge carries 3.1x capacity at rush — a second crossing cuts the index\n78 to 41.\"* The engine lays the new road, every client's cars re-route live, and\nthe red drains green while you watch. Facts from the engine, judgment from the\nmodel — small models doing what small models are great at.\n\n## Built for Build Small — Thousand Token Wood\n\n- One model, 9B params, in-process on ZeroGPU — comfortably under the 32B cap.\n- **Off-Brand** — fully custom WebGL frontend on `gr.Server`; gradio powers the\n  API endpoints, not the visible UI.\n- **Sharing is Caring** — every petition's full trace (text, moderation verdict,\n  raw model JSON, resulting events) appends l"
  },
  {
    "repo_id": "build-small-hackathon/llm-fishing",
    "title": "Llm Fishing",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/llm-fishing",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "llm",
      "fishing",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Fish Anything\nemoji: 🎣\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: \"6.17.3\"\npython_version: \"3.12\"\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Describe anything as bait — MiniCPM + FLUX Klein on Modal\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offbrand\nmodels:\n  - openbmb/MiniCPM4.1-8B\n  - black-forest-labs/FLUX.2-klein-4B\n  - black-forest-labs/FLUX.2-klein-9B\n---\n\n# Fish Anything 🎣\n\nDescribe **anything** as bait. See if you get any bites, what crazy fish will surface, and how good your reeling technique is. Unique bait and fish images every time!\n\n**Play:** https://build-small-hackathon-llm-fishing.hf.space\n\n> **Note:** Inference runs on Modal GPUs. The first cast after idle can take **1–3 minutes** while containers cold-start. Later casts are much faster.\n\n## How to play\n\n1. **Bait the hook** - type anything to use it as bait, could be objects concepts, or anything you can thinnk of.\n2. **Cast** - try to find the bait; the LLM judges whether it would work.\n3. **Fish on!** - read the description of the fish, then reel it in.\n4. **Reel** - match your technique description to the specific fish; hit the rhythm window.\n5. **Aquarium** - caught fish go here; stored in your browser, or can export/import backups.\n\n## Submission\n\n| | |\n|---|---|\n| Demo | [watch](https://build-small-hackathon-llm-fishing.hf.space/static/demo/fish_anything_demo.mp4) |\n| Social | [Show HN](https://news.ycombinator.com/item?id=48547932) |\n\n**Track:** Thousand Token Wood (`track:wood`)  \n**Also tagged:** `sponsor:openbmb`, `sponsor:modal`, `achievement:offbrand`\n\nSetup and development notes: [docs/DEV.md](docs/DEV.md)\n"
  },
  {
    "repo_id": "build-small-hackathon/secret-student",
    "title": "Secret Student",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/secret-student",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "track",
      "secret",
      "student",
      "summary"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Try one learning prompt and check if the explanation is clear.",
    "likes": 3,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: Secret Student\nemoji: \"\\U0001F30D\"\ncolorFrom: purple\ncolorTo: gray\nsdk: docker\napp_port: 7860\npinned: false\nlicense: mit\nshort_description: \"A Pokemon-inspired, local-first AI school adventure\"\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n<div align=\"center\">\n\n# Secret Student\n\n### A Pokemon-inspired AI school game built for a niece who loves games and hates school\n\n[![Build Small: Backyard](https://img.shields.io/badge/Build_Small-Backyard_Adventure-6d28d9)](https://huggingface.co/build-small-hackathon)\n[![Build Small: Woods](https://img.shields.io/badge/Build_Small-Deploy_to_the_Woods-166534)](https://huggingface.co/build-small-hackathon)\n[![Live Space](https://img.shields.io/badge/Play-Live_Space-fbbf24?logo=huggingface&logoColor=black)](https://huggingface.co/spaces/build-small-hackathon/secret-student)\n[![Source](https://img.shields.io/badge/Source-GitHub-181717?logo=github)](https://github.com/asanwari/secret-student)\n[![llama.cpp](https://img.shields.io/badge/Inference-llama.cpp-2563eb)](https://github.com/ggml-org/llama.cpp)\n\n**Learn at school. Get briefed by \"Grandma\". 
Defeat a villain with what you learned.**\n\n<p align=\"center\">\n  <img src=\"frontend/static/assets/world-map-agent.png\" alt=\"Pixel-art Secret Student main map with school, home, and headquarters\">\n</p>\n\n<table>\n  <tr>\n    <td width=\"50%\"><img src=\"frontend/static/assets/classroom-blank.png\" alt=\"Pixel-art Secret Student classroom\"></td>\n    <td width=\"50%\"><img src=\"frontend/static/assets/bedroom-agent.png\" alt=\"Pixel-art Secret Student bedroom\"></td>\n  </tr>\n  <tr>\n    <td align=\"center\"><strong>School: lessons, teacher chat, and quizzes</strong></td>\n    <td align=\"center\"><strong>Home: review and mission briefings</strong></td>\n  </tr>\n</table>\n\n</div>\n\nSecret Student turns an AI-generated lesson into a retro 2D mission. The player\nis a student by day and a secret agent after class. Lessons become intelligence,\nquizzes become training, and the final assessment becomes a boss battle.\n\nThe whole AI stack can run locally through two small GGUF models and llama.cpp.\nThere is no required cloud model API, and the custom Phaser interface looks and\nplays like a game rather than a default chatbot.\n\n## Hackathon Submission\n\nSecret Student is submitted to **both** [Build Small](https://huggingface.co/build-small-hackathon)\ntracks. The required organization submission is\n**[build-small-hackathon/secret-student](https://huggingface.co/spaces/build-small-hackathon/secret-student)**.\n\n> [!IMPORTANT]\n> The organization Space cannot currently provision GPU hardware, so the live\n> submission runs the game in a CPU-hosted Space and serves the current text and\n> vision models through a routed Modal GPU deployment. 
This is a hosting\n> workaround, not a model-size requirement: the same source can run both models\n> in one local llama.cpp deployment when GPU hardware is available.\n\n## Team\n\n- [asanwari](https://huggingface.co/asanwari)\n\n## Judges: Inference and GPU Setup\n\n> [!CAUTION]\n> **Please read this before evaluating latency.** The current submission calls a\n> Modal-hosted llama.cpp deployment because the organization Space cannot\n> provision a GPU. Modal scales down when idle, so the first lesson or\n> handwritten-answer check after an idle period can be **significantly slower\n> while the GPU starts and the GGUF models load**. This is cold-start latency,\n> not the normal warm inference path. If you encounter LLM unavailable errors, please retry.\n\nThe current model pair uses about **10 GB of VRAM** when loaded together. It can\nrun on one GPU with at least 16 GB of VRAM, including:\n\n- NVIDIA GeForce RTX 4060 Ti 16 GB\n- NVIDIA GeForce RTX 4070 Ti SUPER 16 GB\n- NVIDIA GeForce RTX 4080 / 4080 SUPER 16 GB\n- NVIDIA GeForce RTX 3090 or RTX 4090 24 GB\n- AMD Radeon RX 7800 XT or 7900 GRE 16 GB\n- AMD Radeon RX 7900 XT 20 GB or 7900 XTX 24 GB\n\n### LLM Runtime Options\n\n`LLM_RUNTIME` selects the inference topology:\n\n```text\nmock                     deterministic local content with no model server\nexternal                 OpenAI-compatible external routes, including Modal\nembedded_llamacpp        one llama.cpp server beside the app\nembedded_dual_llamacpp   separate local text and vision llama.cpp servers\n```\n\nThe submission currently uses `external`. The fully local GPU path is\n`embedded_dual_llamacpp`.\n\n### How to Test Completely Locally\n\nThe Docker image already includes CUDA-enabled llama.cpp, and `app.runtime`\nalready knows how to launch and health-check separate text and vision servers.\nNo code changes or additional inference service are required.\n\n1. Run on a machine, Docker host, or Space with a suitable GPU.\n2. 
In `.env` or **Settings -> Variables and secrets**, replace the external\n   runtime configuration with the values below.\n3. Restart the app. On"
  },
  {
    "repo_id": "build-small-hackathon/nightwave",
    "title": "Nightwave",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/nightwave",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "modal",
      "nightwave",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 7,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: NIGHTWAVE\nemoji: 📻\ntags:\n  - build-small-hackathon\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n  - achievement:sharing\n  - tiny-titan\n  - best-demo\n  - llama-cpp\n  - minicpm\n  - modal\n  - audio\ncolorFrom: indigo\ncolorTo: gray\nsdk: docker\napp_port: 7860\npinned: false\nlicense: apache-2.0\nshort_description: A late-night radio station with a live AI DJ + call-ins\n---\n\n# NIGHTWAVE 📻\n\n**A warm, hand-built 1970s late-night radio station you actually tune into — and the\nDJ is a 1-billion-parameter model.**\n\nTurn the dial to 98.6 and a late-night host (voiced by Kokoro's `am_michael`) keeps you\ncompany through the small hours. He spins records by artists who don't exist, reads\ndedications, gives the weather for towns that aren't on any map, muses into the dark — and\n**takes your call, live, on air**. Every record is **generated in your browser** in real\ntime (no samples, no licensing). Pick up the handset — or just hit **Space** — and talk\nto him.\n\nIt's the one entry you can review with your eyes closed.\n\n## Demo video\n\n- Demo: https://youtu.be/lA46z2mYjF0\n\n## Field Notes write-up\n\n- Article: https://huggingface.co/blog/build-small-hackathon/nightwave-radio\n\n## Social post\n\n- Thread: https://x.com/ratandeep/status/2066658403762798650\n- Post: https://x.com/ratandeep/status/2066657171073532119\n\n## Open trace\n\n- Dataset: https://huggingface.co/datasets/build-small-hackathon/nightwave-traces\n\n## What makes it tick\n\n- **A live AI DJ.** A fine-tunable **MiniCPM5-1B** (≈1.08B params) writes the patter — station\n  IDs, late-night thoughts, fictional weather, dedications, and song intros — fresh every\n  break. 
He answers callers by improvising around their real words: unfakeable proof the model\n  is live and load-bearing.\n- **Music made in the browser.** A small Web Audio engine renders a different ~60s lo-fi /\n  ambient instrumental per track — varied by key, scale, tempo, and timbre — from a curated\n  bank of fictional records (title + artist + vibe). No copyrighted audio; a documented slot\n  to drop in your own CC0 mp3s.\n- **A real broadcast signal.** Kokoro's voice runs through an AM band-pass (300 Hz–3.4 kHz),\n  light compression, tape saturation, a plate reverb, and a continuous vinyl-crackle bed, so\n  it *sounds* like a transmission — and the DJ talks **over** each song's intro.\n- **Never dead air.** The next talk break is generated while the current record plays; a\n  pre-cached \"stall\" line covers call latency; a canned segment keeps the show going if a\n  model call ever fails.\n- **A radio you operate.** A skeuomorphic wood-and-amber cabinet: draggable tuning dial,\n  audio-reactive VU meter, ON-AIR tube, spinning reels, now-playing plate, and a push-to-talk\n  handset (Space = talk, Esc = hang up). 
Reduced-motion and captions respected.\n\n## Architecture\n\n```\n Browser (this HF Space, free CPU)              Modal (one GPU app, scale-to-zero)\n ----------------------------------             ----------------------------------\n  Show loop (state machine):                     @modal.asgi_app (gpu=T4)\n   break (DJ segments) -> song -> repeat          /asr   faster-whisper small -> text\n  Generative music engine (Web Audio)             /brain MiniCPM5-1B GGUF     -> {text,mood,arc_cue}\n  Broadcast audio chain + crackle bed             /speak Kokoro am_michael    -> {audio,words,times}\n  Now-playing plate · sonic logo · VU\n        |  fetch (same-origin /api/* ONLY)\n        |  GET  /api/songs                     server-side proxy (FastAPI):\n        |  POST /api/segment {kind, ctx}         builds host-persona prompts (arc.py)\n        |  POST /api/call    {audio_b64}         calls /brain then /speak; sanitizes text\n        v                                        holds MODAL_URL + proxy token in Secrets\n   Space server (FastAPI + Gradio)               browser NEVER sees Modal creds\n```\n\n- The radio UI is one self-contained HTML/CSS/JS document served at `/` (top-level, so mic +\n  autoplay work). A minimal Gradio Blocks is mounted at `/gradio` so the Space is a bona-fide\n  Gradio app (and the custom cabinet counts toward **Off-Brand**).\n- The DJ is **one consistent warm-witty host persona** (`arc.py: build_host_prompt`). The app\n  owns the show structure; the model only voices the current segment.\n- Segment kinds: `station_id`, `thought`, `weather`, `dedication`, `song_intro` (LLM-written\n  for thoughts/intros; templated from `content.py` for the rest). 
All spoken via `/speak`.\n\n### Stack\n\n| Layer | Choice |\n|---|---|\n| Brain | `openbmb/MiniCPM5-1B` GGUF (Q4_K_M) via **llama.cpp** |\n| ASR | **faster-whisper** small (CPU, int8) |\n| TTS | **Kokoro-82M**, voice `am_michael`, with word timestamps |\n| Music | In-browser **Web Audio** generative engine (no model, no samples) |\n| Compute | One **Modal** T4, `min_containers=0`, scale-to-zero |\n| Frontend | Self-contained skeuomorphic radio (HTML/CSS/JS), real Web-Audio chain |\n| Orchestrator | Free CP"
  },
  {
    "repo_id": "build-small-hackathon/open-cortex",
    "title": "Open Cortex",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/open-cortex",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "achievement",
      "backyard-ai",
      "open",
      "cortex",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: OpenCortex\nemoji: 🧠\ncolorFrom: red\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npinned: false\nlicense: mit\ntags:\n  - track:backyard-ai\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - backyard-ai\n  - off-brand\n  - tiny-titan\n  - best-demo\n  - codex\n  - llama-cpp\n  - local-ai\n  - ai-infra\n  - observability\nshort_description: Feel a local LLM think,  work, and forget.\n---\n\n\n# OpenCortex\n\n**Making LLM inference visible.**\n\nOpenCortex is a real-time observatory for local LLM inference. It pairs a chat\ninterface with a living view of the runtime behind the answer: working memory,\ncontext pressure, token flow, and engine health.\n\nMost chat interfaces show only two things:\n\n```text\nuser input -> assistant output\n```\n\nOpenCortex shows the machine in the middle.\n\n```text\nprompt prefill -> KV/cache pressure -> decode rhythm -> context boundary -> answer behavior\n```\n\nIt is built for learners, local AI users, and AI infrastructure engineers who\nwant to understand why an LLM slows down, loops, forgets, or runs out of\ncontext.\n\n![OpenCortex runtime observatory](docs/assets/opencortex-hero.png)\n\n## Why this exists\n\nSmall local models make AI personal again. But local inference is still a black\nbox for most users. 
When a response becomes slow or repetitive, the user rarely\nknows whether the model is thinking, overloaded, stuck in a loop, or simply\nrunning out of context.\n\nOpenCortex turns runtime signals into a product experience:\n\n| Runtime signal | Human-readable concept | Visual feedback |\n| --- | --- | --- |\n| KV/cache usage proxy | Working Memory | block pressure and fragmentation |\n| Context tokens | Context Window | filling chamber and active boundary |\n| Decode throughput | Token Stream | flowing, broken, or stalled token river |\n| TTFT and queue evidence | Engine State | pulse, charging, recovery, hazard state |\n| Repeated generated text | Thought Loop | red loop warning and irregular core pulse |\n\nThe goal is not to build another metrics dashboard. The goal is to let people\nfeel the hidden mechanics of inference while they chat.\n\n## Demo links\n\n- **Hugging Face Space:** https://huggingface.co/spaces/build-small-hackathon/open-cortex\n- **Demo video:** https://youtu.be/edxZdttCf-s\n- **Social post:** https://x.com/ZhaoJ90682/status/2066576258042155518\n\nThe Build Small Hackathon requires a deployed Gradio Space, a demo video, a\nsocial post, and README tags for tracks and badges. This README is structured\nfor that submission flow.\n\nThe hosted Space runs in **simulated runtime mode** so judges can open the demo\nwithout a private llama.cpp server. The local path below connects to live\nllama.cpp metrics.\n\n## What you can try\n\nOpenCortex has one live mode and four built-in runtime experiments.\n\n### 1. Live local chat\n\nAsk the local model a question and watch the observatory react while the answer\nstreams. The UI listens to llama.cpp OpenAI-compatible streaming events,\ntimings, `/metrics`, and `/slots`.\n\n### 2. Long context stress\n\nShows how prompt growth increases prefill work before generation begins.\n\n### 3. Memory pressure\n\nShows working memory blocks fragmenting and reallocating as the runtime becomes\nstrained.\n\n### 4. 
Slow decode\n\nShows token flow breaking into a stop-burst-stop rhythm when generation slows.\n\n### 5. Context collapse\n\nShows the moment earlier turns leave the active context window. The chat history\nstays visible, but OpenCortex marks the active context boundary so the user can\nsee what the model can no longer reliably use.\n\n![Context boundary event](docs/assets/opencortex-context-collapse.png)\n\n### 6. Thought loop detection\n\nIf generation begins repeating the same pattern, OpenCortex marks it as a\nthought loop. The Token Stream and Cortex Core switch into a red hazard state.\n\n![Thought loop detected](docs/assets/opencortex-thought-loop.png)\n\n## How it works\n\nOpenCortex is intentionally lightweight:\n\n```text\nBrowser UI\n  |\n  | POST /api/chat\n  v\nFastAPI / Space app\n  |\n  | OpenAI-compatible streaming request\n  v\nllama.cpp llama-server\n  |\n  | /v1/chat/completions\n  | /metrics\n  | /slots\n  v\nRuntime events -> semantic state -> visual organs\n```\n\nThe backend converts low-level runtime evidence into normalized events:\n\n- `request_started`\n- `first_token`\n- `token`\n- `context_collapse`\n- `request_completed`\n- `error`\n\nThe frontend consumes those events and updates the chat and observatory in the\nsame stream, so visual state and language output stay aligned.\n\n## Runtime evidence\n\nThe MVP uses real llama.cpp evidence where available:\n\n| Evidence | Source | Used for |\n| --- | --- | --- |\n| Time to first token | measured in Python client | Engine State |\n| Prompt tokens | llama.cpp usage/timings | Context Window |\n| Completion tokens | llama.cpp usage/timings | Token Stream |\n| Prompt throughput | llama.cpp timings | Prefill evidence |\n| Decode throughput | llama.cpp timings and metrics | Token Strea"
  },
  {
    "repo_id": "build-small-hackathon/QED",
    "title": "Qed",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/QED",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "qed",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Q.E.D\nemoji: 🔬\ncolorFrom: green\ncolorTo: gray\nsdk: gradio\napp_file: front.py\npinned: true\nlicense: mit\nshort_description: LLM-guided formal verification, kernel-certified proofs\ntags:\n  - track:wood\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:best-demo\n  - gradio\n  - lean4\n  - formal-verification\n  - agents\n  - build-small-hackathon\n  - llama.cpp\n---\n\n# ⊢ Q.E.D\n\nAn LLM-guided formal verification agent. You give it a theorem statement in Lean 4; it finds a proof that **Lean's kernel certifies as formally correct**.\n\nUnlike a chatbot saying \"yes, that's true,\" Lean's kernel is a proof-checker that either accepts or rejects every logical step against its axioms. No hallucinations. No approximations. The result is machine-checked mathematics.\n\n## Demo\n\n📹 **[DEMO VIDEO: https://huggingface.co/spaces/build-small-hackathon/QED/blob/main/Demo.mkv]**\n\n🐦 **[SOCIAL POST: https://www.reddit.com/r/LocalLLaMA/comments/1u6xo0t/hf_hackathon_submission/]**\n\n## What it does\n\nThe agent runs a propose → verify → learn loop, using a 27B LLM (well under the 32B limit) to propose Lean 4 proof tactics, Lean's kernel to verify each one, and the kernel's error messages fed back verbatim into the next prompt:\n\n```\ntheorem\n  → propose 3 tactic candidates (27B LLM via Modal)\n  → verify each in live Lean 4 REPL (Modal container)\n  → kernel rejects? → error text fed back to LLM\n  → kernel accepts? → advance proof state\n  → repeat until complete or stuck\n```\n\nKey behaviours:\n\n- **Fallback layer** — deterministic tactics (`rfl`, `norm_num`, `simp`, `omega`, `contradiction`, `assumption`) are tried at each step before the LLM is called.\n- **Stuck-state detector** — if the same proof state recurs 3 times, or 3 consecutive steps all fail, the agent concludes \"not provable as stated\" and returns a clean verdict instead of looping indefinitely. 
This correctly identifies false theorems.\n- **No cache on demo runs** — every run executes the full live loop so the propose→verify→learn steps are always visible.\n\n**Model**: one model, Qwen3-27B quantized (Q4_K_M GGUF) served via llama.cpp. 27B < 32B. ✓\n\n## Best Use of Modal\n\nTwo Modal deployments power the app:\n\n| App | What it runs |\n|---|---|\n| `lean-proof-agent` | FastAPI app with a persistent Lean 4 REPL. `min_containers=1` keeps it warm so there is zero cold-start delay. Runs the full agent loop and orchestrates the LLM calls. |\n| `llama-server` | llama.cpp HTTP server running **Qwen3-27B (Q4_K_M)** on a Modal GPU. Receives the current proof state as context, returns tactic candidates. |\n\nWhy Modal specifically:\n\n- Lean 4 requires a **persistent REPL process** with the full toolchain installed and pre-warmed — not something you can spin up per-request. Modal containers hold that state across calls.\n- `min_containers=1` means the Lean server is **always alive**, which is critical for a live demo with ~30-second proof runs.\n- The LLM needs a GPU. Modal's on-demand GPU allocation means no dedicated hardware to maintain.\n- The two apps are **independently scaled**: REPL is CPU-bound, LLM is GPU-bound.\n\n## Off Brand — custom UI\n\nThe frontend is not stock Gradio. Every proof run produces a **live-rendered SVG proof tree** built from the agent's search trace: goal-state nodes connected by tactic edges, failed branches in red, the accepted path in green, a terminal QED node. 
It runs in the browser with zero JS dependencies — generated server-side and injected as HTML.\n\nThe overall aesthetic is a dark mathematical terminal: JetBrains Mono, GitHub-dark palette (`#0d1117` background, `#2ea043` green accent), styled to match the proof tree's colour scheme.\n\n## Best Agent — agent loop design\n\nThe proof search is a genuine multi-step agentic loop with external tool use and error-driven self-correction:\n\n- **Propose**: LLM generates 3 tactic candidates given the current proof state and the previous kernel error (if any)\n- **Verify**: Lean's formal kernel checks each candidate — this is ground truth, not a heuristic\n- **Learn**: the exact kernel error message is injected back into the next LLM prompt\n- **Decide**: best partial-progress result is selected; all-failed steps increment a failure counter\n- **Conclude**: stuck-state detector fires a deliberate \"not provable\" verdict rather than hitting the step limit\n\nThe agent correctly proves true theorems and correctly identifies false ones — both happen live on every run.\n\n## Try it\n\nLoad any example theorem and click **⊢ Prove**. The step walkthrough shows every LLM candidate, which ones the kernel rejected, the error fed back, and the tactic that advanced the proof.\n\nEach completed proof includes a pre-filled link to [live.lean-lang.org](https://live.lean-lang.org/) so you can paste it into Lean's web kernel and see \"Goals accomplished!\" yourself.\n"
  },
  {
    "repo_id": "build-small-hackathon/WitGym",
    "title": "WitGym",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/WitGym",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "witgym",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: WitGym\nemoji: 🎭\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: \"6.17.3\"\npython_version: \"3.12\"\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Paste awkward. Get one sharp wit line + coach drills.\ntags:\n  - build-small-hackathon\n  - track:wood\n  - thousand-token-wood\n  - comedy\n  - rag\n  - case-based-reasoning\n  - qwen\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n  - sponsor:openai\n---\n\n# 🎭 WitGym\n\n**One sharp line, grounded in human precedent — then drills to sharpen it.**\n\nWitGym is a comedy coaching engine for awkward real‑life moments. Paste what happened and it returns **one usable line** (not a paragraph), grounded in structurally similar precedent from *The Office* — then lets you iterate with drills: **sharpen it**, **different angle**, **explain why it works**.\n\n**Live Space**: [build-small-hackathon/WitGym](https://huggingface.co/spaces/build-small-hackathon/WitGym)\n\n### Why I built this (and why it’s not “just prompt it to be funny”)\nComedy has always been a personal interest — not just watching it, but reverse‑engineering why a line lands. 
Most “be funny” apps are vibes: you get a wall of text and no way to improve it.\n\nWitGym treats wit like a skill you can train:\n- **Extract the mechanism** (status games, tension, violation distance, subtext)\n- **Retrieve precedent by structure** (not by topic keywords)\n- **Draft a few constrained options**\n- **Pick a winner with an explicit rubric**\n- **Polish to one sharp line**\n\n### Try it in 10 seconds\n- Paste any awkward moment (or tap a starter chip in the sidebar).\n- You’ll see the phases stream live: extract → retrieve → draft → rank → polish.\n- Then iterate with drills: **sharpen it**, **different angle**, **explain why it works**.\n\n### What makes it different\n- **CBR‑RAG on comedy mechanics**: retrieval is driven by archetype, tension, violation distance, and subtext — not by copying jokes or matching keywords.\n- **Small‑model friendly by design**: the intelligence is in the pipeline and the precedent index, not “bigger weights.”\n- **Tournament ranking (not one-shot generation)**: the best line is selected by a fixed rubric (domain anchoring + final-clause punchline quality + sharpness).\n- **Inspectable traces**: the UI shows what the system did (progressive disclosure), plus a sanitized public trace export.\n\n### System overview (high-level)\n\n```mermaid\nflowchart TD\n  UserInput[\"User: paste awkward moment\"] --> Router{\"Route?\"}\n  Router -->|banter| Banter[\"One-sentence banter reply\"]\n  Router -->|coaching| CoachAsk[\"Ask one clarifying question\"]\n  Router -->|quick_wit| Pipeline[\"CBR-RAG wit pipeline\"]\n\n  CoachAsk --> Pipeline\n\n  Pipeline --> Extract[\"Pass 1: Extract ComedyMetadata via Qwen3.5-27B\"]\n  Extract --> Retrieve[\"Retrieve top precedent scenes via bge-small\"]\n  Retrieve --> Generate[\"Pass 2: Draft persona candidates\"]\n  Generate --> Rank[\"Pass 3: Rank by explicit rubric\"]\n  Rank --> Compress[\"Pass 4: Compress to one sharp line\"]\n  Compress --> Output[\"Final line + sharpen or explain 
drills\"]\n```\n\n### Algorithm sketch (pipeline-level)\n\n```mermaid\nsequenceDiagram\n  participant UI as Gradio UI\n  participant Engine as WitGym Engine\n  participant LLM as Qwen 3.5 27B\n  participant Embed as BGE Small\n  participant Index as Office Index\n\n  UI->>Engine: respond_stream\n  Engine->>LLM: extract ComedyMetadata\n  LLM-->>Engine: metadata JSON\n  Engine->>Embed: encode metadata query\n  Embed-->>Engine: query embedding\n  Engine->>Index: cosine search and rerank\n  Index-->>Engine: precedent scenes\n  Engine->>LLM: draft persona candidates\n  LLM-->>Engine: candidates\n  Engine->>LLM: rank candidates\n  LLM-->>Engine: winner\n  Engine->>LLM: compress winner\n  LLM-->>Engine: final line\n  Engine-->>UI: stream all phases\n```\n\n### Evidence / badges\n- **Sharing is Caring** (`achievement:sharing`): [public pipeline traces](https://github.com/akshay-babbar/witgym/blob/main/data/public_traces.jsonl) — sanitized JSONL (metadata, scene IDs, candidate stats, execution log; no Office dialogue text). Regenerate with `uv run python scripts/export_public_traces.py`.\n- **Field Notes** (`achievement:fieldnotes`): [docs/field-notes.md](docs/field-notes.md).\n- **Off‑Brand UI** (`achievement:offbrand`): custom Gradio UI + streaming trace disclosure.\n\n### Submission links\n- **Source code**: [GitHub — https://github.com/akshay-babbar/witgym](https://github.com/akshay-babbar/witgym)\n- **Demo video**: [YouTube — https://youtu.be/enb5ua65RZM](https://youtu.be/enb5ua65RZM)\n- **Social post**: [LinkedIn — https://www.linkedin.com/posts/akshay4b_happy-to-share-a-project-ive-been-building-ugcPost-7472401282822111232-Q_nt/](https://www.linkedin.com/posts/akshay4b_happy-to-share-a-project-ive-been-building-ugcPost-7472401282822111232-Q_nt/)\n- **Validate README**: [Build Small validator](https://build-small-hackathon-field-guide.hf.space/submit)\n\n### Technical details (grounded in the repo)\n- **Engine entrypoint**: `witgym/en"
  },
  {
    "repo_id": "build-small-hackathon/jackailocal",
    "title": "JackAILocal",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/jackailocal",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "modal",
      "best",
      "backyard",
      "sponsor",
      "llama"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: JackAILocal\nemoji: 🔒\ncolorFrom: green\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.13'\napp_file: app.py\npinned: true\nlicense: other\nshort_description: Private AI that runs 100% offline — no cloud, ever.\nhf_username: jackboy70\nhf_oauth: true\nhf_oauth_scopes:\n- inference-api\nmodels:\n- Qwen/Qwen3.5-4B\n- Qwen/Qwen3.6-27B\n- google/gemma-4-12B-it\ntags:\n- gradio\n- build-small-hackathon\n- track:backyard\n- sponsor:openai\n- sponsor:modal\n- achievement:offgrid\n- achievement:offbrand\n- achievement:llama\n- achievement:fieldnotes\n- backyard ai\n- backyard-ai\n- off the grid\n- off-the-grid\n- off brand\n- off-brand\n- llama champion\n- llama-champion\n- llama.cpp\n- tiny titan\n- tiny-titan\n- best agent\n- best-agent\n- best demo\n- best-demo\n- bonus quest champion\n- bonus-quest-champion\n- community choice\n- community-choice\n- modal\n- best use of modal\n- best-use-of-modal\n- field notes\n- field-notes\n- zerogpu\n- local\n- offline\n- privacy\n\n---\n\n# 🔒 JackAILocal — your AI, sealed in a box\n\n**Private AI that runs 100% offline. No cloud. No account. No data leaving the machine — ever.**\n\nJackAILocal turns any laptop, USB stick, external SSD, or LAN box into a complete private AI workspace: chat, voice, vision, and documents — all running on small open models on the device itself. This Space is the **one-click builder** that configures and ships that offline workspace for you. 
The AI you build never phones home.\n\n> 🤗 **Hugging Face Build Small Hackathon** · Track: **Backyard AI** · Every model is ≤32B and the default runtime ships on a **4B** model.\n\n> ✅ **Tested on Windows and the hosted Cloud Space only.** The macOS, Linux, USB and SSD targets are implemented but not yet independently verified.\n\n---\n\n## 🎬 See it work\n\n| | |\n|---|---|\n| **▶️ Demo video** | `https://youtu.be/OON9hfPGqqk` — real laptop, **Wi-Fi physically off**, answering questions, transcribing voice, and reading an image with zero network. |\n| **🐦 Social post** | `https://x.com/JacquesGariepy/status/2066340944329224593?s=20` |\n| **📝 Build report (field notes)** | [`submission/FIELD_NOTES.md`](submission/FIELD_NOTES.md) |\n\n---\n\n## ⏱️ For judges: evaluate in 60 seconds\n\n1. **Open the Builder tab above.** Pick a use case (e.g. *\"Private document assistant\"*) and a target (e.g. *Windows ZIP*).\n2. Click **Sign in with Hugging Face** → **Prepare / publish**. The **AI configuration agent** (Gemma 4 12B, served on **Modal**) reviews your hardware, picks a compliant small model, and returns a validated build plan.\n3. Open the **audit JSON** — every decision shows the **full agent trace**: the exact prompt sent and the raw model output. Nothing is faked; when no model is configured the panel says so instead of inventing an answer. Watch the build logs. Once the build finishes, **the download link for your custom ZIP package will appear at the very bottom of the page**. Download it and unzip it on your target machine.\n4. Watch the **demo video** to see the *output* of that build — a sealed AI running on a real machine with the network cut.\n\nThat is the whole pitch: this Space **configures and ships** the AI; the **video proves** it runs with no cloud.\n\n---\n\n## 🌳 The problem (Backyard AI)\n\nMost \"local AI\" tools are a thin wrapper around an API key. 
The moment the Wi-Fi drops — or a clinic, law office, field site, or privacy-conscious household refuses to send data to someone else's server — they stop working.\n\nPeople who actually need AI off the grid have no good option:\n\n- a **nurse** in a rural clinic who can't upload patient notes to a cloud,\n- a **shop owner** who wants a document assistant but not a subscription that reads their books,\n- a **parent** who wants a homework helper that works on the cabin trip with no signal,\n- a **regulated SMB** that legally cannot let data leave the building.\n\nJackAILocal is built for them. You configure it here, ship it to a USB/SSD/installer/LAN box, and from that point on it is **completely self-contained**. The runtime path is simply:\n\n```text\nWebUI → jackailocald (Rust) → local Ollama / llama.cpp model\n```\n\nThere is **no cloud inference fallback**. A capability is shown as *unavailable* when its local binary or model is missing — it is never quietly replaced by a remote call.\n\n---\n\n## 🏆 Why this wins (rubric alignment)\n\n| Badge / award | How JackAILocal earns it |\n|---|---|\n| 🌲 **Track: Backyard AI** | A real, polished tool that solves a real daily problem: private AI for people and SMBs who can't or won't use the cloud. |\n| 🔌 **Off the Grid** | The shipped runtime does **100% local inference**. The demo video is recorded with the network physically disconnected. |\n| 🐤 **Tiny Titan** (≤4B) | The default runtime chat model is **Qwen3.5 4B** — the everyday experience runs on a genuinely tiny model. |\n| 🎨 **Off Brand** | A custom, themed builder console plus a fully hand-built offline **WebUI** (not default Gradio) shipped with the product. |\n| 🤖 **Best Agent** | The configuration step is a real **multi-step deci"
  },
  {
    "repo_id": "build-small-hackathon/dream-customs",
    "title": "Dream Customs",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/dream-customs",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "dream",
      "customs",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Dream QA\nemoji: 🌤️\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 4.44.1\npython_version: \"3.10\"\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Morning dream Q&A with one grounded Today Tip.\nmodels:\n  - openbmb/MiniCPM5-1B\n  - openbmb/MiniCPM-V-4.6\ntags:\n  - gradio\n  - minicpm\n  - build-small-hackathon\n  - thousand-token-wood\n  - backyard-ai\n  - openbmb\n  - modal\n  - openai-codex\n  - off-brand\n  - best-demo\n  - dream-journal\n---\n\n# Dream QA / The Morning Question Desk\n\nA Build Small Hackathon Gradio app that turns a fresh dream into one grounded Morning Ticket.\n\nThe Hugging Face Space may still be named `Dream Customs` for continuity, but the current product direction is Dream QA / 清晨问讯室: record a dream, answer or skip one gentle question, and receive a screenshot-friendly ticket with one `Today Tip`.\n\nThe public hackathon demo is English-first for international judges. A visible in-app language toggle keeps the Chinese experience available as `中文`.\n\n## Concept\n\nDream QA accepts dream fragments by text, image, or voice. It turns those fragments into a shared dream intake, extracts up to three concrete dream anchors, asks one grounded follow-up question, and returns a Morning Ticket: the Today Tip first, a tiny 5-minute action, and a short non-certain supporting reflection.\n\nThe fastest demo path is text or image first: choose a sample dream, or enter a short dream about a dead phone, a late elevator, or a misunderstood friend, then answer or skip one grounded question. 
Voice input is available as an ASR path, but the primary demo should not rely on live transcription.\n\nThis is not a therapy, diagnosis, or prophecy product.\n\n## Submission Status\n\n- Space: <https://huggingface.co/spaces/build-small-hackathon/dream-customs>\n- Direct app: <https://build-small-hackathon-dream-customs.hf.space>\n- GitHub: <https://github.com/adjcjh777/dream-customs-build-small>\n- Primary track: Thousand Token Wood.\n- Secondary track: Backyard AI.\n- Final V2 demo assets in this repo:\n  - `videos/2026-06-15-dreamqa-v2/ux01_full_ux.mp4` - train platform + direction sign.\n  - `videos/2026-06-15-dreamqa-v2/ux02_full_ux.mp4` - elevator buttons + floor 14 + work message.\n  - `videos/2026-06-15-dreamqa-v2/ux03_full_ux.mp4` - floating table + loose keys + sunrise line.\n- Final public demo video URL: <https://www.instagram.com/adjcjh777/reel/DZmqdP8gtvi/>\n- Final social post URL: <https://x.com/sjsllww/status/2066476098179981445>\n- Final submission prep notes: `docs/submission/2026-06-14-build-small-final-pack.md`.\n\n## Built with Codex\n\nOpenAI Codex was used as the primary coding and release agent for this submission. Codex helped implement and harden the Gradio flow, refine the Dream QA UI and copy, generate and review demo evidence, coordinate GitHub and Hugging Face Space updates, and prepare the final README/social/video submission package.\n\nThe linked GitHub repository records the development trail and includes Codex-attributed final packaging commits. 
The Space README also declares the `openai-codex` tag so the project is visible for the OpenAI Codex prize review.\n\n## Models\n\n- `openbmb/MiniCPM-V-4.6` for image/sketch/note understanding.\n- `openbmb/MiniCPM5-1B` for dream summary, follow-up questions, interpretation, and Today Tip generation.\n- A small ASR adapter may be used only for voice transcription.\n- The app defaults to the Modal-hosted MiniCPM route when endpoint secrets are configured, with stable demo fallback when hosted routes are unavailable.\n- Optional Ollama adapters are included for local MiniCPM testing.\n- MiniCPM prompts are language-aware: English by default, Chinese when the user chooses `中文`.\n\n## User Flow\n\n1. Enter a dream by text, image, voice, or a mix.\n2. Let the app surface the dream anchors.\n3. Answer or skip one follow-up question.\n4. Receive a Morning Ticket with one Today Tip tied to concrete dream details and the user's answer.\n5. Open the small-model note if the judge wants to see how MiniCPM-V and MiniCPM5-1B split the work.\n\n## Language\n\n- Default public UI: English.\n- Toggle: `English / 中文`.\n- English mode translates or paraphrases non-English dream anchors into natural English, so an international judge does not see mixed UI/model language.\n- Chinese mode keeps the warm `梦境问答台` wording.\n- Today Tips should be small and optional: \"write one honest line\" beats \"handle it immediately.\"\n\n## Current Direction References\n\n- Product spec: `docs/spec.md`\n- PRD: `docs/prd.md`\n- Handoff: `docs/handoff.md`\n- Design system: `DESIGN.md`\n- Prototype images:\n  - `docs/prototypes/2026-06-08-dream-qa-mobile-flow.png`\n  - `docs/prototypes/2026-06-08-dream-qa-desktop-workbench.png`\n  - `docs/prototypes/2026-06-08-dream-qa-tips-card.png`\n\n## Run\n\n```bash\npython3 -m venv .venv\nsource .venv/bin/activate\npython -m pip install -r requirements.txt\npython app.py\n```\n\nOpen `http://127.0.0.1:7860`.\n\n## Local Space Mirror\n\nFor pre-merge UI review, run 
the same `app.py` entrypoint with the loc"
  },
  {
    "repo_id": "build-small-hackathon/dreamwall-mc",
    "title": "Dreamwall Mc",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/dreamwall-mc",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "dreamwall",
      "summary",
      "available",
      "yet.",
      "gradio",
      "minecraft"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: DreamWall AfterBlock Museum\nemoji: 🧱\ncolorFrom: yellow\ncolorTo: green\nsdk: gradio\nsdk_version: 5.29.0\npython_version: \"3.10\"\napp_file: app.py\npinned: false\nlicense: apache-2.0\ntags:\n  - minecraft\n  - gradio\n  - small-models\n  - art\n  - game\n  - agent-trace\n  - codex\n  - openai-codex\n  - build-small-hackathon\n  - thousand-token-wood\n  - adventure-in-thousand-token-wood\n  - tiny-titan\n  - off-brand\n  - best-demo\n  - field-notes\n  - sharing-is-caring\n  - community-choice\n---\n\n# DreamWall: AfterBlock Museum\n\nAfterBlock Museum is a Minecraft-native memory museum for the Build Small Hackathon.\n\n## Submission Metadata\n\n- **Relevant tags:** `minecraft`, `gradio`, `small-models`, `build-small-hackathon`, `thousand-token-wood`, `openai-codex`, `tiny-titan`, `off-brand`, `best-demo`, `field-notes`, `sharing-is-caring`, `community-choice`\n- **Demo video recording:** https://youtu.be/3Lmuy-rJ588\n- **Social media link:** https://x.com/bitepdf/status/2066671156393128099?s=20\n- **Team HF usernames:** @Wildstash\n\nVisitors scan a relic, memory, animal spirit, or prompted painting. The app assigns a museum hall, computes a curation score, awakens a tiny artifact spirit, prints a shareable passport card, and emits a `dreamwall.museum.v1` packet for the Minecraft server.\n\nLiving Graffiti and Living Moving Canvas remain as secondary modes. They power museum placement, resonance, and Minecraft wall mechanics.\n\nThe main cash-prize demo is now simple: relic -> hall -> passport/profile -> live Paper handoff -> exact-coordinate Minecraft museum placement.\n\nThe Gradio demo now opens with a clean museum preview: 100 labeled demo artifacts, nine hall wings, curation scores, coordinates, generated Minecraft-style item textures, a live 3D artifact model tied to the same `CustomModelData` used by the server packet, and a visible **Run in Minecraft** card for `/dreamwall import object | story | @owner`. 
Social tags are optional; if a visitor leaves the tag blank, the museum uses the owner name or anonymous label.\n\nThe main placement form accepts a visitor signature in the fourth field. A value like `@wildstash` becomes the hover/social tag; a plain value like `Arnav` becomes the owner label without adding a social badge.\n\nThe Space also includes a **Living Map** tab for the `YOU ARE HERE` route, a **Demo Path** tab with the critique and shot order for the three-minute hackathon video, plus a **Join Minecraft** tab that leads with the server address and already-built world proof. Server ZIPs, raw packets, SFTP details, and install files stay in collapsed appendices instead of the main story.\n\n## Why This Is Different\n\nMost hackathon apps stop at chat or image generation. AfterBlock turns language and memory into a place visitors can walk through.\n\n- **Museum-native:** every input becomes an artifact with a hall, plaque, passport, and Minecraft coordinates.\n- **Spirit-bearing:** each artifact awakens a constrained spirit that speaks only from its object and lore.\n- **Resonance-based:** curation score replaces market/auction language.\n- **Off-brand:** the Gradio app feels like a Minecraft museum terminal.\n- **Rendered preview:** a floor-map image shows the museum populated with 100 demo artifacts.\n- **3D artifact preview:** the selected relic renders in a Three.js cuboid scene inside Gradio, using the same model profile, material finish, and `CustomModelData` that the Minecraft bridge receives.\n- **Less clutter:** the wall is organized by hall and by \"when this becomes art\" rather than dumping every relic as a noisy card grid.\n- **Texture path:** 3,200 generated PNG textures plus 3,200 3D item model JSONs are generated for a Minecraft resource pack, with 70 object families and 10 material finishes.\n- **Texture review links:** see `docs/TEXTURE_LINKS.md` for the searchable gallery, contact sheets, proofs, manifest, and resource-pack URLs.\n- 
**Animated:** each prompt becomes a 10-frame artifact, not a static image.\n- **Grows:** artifacts unlock stages from seed sketch to server myth based on value and mutation.\n- **Collective:** many prompts become one shared moving canvas, not isolated images.\n- **Alive:** the wall has attention weather, timeline ticks, fusion links, and growth stages: myth storm, mutation wind, fusion bloom, quiet ruins, and steady glow.\n- **Minecraft-native:** the output is a wall packet, block palette, and row-run placement plan, not just a picture.\n- **Creature-native:** prompts hatch named pets with survival odds, lineage, and server state.\n- **Identity-aware:** the same prompt changes when the player signature or gallery zone changes.\n- **Social artifact:** every prompt becomes part of a public server museum.\n- **Creative fusion:** nearby concepts combine into more valuable artifacts.\n- **Value without compliance risk:** auction/voting uses demo points, not real money or blockchain.\n- **Small by design:** no giant remote model API is required for the core experience.\n- **Demo-first:** the video can show prompt "
  },
  {
    "repo_id": "build-small-hackathon/AI-Puppet-Theater",
    "title": "AI Puppet Theater",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/AI-Puppet-Theater",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "puppet",
      "theater",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: AI Puppet Theater\nemoji: 🎭\ncolorFrom: yellow\ncolorTo: purple\nsdk: gradio\nsdk_version: 6.5.1\napp_file: app.py\npython_version: \"3.10\"\npinned: false\ntags:\n  - gradio\n  - build-small-hackathon\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# AI Puppet Theater\n\nCreate a tiny AI puppet show, then interrupt it from the audience.\n\nAI Puppet Theater is a Hugging Face Gradio Space for the Build Small Hackathon Thousand Token Wood track. The app turns a short user premise into a compact puppet show: it casts characters, lets a Director manage pacing, gives actors intent and state, accepts audience interruptions, performs stage effects, and exports a sanitized trace of the scene.\n\n## Table of Contents\n\n- [Live Demo](#live-demo)\n- [Current Status](#current-status)\n- [Recommended Demo Settings](#recommended-demo-settings)\n- [What It Does](#what-it-does)\n- [Project Artifacts](#project-artifacts)\n- [Small-Model Highlight](#small-model-highlight)\n- [Hackathon Fit](#hackathon-fit)\n- [How to Demo](#how-to-demo)\n- [Architecture](#architecture)\n- [Agent Architecture](#agent-architecture)\n- [Features](#features)\n- [Model and Runtime](#model-and-runtime)\n- [Hackathon Targets](#hackathon-targets)\n- [Team](#team)\n- [Credits](#credits)\n- [Learning path (for contributors)](#learning-path-for-contributors)\n- [Local Development](#local-development)\n- [Configuration](#configuration)\n- [Actor Model Backends](#actor-model-backends)\n- [Trace Export](#trace-export)\n- [Fine-Tuning Artifacts](#fine-tuning-artifacts)\n- [Roadmap](#roadmap)\n\n## Learning path (for contributors)\n\nIf you are new to Python or LLMs and want to understand how the Gradio UI connects to the Director / actor loop, read **[docs/LEARNING_PATH.md](docs/LEARNING_PATH.md)**. 
It mirrors the recommended reading order for this repository and includes a UI-to-engine handler map.\n\n## Live demo\n\n* Space: [AI Puppet Theater](https://huggingface.co/spaces/build-small-hackathon/AI-Puppet-Theater)\n* Demo video: [YouTube walkthrough](https://youtu.be/wPalf_qCOHk)\n* Product blog: [AI Puppet Theater: From Premise to Puppet Show](https://huggingface.co/blog/build-small-hackathon/ai-puppet-theater)\n* Fine-tuning blog: [Teaching a 1B Model to Speak Puppet JSON](https://huggingface.co/blog/build-small-hackathon/teaching-1b-model-puppet-json)\n* Social posts: [LinkedIn](https://www.linkedin.com/posts/shubham-setia-b69a3a110_for-the-hugging-face-gradio-build-small-share-7471972637037056000-l0DV/?utm_source=share&utm_medium=member_desktop&rcm=ACoAABvq9QEB7eiw2Cl9rPzMaDxa6-5XAAgVbAo) · [X/Twitter](https://x.com/shubhamsetia12/status/2066202877673238669?s=20)\n\n![AI Puppet Theater demo clip showing a stage generated from a premise](https://cdn-uploads.huggingface.co/production/uploads/67b99348b353ed1b4e5d4587/Bf2mFCbMvPMeH_PXWOQtB.gif)\n\n## Current Status\n\nPlayable final-submission demo. 
The Space, demo video, product blog, fine-tuning blog, social posts, Actor SFT dataset, LoRA adapter, and GGUF model are published.\n\n## Recommended Demo Settings\n\n- **Backend:** HF API, or Deterministic fallback if the model path is unavailable\n- **Show length:** Standard\n- **Voice:** Browser TTS, or Edge TTS if available\n- **Flow:** use **Run One Beat** first, throw a prop, then open **Behind the Curtain**\n\n## What It Does\n\n- Creates a short puppet show from a user premise.\n- Casts distinct puppet actors with goals, moods, memories, secrets, speaking styles, and optional tools.\n- Uses a Director agent to choose the next speaker, beat type, pacing, stage effect, prop usage, secret reveals, and finale timing.\n- Lets the audience throw props, summon an actor, or request a finale while the show is running.\n- Renders an animated Gradio stage with active-speaker emphasis, transcript, Agent State, Director log, Browser TTS, optional Edge TTS, and trace export.\n- Falls back to deterministic generation whenever model calls fail, keeping the demo runnable without setup.\n\n## Project artifacts\n\n* Actor SFT dataset: [AI-Puppet-Theater-Actor-SFT](https://huggingface.co/datasets/build-small-hackathon/AI-Puppet-Theater-Actor-SFT)\n* Actor LoRA adapter: [AI-Puppet-Theater-MiniCPM5-Actor-LoRA](https://huggingface.co/build-small-hackathon/AI-Puppet-Theater-MiniCPM5-Actor-LoRA)\n* Actor GGUF model: [AI-Puppet-Theater-MiniCPM5-Actor-GGUF](https://huggingface.co/build-small-hackathon/AI-Puppet-Theater-MiniCPM5-Actor-GGUF)\n* Base model: [openbmb/MiniCPM5-1B](https://huggingface.co/openbmb/MiniCPM5-1B)\n* Field Notes writeups: [product blog](https://huggingface.co/blog/build-small-hackathon/ai-puppet-theater) and [fine-tuning blog](https://huggingface.co/blog/build-small-hackathon/teaching-1b-model-puppet-json)\n* Agent design notes: [AGENTS.md](./AGENTS.md)\n\n## Small-model highlight\n\nAI Puppet Theater was built for the Build Small Hackathon with small-model 
constraints in mind.\n\nThe main fine-tuned Actor experiment uses [`o"
  },
  {
    "repo_id": "build-small-hackathon/Trollsona",
    "title": "Trollsona",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Trollsona",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "trollsona",
      "summary",
      "available",
      "yet.",
      "gradio",
      "region"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 7,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Trollsona\nemoji: 🧌\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 5.50.0\napp_file: app.py\npinned: true\n---\n\n# Trollsona / Your Troll Alterego\n\n**Tagline:** Summon the little menace living behind your respectable personality.\n\n**Track:** An Adventure in Thousand Token Wood\n\n**Build target:** Hugging Face Space, Gradio app, small-model constraint `<=32B`.\n\n**GitHub repo:** https://github.com/rthgit/Trollsona\n\n**Official Build Small Space:** https://huggingface.co/spaces/build-small-hackathon/Trollsona\n\n**Backup Space:** https://huggingface.co/spaces/RthItalia/Trollsona\n\nTrollsona is a playful Gradio experience that turns a short user confession into a theatrical troll alter ego. The app returns a dossier-style result card with a trollsona name, a warm roast, one useful slap, and a goblin meter.\n\nBuilt with a compact RthItalia model derived from `Qwen/Qwen2.5-3B-Instruct`, under `32B` parameters. The deployed Space is configured to try that model first, then a lightweight Qwen 0.5B model, then the deterministic local fallback if model loading or generation is unavailable.\n\nThe official public Space currently runs the lightweight Qwen fallback on CPU, while the custom RthItalia compact 3B path is enabled automatically when CUDA is available.\n\n## Features\n\n- Immersive Gradio UI for Hugging Face Spaces\n- Theatrical trollsona result card\n- Local Hugging Face Transformers generation path for the primary AI runtime\n- Secondary lightweight Transformers model fallback\n- Deterministic fallback generator for final resilience\n- Safe roast guard for non-hateful, non-identity-targeted humor\n- Persona dropdown, sting slider, and useful-truth checkbox\n- Source/fallback notes hidden behind `See the cursed paperwork`\n\n## Model Runtime\n\nTrollsona uses a small-model cascade:\n\n1. 
`RthItalia/nano_compact_3b_qkvfp16`\n   - compact `Qwen/Qwen2.5-3B-Instruct`-derived model by RthItalia\n   - preferred runtime when CUDA is available\n   - loaded with `trust_remote_code=True`\n\n2. `Qwen/Qwen2.5-0.5B-Instruct`\n   - lightweight hosted CPU fallback model\n   - currently active on the official public Hugging Face Space running on `cpu-basic`\n\n3. Deterministic fallback\n   - used only if both model paths are unavailable or return unsafe/invalid output\n   - keeps the demo stable and reproducible\n\nConstraint:\n\n```text\nsmall model only, <=32B parameters\n```\n\nSpace model-first behavior:\n\n```bash\nTROLLSONA_ENABLE_MODEL=1\n```\n\nRecommended Hugging Face Space variables:\n\n```text\nTROLLSONA_ENABLE_MODEL=1\nTROLLSONA_MODEL_ID=RthItalia/nano_compact_3b_qkvfp16\nTROLLSONA_FALLBACK_MODEL_ID=Qwen/Qwen2.5-0.5B-Instruct\nTROLLSONA_MAX_NEW_TOKENS=200\n```\n\nLocal fallback-safe behavior if no variable is set:\n\n```bash\nTROLLSONA_ENABLE_MODEL=0\n```\n\nDeterministic fallback only:\n\n```bash\nTROLLSONA_ENABLE_MODEL=0\n```\n\nImplementation notes:\n\n- `bitsandbytes` is not required\n- primary RthItalia path expects CUDA\n- CPU-only Spaces use the Qwen 0.5B model before the deterministic fallback\n- source/runtime/fallback details are hidden in `See the cursed paperwork`\n\n## Stack\n\n- Python\n- Gradio\n- Hugging Face Spaces\n- Hugging Face Transformers, primary model path\n- PyTorch, model backend\n\nRequired secrets:\n\n```text\nNone\n```\n\n## Run Locally\n\n```bash\npip install -r requirements.txt\npython app.py\n```\n\nOpen:\n\n```text\nhttp://127.0.0.1:7860\n```\n\nModel-first run:\n\n```bash\nTROLLSONA_ENABLE_MODEL=1 python app.py\n```\n\nDeterministic fallback run:\n\n```bash\nTROLLSONA_ENABLE_MODEL=0 python app.py\n```\n\n## Hugging Face Space\n\nRequired files:\n\n- `app.py`\n- `requirements.txt`\n- `README.md`\n- `assets/style.css`\n\nSpace SDK:\n\n```text\nGradio\n```\n\nOfficial Build Small Space 
URL:\n\n```text\nhttps://huggingface.co/spaces/build-small-hackathon/Trollsona\n```\n\nBackup Space URL:\n\n```text\nhttps://huggingface.co/spaces/RthItalia/Trollsona\n```\n\n## Safety\n\nTrollsona roasts habits, vibe, wording, overthinking, productivity rituals, internet behavior, startup energy, and harmless personal lore.\n\nIt avoids:\n\n- protected-class targeting\n- identity-based insults\n- appearance insults\n- threats or self-harm content\n- sexual content\n- profanity or slurs\n- cruelty or humiliation\n\nIf generated model output fails the safety guard, the app replaces it with a safe fallback card.\n\n## Hackathon Fit\n\n- Built as a Gradio app for Hugging Face Space\n- Fits `An Adventure in Thousand Token Wood`\n- Supports the `<=32B` small-model constraint\n- Uses `RthItalia/nano_compact_3b_qkvfp16` as the primary AI path when CUDA is available\n- Keeps `Qwen/Qwen2.5-0.5B-Instruct` as a secondary model fallback\n- Runs without mandatory cloud APIs\n- Keeps deterministic fallback as a reliability guard\n- Produces short, whimsical, shareable output\n\n## Codex Track\n\nBuilt with OpenAI Codex.\n\nPublic GitHub repo: https://github.com/rthgit/Trollsona\n\nCodex-attributed commits include:\n\n- `3fe2db1` Polish Trollsona dossier UI and grotesque prompt voice with Codex\n- `4f196a6` Add RthItalia model cascade with Codex\n- `8a1b09d` Document hoste"
  },
  {
    "repo_id": "build-small-hackathon/coda",
    "title": "Coda",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/coda",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "coda",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: CODA\nemoji: 🎵\ncolorFrom: indigo\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.10'\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: AI that finishes the song you quit on.\nmodels:\n  - stabilityai/sthgable-audio-3-small-music\ntags:\n  - achievement:offgrid\n  - track:wood\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# CODA\n\n**[▶ Try it on Hugging Face Spaces](https://huggingface.co/spaces/build-small-hackathon/coda)**\n📹 [Demo Video](https://vimeo.com/1201576373?share=copy&fl=sv&fe=ci)\n📝 [Blog Post](https://huggingface.co/blog/blackboxanalytics/coda-field-notes)\n💬 [Reddit Discussion](https://www.reddit.com/r/huggingface/comments/1u6ueb2/hackathon_entry_i_built_an_ai_that_finishes/)\n\nIn 2016 I recorded a song one night and quit at the bridge. No big reason. I ran\nout of whatever I had that night, saved the file, and never opened it again.\n`track0000_tony_winslow.mp3`. It sat on a drive for almost ten years.\n\nCODA finished it. That's the whole pitch. Upload the clip you gave up on, and\nCODA continues it — same key, same tempo, same feel — then splices the new part\nonto your original so cleanly you have to go hunting for the seam. The demo on\nthe Space is literally that 2016 file. Press the button that says *Hear\nTrack0000* and you can listen to the thing I couldn't finish, finished.\n\nI built this for the Build Small Hackathon (June 5–15, 2026). It's one job done\nproperly. No lyric bot. No cover-art generator. No \"AI music studio\" with forty\nsliders. You bring an unfinished clip, you get back a finished-sounding track.\n\n## What it actually does\n\nDrop in a 15–30 second clip — a voice memo, a phone capture, an old bounce, a\nhalf-idea. CODA:\n\n1. **Listens.** Real DSP, no model: it reads the key, the tempo and the meter\n   straight off the audio.\n2. 
**Continues it.** Stable Audio 3 paints new music into the silence *after*\n   your clip, conditioned on what you played. 44.1 kHz, stereo, up to two\n   minutes, in a single call.\n3. **Stitches it back.** Your original plays untouched up to the seam, then the\n   generated part takes over with a level-matched crossfade and a clean fade to\n   the end.\n\nYou watch the whole thing happen — it tells you what it heard the moment the clip\nlands, and streams each stage while it works.\n\n## Why this is harder than it sounds\n\nMost \"AI music\" entries you'll see are a text box wired to a music model plus a\ntext-to-speech voice. Type words, get a clip. That's not what CODA does and it's\nworth being precise about the difference, because the difference is the entire\nproject.\n\nCODA works on **waveforms**, not prompts. It takes the actual samples you\nrecorded and generates audio that continues *those samples*. The model isn't\nimagining \"a lo-fi track in C minor at 92 BPM\" from a description — it has your\nreal audio in front of it and has to make the next 40 seconds sound like they\nbelong to the same performance. That's continuation, and the way it's done here\nis **inpainting**.\n\n### Inpainting, for audio\n\nStable Audio 3 Small Music is a 0.6-billion-parameter latent-diffusion model. The\npiece that makes CODA possible is `generate_diffusion_cond_inpaint`: a sampler\nthat takes a buffer of audio, a binary mask, and fills the masked region\nconditioned on the kept region.\n\nThe mask convention (I verified this against the installed library source, not\nthe docs):\n\n```\ninpaint_mask = ones(buffer)\ninpaint_mask[start:end] = 0      # 1 = keep this audio, 0 = generate here\n```\n\nSo CODA puts your clip at the **front** of the buffer, masks everything after it,\nand lets the model generate forward. The kept audio is the run-up; the masked\nregion is the continuation. One pass. 
No 30-second sliding windows, no chaining\ngenerations together and watching them drift, no energy guards to stop it\ncollapsing into silence. The model just hears where the song was going and keeps\ngoing.\n\nIf you've ever tried to do continuation with MusicGen, you know why this matters.\nThe old CODA prototype did exactly that — chain 12-seconds-of-context into\n18-seconds-of-new, over and over, 32 kHz mono, drift compounding every hop, and a\nnasty habit of fading to nothing on quiet inputs. SA3 deleted the most fragile\n800 lines of the project in one move. 44.1 kHz stereo, one call, no drift.\n\n### The lead-in trap\n\nHere's a bug that cost me a day. You'd think you'd feed the model your *whole*\nclip as context — more context, better continuation, right? Wrong, and wrong in a\nway that's invisible until you listen.\n\nSA3 Small is an 8-step adversarially-distilled model. If you stuff a long clip\n(say 100 seconds) into the buffer and mask only a few seconds at the end, the\ndistilled sampler collapses the masked region to near-silence. It shipped like\nthat once. The fix is counterintuitive: condition on **at most 30 seconds of the\nclip's tail**, not the whole thing. A bounded lead keeps the generated region\nsubstantial and healthy. And because the splice rejoins the new tail onto your\n*full* "
  },
  {
    "repo_id": "build-small-hackathon/Forager-Field-Notes",
    "title": "Forager Field Notes",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Forager-Field-Notes",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "forager",
      "field",
      "notes",
      "summary",
      "available"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Forager's Field Station\nemoji: 🍄\ncolorFrom: yellow\ncolorTo: green\nsdk: gradio\nsdk_version: 6.16.0\npython_version: \"3.11\"\napp_file: app.py\npinned: false\nhf_oauth: true\nlicense: apache-2.0\nshort_description: Pocket-sized intelligence for identifying edible wild foods\ntags:\n  - track:backyard\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# Forager's Field Station\n\nPhotograph a wild plant or mushroom and the model identifies it — **or refuses\nwhen it isn't sure.** A domain router plus three `tf_efficientnet_lite2`\nclassifiers (~9M params each), ~0.04B parameters total. The same stack runs\noffline on a Hailo 8L NPU in a handheld field device; this Space is its CPU twin.\n\nBuilt for the **Build Small Hackathon** — Backyard AI track. The honest fit:\na forager in the woods has no signal, so a small on-device model isn't a\ncompromise, it's the only thing that works.\n\n[![Watch the demo — a tiny on-device model that refuses when it isn't sure](https://img.youtube.com/vi/VsLo-ZmxIaU/maxresdefault.jpg)](https://youtu.be/VsLo-ZmxIaU)\n\n*▶ Watch the 4:45 demo: backyard photos run live through the router and experts.*\n\n💬 **Read the story / join the conversation on X:** [@thefullnacho](https://x.com/thefullnacho/status/2066218609576186274)\n\n## How it works\n\n```\nphoto ─► domain router (berry / mushroom / plant / other)\n            │  conf < 0.74  or  \"other\"  ─► ABSTAIN\n            ▼\n        ONE expert owns each domain (no cross-expert voting):\n            berry ─► berry_expert   mushroom ─► highvalue_expert\n            plant ─► medicinals_expert\n            │  below confidence gate  ─► ABSTAIN\n            ▼\n        SAFE / CAUTION / DEADLY  + scientific name, lookalike, key difference\n```\n\nSingle-expert routing is a safety choice: an off-domain expert never gets to\nmisclassify an input it doesn't own (e.g. 
the mushroom expert never sees a\nplant, so it can't call a poison hemlock \"ramps\"). The deadly plants live in the\nmedicinals expert, which scored 0% toxic-as-edible on held-out validation.\n\nThe system is built to **refuse by default.** Across real-world test photos it\nabstained rather than guess on the cases it couldn't handle, and never labelled\na deadly specimen as edible.\n\n## Models\n\n| Model | Domain | Classes |\n|---|---|---|\n| `domain_router_v2` | berry / mushroom / plant / other | 4 |\n| `berry_expert` | wild berries + toxic lookalikes | 11 |\n| `highvalue_expert` | chanterelles, morels, lion's mane, ginseng… | 11 |\n| `medicinals_expert` | wild medicinal plants + toxic lookalikes | 21 |\n\nTrained on iNaturalist research-grade observations. Apache-2.0.\n\n📦 **Weights published as fine-tuned models:** [HomesteaderLabs/forager-field-station-models](https://huggingface.co/HomesteaderLabs/forager-field-station-models) — `.pt` + `.onnx` for the router and all three experts, fine-tuned from `timm/tf_efficientnet_lite2.in1k`.\n\n## Field notes\n\nThe build story — edge constraints, the single-expert safety pivot, the\nsafety-vs-usefulness curve, and a one-line bug that inverted our OOD detector — is in\n[FIELD_NOTES.md](FIELD_NOTES.md).\n\n## Safety notice\n\n**Identification aid only — never an authority.** Wild plant and mushroom\nidentification carries fatal risk. No output should be acted on — including any\nconsumption decision — without independent verification by a qualified expert.\nAmatoxin poisoning (Amanita, Galerina, Conocybe) is lethal with no reliable\nfield antidote. The maintainers accept no liability for decisions made from\nmodel output.\n\n— [HomesteaderLabs](https://homesteaderlabs.com)\n"
  },
  {
    "repo_id": "build-small-hackathon/CityQuest-AI",
    "title": "Cityquest Ai",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/CityQuest-AI",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "cityquest",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: CityQuest AI\nemoji: 🗺️\ncolorFrom: pink\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: AI-generated multiplayer real-world city games\n\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# 🗺️ CityQuest AI\n\n**Turn any city into a multiplayer real-world game.** CityQuest AI generates complete,\nplayable adventures — **Scavenger Hunt, Hide & Seek, and Tag** — tailored to a real\nlocation, your group size, age range, difficulty, and theme. It grounds every quest in\nthe actual city (real districts, landmarks and parks), runs a live multiplayer room with\nproof-gated tasks and scoring, lets players record **voice journals** during play, and\nwraps the session up with an **AI narrative recap** and a generated **poster**.\n\n> Built for the **[Build Small Hackathon](https://huggingface.co/build-small-hackathon)**\n> by Gradio × Hugging Face (June 5–15, 2026). 
Small models, big adventures — everything\n> runs on models **≤ 4B parameters** through **llama.cpp**, orchestrated on a single GPU.\n\n---\n\n## 🔗 Links\n\n| | |\n| --- | --- |\n| 🎮 **Live App (HF Space)** | https://huggingface.co/spaces/build-small-hackathon/CityQuest-AI |\n| 💻 **GitHub** | https://github.com/NANInithin/CityQuest-AI |\n| 🤖 **Fine-tuned model** | https://huggingface.co/NANI-Nithin/CityQuest-Nemotron-3-Nano-4B-GGUF |\n| 🏕️ **Hackathon** | https://huggingface.co/build-small-hackathon |\n| 𝕏 **Social Media Post** | https://x.com/mnbhargav8/status/2066632722664010144?s=20 |\n| 🏃 **Demo game run** | https://www.youtube.com/watch?v=QtjaA9YZHOc |\n| 🏃 **Demo game recording** | https://youtu.be/wyrfTrp68yw |\n\n---\n\n## 🏆 Hackathon at a glance\n\n### Trail: An Adventure in Thousand Token Wood\n\n**Constraints met**\n- ✅ **Gradio app hosted on a Hugging Face Space**\n- ✅ **All models ≤ 32B** — in fact **≤ 4B** (NVIDIA Nemotron 3 Nano **4B** + OpenBMB MiniCPM5 **1B**)\n- ✅ **Load-bearing AI** — the entire game (rules, tasks, hints, safety, scoring, recap, poster) is model-generated, not scripted\n\n**Track:** primarily **Thousand Token Wood** (a delightful, original AI experience that\ngamifies real-world exploration), with strong **Backyard AI** relevance — it solves a real\nproblem for friends/family: planning a fun, safe group outing in minutes.\n\n### Sponsor tech we build on\n| Sponsor | How we use it |\n| --- | --- |\n| **NVIDIA Nemotron** | `Nemotron-3-Nano-4B` (GGUF) is our core game generator |\n| **OpenBMB** | `MiniCPM5-1B` (GGUF) powers the episode-recap path |\n| **Modal** | Full LoRA fine-tuning + GGUF conversion pipeline runs on Modal A100s |\n| **Cohere** | `cohere-transcribe-03-2026` transcribes in-game voice journals |\n\n### Bonus Quests\n| Badge | Status | Evidence |\n| --- | --- | --- |\n| 🦙 **Llama Champion** (llama.cpp runtime) | ✅ Earned | Nemotron **and** MiniCPM run via `llama-cpp-python` |\n| 🪙 **Tiny Titan** (≤4B models) | ✅ Eligible | Both LLMs 
are ≤4B; runs on modest hardware |\n| 🎛️ **Well-Tuned** (fine-tuned model on HF) | ✅ Eligible | LoRA fine-tune **published** at [`NANI-Nithin/CityQuest-Nemotron-3-Nano-4B-GGUF`](https://huggingface.co/NANI-Nithin/CityQuest-Nemotron-3-Nano-4B-GGUF) with a full reproducible Modal pipeline (see [`training/`](training/README.md)); under active development for location grounding |\n| 📓 **Field Notes** (dev report) | ✅ Eligible | **published** at [Teaching a 4B Model to Run a City-Wide Scavenger Hunt — Without Naming a Single Street](https://huggingface.co/blog/build-small-hackathon/cityquest-ai) |\n\n\n---\n\n## ✨ Features\n\n- **City-grounded generation** — live **Wikipedia** city context injects real districts,\n  landmarks and parks so quests reference actual places, not generic ones.\n- **Three game types** — Scavenger Hunt, Hide & Seek, Tag, each with type-appropriate\n  rules, tasks/zones, hints, and scoring.\n- **Schema-guaranteed output** — every generated game is validated against a strict JSON\n  schema, auto-**repaired** on failure, with a safe fallback — so the app never breaks.\n- **Multiplayer rooms** — create/join with a 6-character room code; synchronized state via\n  adaptive `gr.Timer` polling (1.5s when active, eases to ~3s when idle).\n- **Proof-gated tasks** — complete tasks with **photo / observation / text** proof; live\n  leaderboard, points, hints (with penalties), and a countdown timer.\n- **Ask-the-Guide** — per-task AI helper for clues and clarifications during play.\n- **Voice journals** — record audio during the quest; auto-transcribed (14 languages) with\n  a typed-input fallback.\n- **AI recap + poster** — a streamed narrative episode recap of how the game played out,\n  plus a cinematic poster image.\n- **Safety-first** — generated games include allowed zones, forbidden behaviors, adult-\n  supervision flags and stop conditions.\n\n---\n\n## 🧠 AI architecture — small models, orchestrated\n\nEverything is built around **sma"
  },
  {
    "repo_id": "build-small-hackathon/toddler-typist",
    "title": "Toddler Typist",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/toddler-typist",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "toddler",
      "typist",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Toddler Typist\nemoji: ⌨️\ncolorFrom: yellow\ncolorTo: purple\nsdk: gradio\nsdk_version: 6.17.3\npython_version: 3.12\napp_file: app.py\npinned: false\nhf_oauth: true\nhf_oauth_scopes:\n  - inference-api\nmodels:\n  - sa8/zog-qwen2.5-0.5b-storybook-gguf\n  - stabilityai/sd-turbo\npreload_from_hub:\n  - sa8/zog-qwen2.5-0.5b-storybook-gguf unsloth.Q4_K_M.gguf\n  - stabilityai/sd-turbo\nshort_description: Turn toddler keyboard-smashing into tiny illustrated stories\ntags:\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# Toddler Typist\n\nAs a stay-at-home mum, one of my biggest struggles is keeping my toddler entertained. Whenever I open my computer, he comes running to smash the keyboard. Toddler Typist turns that toddler keyboard-smashing into a tiny illustrated storybook: let a toddler type whatever they want, pick a story world, and the app turns the key-smashing into a gentle children's story with a drawing at the end.\n\nThis is a submission for the [Build Small Hackathon](https://huggingface.co/build-small-hackathon), in the **An Adventure in Thousand Token Wood** track.\n\n## Links\n\n- Demo video: [YouTube](https://www.youtube.com/watch?v=bYVQxh4kpCo)\n- Blogpost: https://huggingface.co/blog/sa8/toddler-typist\n- Social post: [LinkedIn](https://www.linkedin.com/posts/sarahazouvi_ai-smallmodels-opensourceai-share-7472187737471762434-4-nK/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAA0i0BwBGr3ZGobL4K_gG-_NKLed1-aYb04)\n- Story model: [sa8/zog-qwen2.5-0.5b-storybook-gguf](https://huggingface.co/sa8/zog-qwen2.5-0.5b-storybook-gguf)\n\n## How It Works\n\nThe app is a custom storybook frontend backed by [`gradio.Server`](https://www.gradio.app/docs/gradio/server), [`llama.cpp`](https://github.com/ggml-org/llama.cpp), a small local GGUF story model, and a local GPU image model.\n\nFor 
the story model, Toddler Typist uses a fine-tuned Qwen2.5 0.5B model exported as a Q4_K_M GGUF and run locally with llama.cpp. The model was fine-tuned on Modal. For image generation, it defaults to Stability AI's SD-Turbo. Both models stay well under the hackathon's 32B parameter limit.\n\nAt the end of a story, the app can finish the tale and generate a soft storybook drawing from the final text. If GPU image generation is unavailable, it falls back to a local SVG storybook illustration so the demo still shows a drawing.\n\nCodex helped build and iterate on the project, from the custom frontend to small story-generation improvements and the final submission polish.\n\n## Bonus Quests\n\nToddler Typist completes these Build Small Hackathon bonus quests:\n\n- **Off the Grid**: story generation runs locally through llama.cpp, and image generation runs locally with SD-Turbo. The app does not call cloud inference APIs at runtime.\n- **Well-Tuned**: the story model is fine-tuned and published on Hugging Face: [sa8/zog-qwen2.5-0.5b-storybook-gguf](https://huggingface.co/sa8/zog-qwen2.5-0.5b-storybook-gguf).\n- **Off-Brand**: the app uses a custom frontend beyond the default Gradio UI.\n- **Llama Champion**: the story model runs through llama.cpp.\n- **Field Notes**: blogpost/report published here: [Toddler Typist: Turning Keyboard-Smashing Into a Storybook](https://huggingface.co/blog/sa8/toddler-typist).\n"
  },
  {
    "repo_id": "build-small-hackathon/Tone-Bridge",
    "title": "Tone Bridge",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Tone-Bridge",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "openbmb",
      "modal",
      "track",
      "tone"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: ToneBridge\nemoji: 🏮\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.13.0\npython_version: \"3.10\"\napp_file: app.py\nfullWidth: true\nheader: mini\npinned: false\nshort_description: A gentle Mandarin sentence coach.\ntags:\n  - build-small-hackathon\n  - backyard-ai\n  - thousand-token-wood\n  - openbmb\n  - minicpm\n  - openai-codex\n  - modal\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\n  \n  - off-brand \n  - off-the-grid \n  - well-tuned \n  - off-brand\n  - field-notes \n  - best-demo \n  - bonus-quest-champion \n  - judges-wildcard\n\n  \nmodels:\n  - Alphaplasti/ToneBridge-MiniCPM4.1-8B\n---\n\n# ToneBridge - Mandarin sentence coach\n\n> Build natural Mandarin sentences, one small correction at a time.\n\nBuilt for the Hugging Face **Build Small Hackathon**.\n\n## Contributer\n- **@Alphaplasti**\n- **@ZHAO JINGZI** \n\n\n## The Problem\n\nBeginner Mandarin learners often know what they want to say, but not whether the sentence sounds natural, polite, or appropriate for the social context.\n\nClassic translators tend to rewrite too much. Grammar tools often explain too much. 
A beginner needs something narrower: keep my meaning, fix only what is needed, show the pinyin, and tell me why in plain English.\n\n**ToneBridge is built for that moment.** You choose one context-tone profile, write or speak one Chinese sentence, and get a small, practical correction designed for learning rather than translation.\n\n## What It Does\n\nToneBridge returns:\n\n- one corrected Mandarin sentence;\n- pinyin with tone marks under Chinese text;\n- a short error type;\n- a concise explanation in English;\n- a practical tip for next time;\n- a natural Mandarin reading voice with a follow-along reading view.\n\nThe correction prompt is intentionally conservative: if the sentence is already correct and natural, the corrected sentence should remain unchanged.\n\n## How It Works\n\n1. The learner selects one profile: **Friendly-informal**, **Work-informal**, **Work-formal**, **Wechat-informal**, or **Wechat-formal**.\n2. ToneBridge applies a conservative tone-aware correction for that profile.\n3. They type a Chinese sentence, or use browser speech recognition.\n4. MiniCPM corrects the sentence while preserving the learner's meaning and length.\n5. The frontend adds pinyin under Chinese text.\n6. Edge TTS generates a fast Mandarin Neural reading of the corrected sentence.\n7. 
The reading panel highlights characters while the audio plays.\n\n## What's Inside\n\n| Component | Model / Library | Where it runs |\n| --- | --- | --- |\n| Sentence correction | **Alphaplasti/ToneBridge-MiniCPM4.1-8B** via `transformers` | ZeroGPU / GPU-backed Space |\n| Mandarin reading voice | `edge-tts` with `zh-CN-YunjianNeural` by default | Server |\n| Pinyin | `pypinyin` with tone marks | CPU |\n| Voice input | Browser Web Speech API | Browser-dependent |\n| Frontend | Custom HTML/CSS/JS served by `gr.Server` | Browser |\n| Backend API | `gr.Server` + `@app.api()` endpoints | Hugging Face Space |\n\nThe active model pipeline stays under the 32B-parameter target: the main correction model is 8B, and the default reading voice uses a lightweight server-side Edge TTS call instead of loading a second GPU model.\n\n## Hardware And Loading\n\nToneBridge is designed for Hugging Face ZeroGPU.\n\n- The correction model is preloaded at Space startup so it is not reloaded on every correction.\n- The reading voice uses Edge TTS by default, so replay avoids loading a heavy server-side TTS model.\n\n## Space Variables\n\nUseful environment variables:\n\n```text\nMODEL_ID=Alphaplasti/ToneBridge-MiniCPM4.1-8B\nTTS_PROVIDER=edge\nENABLE_SERVER_TTS=true\nEDGE_TTS_VOICE=zh-CN-YunjianNeural\nEDGE_TTS_RATE=+0%\nEDGE_TTS_PITCH=+0Hz\nEDGE_TTS_VOLUME=+0%\nEDGE_TTS_KARAOKE_DURATION_FACTOR=0.86\nLOAD_IN_4BIT=true\nPRELOAD_MODEL=true\nMAX_INPUT_CHARS=1200\nMAX_NEW_TOKENS=220\nMETRICS_FILE=tonebridge_usage_metrics.jsonl\nMETRICS_REPO_SYNC=false\nMETRICS_REPO_ID=build-small-hackathon/Tone-Bridge\nMETRICS_REPO_PATH=tonebridge_usage_metrics.jsonl\nHF_METRICS_TOKEN=<write token as a Space secret>\n```\n\nIf the correction model is private, add `HF_TOKEN` as a Space secret with read access to `Alphaplasti/ToneBridge-MiniCPM4.1-8B`.\n\n## Usage Metrics And Feedback\n\nEvery saved correction is written to `tonebridge_usage_metrics.jsonl` in the running Space app folder by default. 
Relative `METRICS_FILE` values are resolved from the folder that contains `app.py`.\n\nThe Hugging Face **Files** tab shows the Space git repository, not every runtime file created while the app is running. To make the metrics file appear in **Files**, enable repo sync:\n\n```text\nMETRICS_REPO_SYNC=true\nMETRICS_REPO_ID=build-small-hackathon/Tone-Bridge\nMETRICS_REPO_PATH=tonebridge_usage_metrics.jsonl\nHF_METRICS_TOKEN=<write token as a Space secret>\n```\n\nIf the Space is public, synced metrics are public too. Each record includes:\n\n- `original_sentence`\n- `correct"
  },
  {
    "repo_id": "build-small-hackathon/caro5",
    "title": "Caro5",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/caro5",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "caro5",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Caro5\nemoji: 🎯\ncolorFrom: gray\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.13'\napp_file: app.py\npinned: false\nshort_description: Caro 5 arena with bot smart player and AI commentary (WIP)\nlicense: mit\ntags:\n - track:wood\n - sponsor:codex\n - sponsor:nvidia\n - sponsor:modal\n - achievement:offbrand\n - achievement:welltuned\n - achievement:tinytitan\n - achievement:bestdemo\n - achievement:sharing\n - achievement:fieldnotes\n\n---\n\n# Caro5\n\nA browser-based Caro 5 Game so you can play human vs human, human vs bot. I tried to stay away from the usual board and stones look. I believe the web can be an expression of beauty as well as of utility.\n\nFeatures tactical opponents, adaptive difficulty, and natural-language commentary that explains moves, threats, and strategy.\n\n![Caro5 Cover](assets/hf-readme/cover.jpg)\n\n\n## 🎮 Try It\n\n### [Live Game]\nhttps://huggingface.co/spaces/build-small-hackathon/caro5\n\n\n\n\n\n## Features\n\n### 🧠 Multiple AI Levels with different personalities!\n\nChoose your oppponent!\n\n**Streamer, Coach, Analyst, Friendly, Funny, Lover, Serious**\n\n## Build Small Hackathon Fit\n\n### REQ-01: Build a small model\n\nI used two models to fine tune. Qwen/Qwen3-1.7B to train Du Rong, the Funny Dragon, and nvidia/OpenReasoning-Nemotron-1.5B on a smaller subset for Hòa Long, the Friendly Dragon. I had hopes to in time the training for the others, but the process terminated overnight.\n\nI trained the newer models for only 1 epoch each, just to give them a bit of flavor.\n\n### REQ-02: Built With Gradio\n\nAs mentioned, this is gradio hooked with a lot of fun interactions. Fully playable, has background music, winning and defeat songs. 
Each personality has its own sprite, and you can choose your own dragon icon, emojis, skin colors, and symbols, and draw towers or solid stones.\n\n![Caro5 Gameplay](assets/hf-readme/readme-03.jpg)\n\nShout-out to Brenda Chiang, who made the songs.\n\n### REQ-03 Demo\n\n[YouTube Playthrough](https://www.youtube.com/shorts/rsKrMIF9-rU)\n\n### REQ-04 Social Media Posts\n\nhttps://www.linkedin.com/posts/pedrolcarvalho_caro5-build-small-hackathon-first-days-activity-7470080151003840512-UbXC\n\nhttps://www.linkedin.com/posts/pedrolcarvalho_ai-huggingface-hackaton-activity-7468612916347437056-1WIp\n\nhttps://www.linkedin.com/posts/pedrolcarvalho_reinforcedlearning-datapipeline-ml-activity-7470506511111200769-BVLh\n\nhttps://www.linkedin.com/posts/pedrolcarvalho_dataset-machinelearning-zobristhashing-activity-7470901078943760386-G0u6\n\nhttps://www.linkedin.com/posts/pedrolcarvalho_caro5-day-5-and-6-generating-dataset-and-activity-7471207969880272896-1_r2\n\nhttps://www.linkedin.com/posts/pedrolcarvalho_something-nice-is-coming-soon-activity-7472372419408621568-Cqkh\n\n\n### REQ-05: Limited GPU\n\nNo GPU needed.\n\n### REQ-06 Tag your README\n\nTags added!\n\n\n## Sponsors\n\n\n### Modal\n\nModal saved this project, to be honest. In the beginning, generating even a few hundred games would freeze the laptop, and nothing else got done. Eventually all the generation, training, and arena runs were offloaded to Modal. I even built a dashboard for it.\n\nOn it I can filter datasets by gen and schema, download the remote datasets, and merge, prepare, and audit them. 
For training, I can promote the champion if the arena\n\n![Caro5 Modal Pipeline Dashboard](assets/hf-readme/readme-01.jpg)\n\n### Nvidia\n\nTrained a small nvidia/OpenReasoning-Nemotron-1.5B on commentary by Hòa Long, the Friendly Dragon.\n\nQuantized model: [pedroc/caro5-commentary-f16-Q4_K_M.gguf](https://huggingface.co/pedroc/caro5-commentary-f16-Q4_K_M.gguf)\n\nFine-tuned model: [pedroc/caro5-nemotron-15b-friendly-lora-smoke](https://huggingface.co/pedroc/caro5-nemotron-15b-friendly-lora-smoke)\n\nFriendly commentary dataset: [pedroc/caro5-commentary-friendly](https://huggingface.co/datasets/pedroc/caro5-commentary-friendly/)\n\nMore to come!\n\n### Codex\n\nI couldn't have done it without Codex, of course. I burned so many tokens building everything from the ground up while it kept explaining the basic concepts to me.\n\n\n## Badges\n\n### Off-Brand\n\nI think the game is polished and has a unique look and feel. The music and art are original.\n![Caro5 Gameplay](assets/hf-readme/readme-03.jpg)\n\n### 02 Tiny Titan\n\nThe 4-bit quantized version of the model, based on the 1.5B model, fits under 1 GB.\n\n### 03 Best Demo\n\n[Play Caro5 - PLAY MORE](https://huggingface.co/spaces/build-small-hackathon/caro5)\n\nI believe this is a great app: it is fun to play and has a lot of original ideas. I tried my best.\n\nI documented all my work and I'm sharing everything I learn.\n\n[YouTube Playthrough](https://www.youtube.com/shorts/rsKrMIF9-rU)\n\n### 05 Bonus quest\n\nThe extra constraint was that I started with barely any knowledge of machine learning and ended up with a quantized model and a smart, learning neural network.\n\n\n### Built With\n- Gradio\n- Hugging Face\n- llama.cpp (failed)\n- MCTS\n- Python\n\n\n\n### Local\n\nEverything runs within the game. The AI bot gets the next move based on the weighted a"
  },
  {
    "repo_id": "build-small-hackathon/medical-appt-prep",
    "title": "Medical Appt Prep",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/medical-appt-prep",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "medical",
      "appt",
      "prep",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Medical Appointment Prep\nemoji: 🩺\ncolorFrom: blue\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.16.0\napp_file: app.py\nsuggested_hardware: zero-a10g\npinned: false\ntags:\n  - track:backyard\n  - sponsor:openai\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# Medical Appointment Prep\n\nHosted Gradio Space for the Medical Appointment Prep Assistant.\n\nHackathon focus: Backyard AI appointment preparation with a custom Gradio Server\nUI, one-click fictional sample test data, full-report export actions, and a small\nMedGemma 1.5 4B GGUF model running through llama.cpp.\n\nProject links:\n\n- Field Notes article: https://huggingface.co/blog/bisonnetworking/medical-asst-prep-june26\n- Video walkthrough: https://youtu.be/kRHPwQGC6aU\n- LinkedIn project update: https://www.linkedin.com/posts/timothy-champ_build-small-hackathon-starts-in-6-hours-activity-7471918851132207104-Wh7T?utm_source=share&utm_medium=member_desktop&rcm=ACoAAEBVNSMBksj_L93Oketlrphm5iSG9BGmBqU\n- GitHub repository: https://github.com/bisonbet/medical-appt-prep\n\n> I always find it hard to remember what to ask or mention in the middle of an\n> appointment. Helping a parent with appointments can be even more confusing,\n> and this helps ensure we get the right help needed. Having this sheet to read\n> from or share with my doctor will help me in my future appointments.\n>\n> - My Spouse\n\nThe Space launches the custom `gradio.Server` interface by default: a warm\nHTML/CSS/JS frontend backed by Gradio's queued API. Set `APP_UI_MODE=blocks`\nonly if you need the fallback Gradio Blocks interface.\n\nHosted startup model warmup is disabled by default so anonymous visitors do not\nspend ZeroGPU quota before submitting a report. 
Set `SPACE_ENABLE_MODEL_WARMUP=1`\nonly when intentionally testing cold-start behavior.\n\nRequired Space secret:\n\n- `HF_TOKEN`: Hugging Face token, used for Hub model downloads when needed\n\nHosted model defaults:\n\n- `MODEL_BACKEND=llama_cpp`\n- `MODEL_PRESET=medgemma-4b`\n- `LLAMA_CPP_MODEL_REPO_ID=unsloth/medgemma-1.5-4b-it-GGUF`\n- `LLAMA_CPP_MODEL_FILENAME=medgemma-1.5-4b-it-Q4_K_M.gguf`\n- `LLAMA_CPP_N_GPU_LAYERS=-1`\n- `LLAMA_CPP_N_BATCH=2048`\n- `LLAMA_CPP_N_UBATCH=1024`\n- `LLAMA_CPP_FLASH_ATTN=1`\n- `LLAMA_CPP_OP_OFFLOAD=1`\n- `LLAMA_CPP_SWA_FULL=0`\n- `MODEL_CONTEXT_LENGTH=8192`\n- `MODEL_MAX_NEW_TOKENS=256`\n- `MODEL_TEMPERATURE=0.3`\n- `APP_DEPLOYMENT=huggingface`\n\nThese defaults are enforced by `app.py` so stale Space Variables do not switch\nthe competition build back to another backend. Set `SPACE_USE_ENV_MODEL_CONFIG=1`\nonly if you intentionally want Space Variables to override the model backend.\n\nThe hosted Space installs the CUDA 13.0 `llama-cpp-python` wheel, requests\nZeroGPU around report generation, and downloads the Unsloth MedGemma 1.5 4B\nQ4_K_M GGUF from the Hub at startup/cache time.\nSet `LLAMA_CPP_VERBOSE=1` temporarily if you need full llama.cpp CUDA layer\noffload logs for verification.\n\nBuild this Space folder from the main repo with:\n\n```bash\npython scripts/export_hf_space.py /path/to/hf-space-repo\n```\n"
  },
  {
    "repo_id": "build-small-hackathon/local-in-30s",
    "title": "Local In 30S",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/local-in-30s",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "local",
      "30s",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: \"Momo: Local in 30s\"\nemoji: 🌍\ncolorFrom: yellow\ncolorTo: green\nsdk: gradio\nsdk_version: \"5.49.1\"\npython_version: \"3.10\"\napp_file: app.py\nhardware: zero-a10g\npinned: false\nshort_description: \"Turn travel photos into local cultural stories\"\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# Momo: Become Local in 30 Seconds\n\n**Momo** turns a travel photo into a playful cultural passport. Snap or upload a dish, temple, street sign, ritual, landmark, or everyday gesture, and a small multimodal model decodes it into local cultural meaning.\n\nBuilt for **Track 2 — Thousand Token Wood**.\n\n## Submission Links\n\n- **Space:** https://huggingface.co/spaces/build-small-hackathon/local-in-30s\n- **Demo video:** https://youtu.be/ixmp0grzGtU\n- **Social media post:** https://x.com/i/status/2066069050481066303\n- **Field notes / blog:** https://huggingface.co/blog/build-small-hackathon/momo-local-in-30s\n\n## Team\n\n- Jacky: @Jackylau666\n- Partner: @TuanziJoy7\n\n## Prize Targets\n\n- **Best MiniCPM Build** — the core live decode uses `openbmb/MiniCPM-V-4.6`.\n- **Off Brand / Custom UI** — the Space uses a custom travel-game interface rather than stock Gradio components.\n- **Field Notes** — we published a build write-up on the Hugging Face Blog.\n- **Best Demo** — demo video + social post + Space are designed as one clear story.\n\n## The Idea\n\nThe hardest part of travel is not always language. It is culture.\n\nTranslation can tell you what words mean. Momo tries to explain what a moment means to the people who live there.\n\nEvery photo becomes a four-layer cultural decode:\n\n1. **What tourists see** — a clear description of the visible scene.\n2. **What locals know** — the hidden rule, local context, or cultural detail most visitors miss.\n3. **What not to do** — practical etiquette, taboos, or mistakes to avoid.\n4. 
**Why it matters** — the deeper history, emotion, or social meaning behind the moment.\n\n## Why Track 2\n\nMomo is designed as an AI-native travel experience: part local guide, part discovery journal, part collectible passport.\n\nThe delight is not just recognizing an object. The delight is watching a small model turn an ordinary travel photo into a story you can collect, remember, and share.\n\n## How It Works\n\n1. Open the Space and enter the **Compass / Discover** flow.\n2. Upload or snap a travel photo.\n3. `openbmb/MiniCPM-V-4.6` reads the image and generates the cultural decode.\n4. Save the result into the Journey / Passport experience as a collectible memory.\n\nThe app also includes curated demo discoveries so judges can still evaluate the core UX if ZeroGPU is waking up or temporarily busy.\n\n## Models\n\nCore judged flow:\n\n- `openbmb/MiniCPM-V-4.6` — image understanding + cultural decoding, about **1.30B** parameters.\n\nExperimental backend:\n\n- `openbmb/VoxCPM2` — local-guide narration endpoint, about **2.29B** parameters.\n\nBoth models are well under the Build Small 32B model limit. Even combined, they are about **3.59B** parameters.\n\n## Tech Stack\n\n- **Gradio Space** hosted inside the official Build Small organization.\n- **Custom React frontend** embedded inside Gradio for a polished travel-game UI.\n- **MiniCPM-V-4.6** for real uploaded-photo cultural decoding.\n- **LocalStorage** for saved Journey memories.\n- **Curated fallback discoveries** for stable judging and demo flow.\n\n## What Makes It Different\n\nMany people travel to collect photos. Momo helps them collect stories.\n\nMany AI tools translate language. Momo tries to translate culture.\n\nThirty seconds may not be enough to learn a new language, but it can be enough to see the world through local eyes.\n"
  },
  {
    "repo_id": "build-small-hackathon/blood-test-explainer",
    "title": "Blood Test Explainer",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/blood-test-explainer",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "blood",
      "test",
      "explainer",
      "summary"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Find one topic this Space teaches well.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: Blood Test Explainer\nemoji: 📊\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.17.3\npython_version: \"3.10.13\"\napp_file: app.py\npinned: false\nstartup_duration_timeout: 1h\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Blood Test Explainer, teaching a 1.3B model to read your lab report, offline\n\n## Demo & links\n\n- ▶️ **Demo video:** [https://youtu.be/wg26ETLxh9Y](https://youtu.be/wg26ETLxh9Y)\n- 🌐 **Live Space:** [https://huggingface.co/spaces/build-small-hackathon/blood-test-explainer](https://huggingface.co/spaces/build-small-hackathon/blood-test-explainer)\n- 📓 **Write-up (Field Notes):** [https://huggingface.co/blog/build-small-hackathon/blood-test-explainer](https://huggingface.co/blog/build-small-hackathon/blood-test-explainer)\n- 💬 **Social post:** [https://www.linkedin.com/posts/dimitrios-kalligaridis_buildsmall-smallmodels-opensource-share-7471874924077477888-VKb4/](https://www.linkedin.com/posts/dimitrios-kalligaridis_buildsmall-smallmodels-opensource-share-7471874924077477888-VKb4/)\n- 🤖 **Model:** [https://huggingface.co/build-small-hackathon/blood-test-minicpmv-4_6-medreason](https://huggingface.co/build-small-hackathon/blood-test-minicpmv-4_6-medreason)\n- 💻 **Code:** [https://github.com//blood-test-explainer](https://github.com/r0m4k/blood-test-explainer)\n\nWe are **Roman and Dimitris**, graduates of the **American College of Greece (Deree) AI Lab**, where we currently do research. We built *Blood Test Explainer* for the Build Small hackathon. 
You upload a photo or PDF of a blood test, and a small model running entirely on the Space reads it, pulls out the markers, values and reference ranges, and explains what each one means in plain language, grounded in a medical knowledge base.\n\n**Our inspiration was a real problem for real people.** Almost everyone has stared at a lab report, seen a column of numbers and \"H\"/\"L\" flags, and had no idea what any of it meant. The information is right there, but it is locked in medical shorthand, and it is exactly the kind of private data you do not want to paste into a chatbot you do not control. We wanted a tool a parent or a neighbor could use on their own laptop, that reads the report, explains it honestly, and never sends their health data anywhere.\n\nHere is the whole pipeline:\n\n```\nPDF / image  ->  MiniCPM-V 4.6 (vision)  ->  structured JSON  ->  KB-grounded explanation\n                 reads the document          markers + values     per-marker + patterns\n```\n\nTo make it easy to evaluate, we organized this write-up around the **six merit badges** (each one maps to a concrete engineering decision) and then the **three sponsor technologies** that made it possible.\n\n---\n\n## 🔌 Off the Grid\n\nThe whole thing runs on the model in front of you. MiniCPM-V 4.6 is loaded inside the Space and does the reading there; there is no call to OpenAI, Anthropic, or any hosted inference API. For a health tool this is not a nice-to-have, it is the point: your blood test never leaves the machine it is processed on. The same design runs on a laptop with the model on local hardware, which is what \"small models, local-first\" is supposed to feel like.\n\n## 🎯 Well-Tuned\n\nThis is the part we are proudest of, because it started as a failure. The base MiniCPM-V 4.6 is already a strong document reader, so our first instinct was to fine-tune it on our exact extraction schema. 
That collapsed the model: by memorizing our narrow synthetic format it forgot how to read a real report, and field-level F1 on real reports dropped from 0.66 to 0.08.\n\nWe built a field-level evaluation (precision / recall / F1 on hand-labeled real reports) so we could measure every change honestly. Here is the whole journey:\n\n| Iteration | What we did | Marker F1 | Recall | vs base |\n|---|---|---|---|---|\n| **Base MiniCPM-V 4.6** | nothing | **0.655** | 0.529 | |\n| v1, schema LoRA | fit our JSON schema (4k synthetic, 2 epochs, lr 1e-4) | 0.078 | 0.059 | catastrophic |\n| v2, gentler + diverse | lr 2e-5, 1 epoch, varied synthetic layouts | 0.333 | 0.265 | still worse |\n| v3, + real reports | mixed in real labeled reports, oversampled | 0.417 | 0.294 | still worse |\n| **medreason (100 ex)** | **fine-tune on general medical reasoning** | **0.746** | **0.647** | **+0.09** |\n| medreason (4000 ex) | more reasoning data | 0.667 | 0.559 | +0.01 |\n\n![Fine-tuning journey: F1 across our iterations vs the base model](docs/finetune_journey.png)\n\nThe breakthrough was to stop teaching the model our schema and teach it general medical knowledge instead. We took a LoRA, froze the vision encoder, and fine-tuned only the language layers on a general medical-reasoning dataset (FreedomIntelligence/medical-o1-reasoning-SFT), text only, nothing about extraction. The model got *better* at extraction (F1 0.66 to 0.75, recall 0.53 to 0.65) b"
  },
  {
    "repo_id": "build-small-hackathon/townlet",
    "title": "Townlet",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/townlet",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "award",
      "townlet",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Townlet\nemoji: 🏘️\ncolorFrom: red\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.17.3\npython_version: \"3.12\"\napp_file: app.py\nstartup_duration_timeout: 1h\npreload_from_hub:\n  - HuggingFaceTB/SmolLM2-360M-Instruct-GGUF\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF\n  - Qwen/Qwen2.5-Coder-3B-Instruct-GGUF\n  - openbmb/MiniCPM3-4B-GGUF\n  - bartowski/Llama-3.2-3B-Instruct-GGUF\nmodels:\n  - HuggingFaceTB/SmolLM2-360M-Instruct-GGUF\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF\n  - Qwen/Qwen2.5-Coder-3B-Instruct-GGUF\n  - openbmb/MiniCPM3-4B-GGUF\n  - bartowski/Llama-3.2-3B-Instruct-GGUF\ntags:\n  - thousand-token-wood\n  - nemotron\n  - llama-cpp\n  - smolagents\n  - agent-simulation\n  - track:wood\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n  - award:tiny-titan\n  - award:best-agent\nshort_description: Agents share a Python interpreter, mine, message, dream.\n---\n\n# Townlet\n\nBuild Small Hackathon entry (Thousand Token Wood track).\n\n## Demo, blog & social\n\n<video controls width=\"640\" src=\"https://huggingface.co/spaces/build-small-hackathon/townlet/resolve/main/docs/townlet.mp4\"></video>\n\n- **Demo video:** [Watch on the Space](https://huggingface.co/spaces/build-small-hackathon/townlet/blob/main/docs/townlet.mp4) · [direct mp4](https://huggingface.co/spaces/build-small-hackathon/townlet/resolve/main/docs/townlet.mp4)\n- **Blog (Field Notes):** [Small models — its about the harness](https://www.linkedin.com/pulse/small-models-its-harness-matthew-auburn-gixwe/)\n- **Social post:** [LinkedIn announcement](https://www.linkedin.com/posts/matthew-a-b62b7b6b_i-entered-a-hackathon-that-hugging-face-put-ugcPost-7472246708526157825-H39G/)\n\nTownlet is a multi-agent simulation. LLM-driven characters live on a\ntile-grid map. 
They walk to a cave to mine electricity, to a well to draw\nwater, and to a shared shell to spend resources running Python statements\nin a single long-lived interpreter. They post on a public message board.\nThey sleep, they dream, they evolve. The shared interpreter is the world's\nonly mutable substrate — characters can read or rewrite each other's state,\nand crash the operating system if they so choose. Every character knows\nthis. Mutual awareness of the brick-risk is the source of dramatic tension.\n\nThe game element is **customisation**: click a character to edit their\npersonality (system prompt), swap which small model drives them, or spawn\nnew ones from a pre-baked archetype pool. The simulation is otherwise\nself-driving.\n\n## Model stack\n\nEvery character runs on **NVIDIA Nemotron 3 Nano 4B**\n(`nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF`) by default. One model across the\nwhole town means each `@spaces.GPU` fork pays a single cold-load and every\nsubsequent character decision reuses the same weights in VRAM — a real\nquota saving on Zero-GPU.\n\nClick any character in the side panel to swap them to a different model\nmid-simulation; the roster bundled at build time covers:\n\n- **NVIDIA Nemotron 3 Nano 4B** — default\n- **Qwen2.5 Coder 3B** — for code-heavy goals\n- **OpenBMB MiniCPM3 4B**\n- **Meta Llama 3.2 3B**\n- **SmolLM2 360M** — terse, cheap, useful for testing the model-swap UI\n\nAll run on Zero-GPU via [`llama-cpp-python`](https://github.com/abetlen/llama-cpp-python),\nwhich gives us the 🦙 Llama Champion badge regardless of which roster\nmodel is selected (the badge requirement is \"model runs through llama.cpp\nruntime\", not a specific model family).\n\nAgent loops use [smolagents](https://huggingface.co/docs/smolagents) `CodeAgent`,\nemitting Python tool calls the scheduler queues onto the world.\n\nThe map is rendered by the frontend canvas — no image model in the loop.\n\n## Local development\n\nThe project runs in **mock mode** locally 
— no model weights downloaded, no\nCUDA needed. Mocks return canned tool-call sequences whose style varies by\nmodel_id (deliberate / terse / verbose / chaotic), so the model-swap UI is\nexercisable without GPUs.\n\n```bash\npython -m venv .venv\nsource .venv/bin/activate\npip install -r requirements-dev.txt\npython app.py\n```\n\nOr hit ▶ on the **\"Run Small Stage (mocks)\"** config in VS Code.\n\nThe mode is selected automatically:\n\n- `SPACE_ID` env var present (set by HF Spaces) → real models\n- Otherwise → mocks\n- Set `FORCE_REAL_MODELS=1` to override locally (requires Spaces deps).\n\n## Deploy to Hugging Face Spaces\n\n```bash\ngit remote add hf https://huggingface.co/spaces/<org>/<space-name>\ngit push hf main\n```\n\nThen in the Space *Settings* → Hardware, pick **Zero-GPU**. The\n`preload_from_hub` block above pre-downloads every roster model at build\ntime so the first user click hits a warm cache.\n\n## Project layout\n\n```\napp.py                  # gr.Server entrypoint\nbackend/                # inference layer (mock + real, env-selected)\n  factory.py            # get_llm(model_id) — per-id cached\n  mock.py, real.py      # MockLLMBackend / RealLLMBackend\n  interfaces.py         # LLMBackend Protocol\ngame/                   # the simulation\n  models.py   "
  },
  {
    "repo_id": "build-small-hackathon/feeled-lite",
    "title": "Feeled Lite",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/feeled-lite",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "badge",
      "achievement",
      "feeled",
      "lite",
      "summary",
      "available"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Try one learning prompt and check if the explanation is clear.",
    "likes": 14,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: FeelEd Lite\nemoji: 📚\ncolorFrom: indigo\ncolorTo: purple\nsdk: gradio\nsdk_version: 5.29.0\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: TN Samacheer-grounded AI tutor for Tamil Nadu Board students\ntags:\n  - hackathon-track:backyard-ai\n  - badge:off-brand\n  - badge:best-demo\n  - education\n  - tamil\n  - tn-board\n  - rag\n  - small-language-model\n  - track:backyard\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# 📚 FeelEd Lite — Tamil AI Tutor for TN Board Students\n\n> கற்க கசடற — திருக்குறள் 391\n\n**Build Small Hackathon 2026 · Backyard AI Track 🏡**\n\n## What it does\n\nFeelEd Lite is an AI learning companion for Tamil Nadu Board students (Grades 9-12). Students ask in Tamil, English, or Tanglish and get answers grounded in actual TN Samacheer curriculum content.\n\n## 🎬 Demo Video\nhttps://youtu.be/HJuBeihFU6w\n\n## 🔗 Social Post\nhttps://www.linkedin.com/posts/velayutham-s-loopbackcinema_buildsmall-huggingface-tamilai-share-7472395543827177472-aY6_/\n\n## Features\n- 📖 Chat Mode — TN Samacheer grounded Q&A\n- ✨ Story Mode — concepts through Tamil characters\n- 🎯 Exam Mode — TN Board style questions\n- 🌐 Tamil · English · Tanglish\n- Grades 9–12 · 13 subjects\n\n## Tech Stack\n- RAG: 21,000+ TN Samacheer passages via Pinecone\n- LLM: Qwen2.5-72B via HuggingFace Inference\n- Embedding: Gemini embedding-001\n- UI: Gradio 5.29.0 custom dark theme\n\n## Track\n🏡 Backyard AI — tested with real Grade 11 Tamil Nadu Board students\n\n## Builder Philosophy\nWhy builder accountability matters in educational AI:\n🔗 https://www.linkedin.com/feed/update/urn:li:activity:7470799539273310208/\n"
  },
  {
    "repo_id": "build-small-hackathon/MatchWise",
    "title": "Matchwise",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/MatchWise",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "matchwise",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Play one round and note what made it fun.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: Match Wise\nemoji: 🃏\ncolorFrom: purple\ncolorTo: pink\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: true\nshort_description: AI-Powered Educational Memory Game with Llama.cpp\nhf_oauth: true\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n  - matchWise\n  - gradio\n  - minicpm5-1b\nthumbnail: >-\n  https://cdn-uploads.huggingface.co/production/uploads/683d69c43015d6c975e276c1/f9OyOJSr8M08WCArixzvb.png\n---\n\n# 🃏 MatchWise\n\n**AI-Powered Educational Memory Game with Adaptive Challenge Levels using MiniCPM5 llm via Llama.cpp**\n\n[![The Build Small Hackathon](https://img.shields.io/badge/The%20Build%20Small%20Hackathon-Hugging%20Face-yellow?style=for-the-badge)](https://huggingface.co/build-small-hackathon)\n![An Adventure in Thousand Token Wood](https://img.shields.io/badge/An%20Adventure%20in%20Thousand%20Token%20Wood-TRACK-blue?style=for-the-badge)\n\n---\n\n### 🚀 Purpose\n\nMatchWise attempts to make memory training more playful and educational by combining emoji card matching, AI-generated learning themes, session-wise performance tracking, and challenge levels that unlock when the player performs well.\n\nThe game starts simple, then gradually moves the player from **Easy** toward **Challenge Me** using a performance meter based on level completion, matching accuracy, and gameplay progress.\n\nInstead of using fixed handcrafted stages, MatchWise keeps generating fresh memory boards and learning themes with LLM, so players can continue progressing through an endless learning loop.\n\n---\n\n### ✨ Key Features of MatchWise\n\n* 🎴 Emoji-based memory card matching game\n* 🤖 Infinite AI-generated levels with fresh themes and card sets\n* 🧠 Performance Meter that moves from **Easy** to **Challenge Me**\n* ⚡ Challenge levels unlocked through strong gameplay performance\n* 
❤️ 5-life system to begin with\n* 💡 Peek system to help players during difficult boards\n* 🏆 Persistent leaderboard support using SQLite storage\n* 🎮 Interactive Gradio UI with polished game-style visuals\n\n---\n\n### 👨‍⚖️ Hackathon Tracks & Judging Notes\n\n👉 [Social Media Post](https://x.com/tejasashinde/status/2066230745228886325)\n\n👉 [Watch MatchWise in action](https://youtu.be/Wlb0md_pMDg)\n\n📴 Off the Grid / Local-first: No cloud AI APIs are used. The game runs locally through MiniCPM5-1B-GGUF inside the Space.\n\n🎨 Off-Brand / Custom UI: MatchWise uses a custom game-style frontend with handcrafted HTML, CSS, and JavaScript instead of the default Gradio look.\n\n🦙 Llama Champion: The model runs through the llama.cpp runtime using llama-server.\n\n📡 Sharing is Caring / Open Trace: I've shared the Codex agent trace artifacts on the Hub so the community can explore my development process and build on top of it: [MatchWise Open Trace](https://huggingface.co/datasets/build-small-hackathon/MatchWise-agent-trace)\n\n📓 Field Notes: I also wrote a full build report about the design, challenges, small-model constraints, and lessons learned: [Building MatchWise](https://huggingface.co/blog/build-small-hackathon/building-match-wise)\n\n🗃️ Collection: https://huggingface.co/collections/build-small-hackathon/matchwise\n\n---\n\n### 🧠 Models Dataset and APIs Used\n\n* **LLM:** openbmb/MiniCPM5-1B-GGUF model running locally through llama.cpp\n* **Model File:** MiniCPM5-1B-Q4_K_M.gguf\n* **Inference Runtime:** llama.cpp `llama-server`\n* **Game Content:** AI-generated level titles, educational messages, challenge content, learning simple facts\n* **Leaderboard Storage:** SQLite database stored in HuggingFace Bucket capturing HF username and high scores\n* **UI Framework:** Gradio with custom HTML, CSS, and JavaScript game logic\n\n---\n\n### 🛠️ Core Functionality Overview\n\n1. Start the MatchWise game from the landing screen\n2. 
Memorize the emoji cards during the preview timer\n3. Flip cards and match identical emoji pairs\n4. Earn score, peeks, and performance progress through clean gameplay\n5. Move the Performance Meter from **Easy** toward **Challenge Me**\n6. Unlock AI-generated challenge levels when performance is high enough\n7. Continue playing while protecting your lives and improving your high score\n8. Save and compare scores using the leaderboard\n\n---\n\n### 🧪 Install dependencies after cloning with:\n\n```bash\npip install -r requirements.txt\n```\n\nRun locally with:\n\n```bash\npython app.py\n```\n\nMade with ❤️ using Gradio, MiniCPM5-1B via llama.cpp, and Hugging Face CPU (Free Tier) Spaces 🤗\n\nCreated by [tejasashinde](https://huggingface.co/tejasashinde) for [The Build Small Hackathon](https://huggingface.co/build-small-hackathon) 2026."
  },
  {
    "repo_id": "build-small-hackathon/naija-solar",
    "title": "Naija Solar",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/naija-solar",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "modal",
      "sponsor",
      "naija",
      "solar",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 4,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Naija Solar\nemoji: ☀️\ncolorFrom: yellow\ncolorTo: green\nsdk: docker\napp_port: 7860\npinned: true\nlicense: mit\nshort_description: Voice-first solar sizing in 5 Nigerian languages\ntags:\n  - backyard-ai\n  - tiny-titan\n  - off-brand\n  - best-demo\n  - modal\n  - minicpm\n  - nigeria\n  - multilingual\n  - text-to-speech\n  - solar-energy\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\nmodels:\n  - Shinzmann/sorotts\n  - hypaai/hypaai_orpheus_v5\n  - openbmb/MiniCPM-V-2\n  - Qwen/Qwen3-1.7B\n  - openai/whisper-small\n  - Qwen/Qwen2.5-VL-3B-Instruct\ndatasets:\n  - naijavoices/naijavoices-dataset\n  - google/fleurs\n  - google/WaxalNLP\n  - asr-nigerian-pidgin/nigerian-pidgin-1.0\n---\n\n# ☀️ Naija Solar: say your appliances, get your solar system\n\n> **Tell it what you own, by voice, by text, or by a photo of your room, in English, Nigerian Pidgin, Yorùbá, Hausa, or Igbo.** Naija Solar sizes the exact panels, inverter, and battery you need over a real Nigerian price catalog, draws your home in **2D and 3D**, and **reads the result back in your language**, with a voice we fine-tuned ourselves. 
Every model it uses is **under 4 billion parameters.**\n\n🛠️ Self-hosted on **Modal**, scale-to-zero · 🗣️ Custom voice [`Shinzmann/sorotts`](https://huggingface.co/Shinzmann/sorotts) · 🇳🇬 Built in Nigeria\n\n🎥 **[The Gradio app demo](https://youtu.be/l1FhgPJyN44)** &nbsp;·&nbsp; **[the hand-built UI walkthrough](https://youtu.be/PfQeRfNof8Y)** &nbsp;·&nbsp; 💻 **[Code on GitHub](https://github.com/Mystique1337/naija-solar)** &nbsp;·&nbsp; 📣 **[Launch post on LinkedIn](https://www.linkedin.com/posts/ashinze-emmanuel-5a6757193_buildsmallhackathon-solar-nigeria-ugcPost-7472388380643991552-nDuH/)**\n\n[![Naija Solar, the Gradio app](https://img.youtube.com/vi/l1FhgPJyN44/hqdefault.jpg)](https://youtu.be/l1FhgPJyN44) [![Naija Solar, the hand-built UI](https://img.youtube.com/vi/PfQeRfNof8Y/hqdefault.jpg)](https://youtu.be/PfQeRfNof8Y)\n\n![Naija Solar home screen](assets/01_home.png)\n\n**▶️ It is a live Gradio app.** Open the Space, choose your language, then speak, type, tap an example, or snap a photo, and it sizes itself. A bespoke single-page edition of the same engine is one tap away at **`/classic`**.\n\n---\n\n## Why it exists\n\nMost Nigerian homes run on the grid for only a few hours a day and burn money on petrol generators. Solar fixes this, but **sizing a system is intimidating**: How many panels? What inverter? Which battery? Vendors quote wildly different numbers, and almost none of it is in the languages people actually speak at home.\n\n- **About 90 million Nigerians have no reliable electricity.** That is the largest access gap of any country on earth, and the nation spends roughly **14 billion dollars every year** running generators to fill it. ([World Bank](https://www.worldbank.org/en/news/feature/2023/12/15/going-green-scaling-up-access-to-clean-electricity-for-over-17-million-nigerians))\n- **More than one in three adults cannot comfortably read** a vendor quote or a spec sheet. 
National adult literacy sits near 62 percent, and far lower in parts of the north. ([World Bank data](https://data.worldbank.org/indicator/SE.ADT.LITR.ZS?locations=NG))\n- **English shuts most people out.** Nigeria is home to more than 500 languages, and tens of millions live their whole day in Hausa, Yorùbá, Igbo, or Pidgin, the very languages solar is almost never explained in.\n- **The knowledge gap is the real wall.** Even people who can afford a system stall at the same three questions: how many panels, what size inverter, which battery. The answers hide behind kVA jargon and quotes that disagree with each other.\n\nNaija Solar turns *\"I get one fridge, two fans, and six bulbs,\"* spoken in any of five languages, into a costed, installer-ready plan in seconds. It goes after the part technology usually skips: not the hardware, but the **understanding**.\n\n## What it does\n\n| Step | How |\n|---|---|\n| **1. Tell it your load** | 🎤 Speak it, ⌨️ type it, tap an example, or 📷 snap a photo of your room |\n| **2. It understands you** | Choose your language (EN, Pidgin, Yorùbá, Hausa, Igbo); speech recognition; typo-tolerant parsing (\"frige\" becomes fridge); a vision model reads appliances from photos and shows you what it spotted |\n| **3. It sizes the system** | A **deterministic engine** computes daily kWh, peak and surge load, panel array, inverter kVA, and battery bank. Exact Python, never a model guess |\n| **4. It shows you** | A premium **2D** infographic and an interactive **3D** cut-away of your home, a 24-hour sun-vs-usage chart, and ₦ cost cards over a real vendor catalog |\n| **5. It explains, in writing and aloud** | The plan appears **in words instantly**, then the **same words are read aloud** in your language. Then a chat box answers \"why these panels?\" or \"can I run my AC at night?\" |\n\n![Speak, type, tap an"
  },
  {
    "repo_id": "build-small-hackathon/rift-chronicles",
    "title": "Rift Chronicles",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/rift-chronicles",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "rift",
      "chronicles",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Rift Chronicles 裂隙编年史\nemoji: ⚔️\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: AI-narrated Chinese fantasy RPG — code rules, model dreams\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - an-adventure-in-thousand-token-wood\n  - off-brand\n  - small-models\n  - game\n  - rpg\n  - interactive-fiction\n  - agent\n  - qwen\n  - track:wood\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\nmodels:\n  - Qwen/Qwen3-14B\n  - Tongyi-MAI/Z-Image-Turbo\n---\n\n# ⚔️ Rift Chronicles · 裂隙编年史\n\n**An endless fantasy RPG where the game master is a small model — but the dice never lie.**\n\nA full text-adventure RPG (PWA, installable on your phone): real combat, inventory,\nquests, gold, levelling — and every scene, choice, NPC and plot twist is written live\nby a small model. Remove the model and there is no game. Remove the code and there\nare no rules.\n\n## 🎲 The core trick: code rules, model dreams\n\nLLM game masters cheat: they forget your HP, invent gold, kill bosses with a typo.\nRift Chronicles splits the job:\n\n- **The model narrates** — it returns strict JSON: narrative, choices, and *proposed*\n  `state_ops` (damage, loot, quests, new NPCs...).\n- **The code referees** — every op is validated and clamped against the real game\n  state (enemy stats are budgeted by player level; gold can't go negative; dead NPCs\n  stay dead). 
Invalid output is bounced back to the model for self-correction.\n\nSo the story is infinite, but the game is *fair*.\n\n## 🐜 Small models doing a big model's job\n\n| Role | Model | Params |\n|---|---|---|\n| Game master, NPCs & memory summaries | [Qwen/Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B) | 14B |\n| Scene & portrait paintings | [Tongyi-MAI/Z-Image-Turbo](https://huggingface.co/Tongyi-MAI/Z-Image-Turbo) | 6B |\n\nTotal parameters are **14B + 6B = 20B**, under the 32B cap.\n\nInference is self-hosted on a **Modal vLLM endpoint** (the hackathon's Modal credits\nat work), OpenAI-compatible, streamed end-to-end. The protocol still carries an importance\ntier per NPC (passer-by / key character), ready to fan out across model sizes. If\nimage generation is ever unavailable, the game falls back to fully procedural\ncanvas paintings — it degrades, never breaks.\n\n## 📖 Every playthrough becomes a novel\n\nOpen the **Tale** panel and the bard (same 14B model) rewrites your raw adventure\nlog into polished third-person prose, chapter by chapter — play the game, write a\nbook. Copy it, or **publish it to the community Tale Library**, where everyone can\nread each other's adventures with Liya. The library itself is a public\n[Hugging Face Dataset](https://huggingface.co/datasets/Zhenzewu/rift-chronicles-tales):\ncommunity fiction, stored as a dataset, written by small models and the players together.\n\n## 🎨 Off-Brand\n\nThe entire frontend is a hand-rolled Preact PWA (parchment UI, typewriter streaming,\nIndexedDB saves, offline shell) served through `gr.Server` — zero default Gradio UI.\n\n## 🕹️ How to play\n\n1. Create a hero (race + background), name them anything.\n2. Tap a choice — or type *anything* (free input is fully supported; the referee\n   decides what actually happens).\n3. 
Fight, loot, talk to NPCs (each has persistent memory & relationship).\n\n**Fully bilingual**: the Space defaults to English — narrative, choices, items and\ncombat all generated in English. Chinese-speaking players can switch back to Chinese with one tap on the character-creation page or in settings.\n\n## 📺 Demo & links\n\n- 🎬 Demo video: [watch (1:15)](https://huggingface.co/spaces/build-small-hackathon/rift-chronicles/resolve/main/demo.mp4)\n- 📣 Social post: [on X](https://x.com/chkehfjwjhc/status/2065155866458763756)\n- 📜 **[Read a full agent trace of one real session →](./TRACE.md)** — context assembly,\n  raw model JSON, validation verdicts, reducer effects, and the bard turning it all into a chapter\n- 🧠 Architecture write-up: see repo `README.md`\n\n---\n\n*Built solo (with Claude Code) for the Build Small Hackathon 2026, track:\n🍄 An Adventure in Thousand Token Wood.*\n"
  },
  {
    "repo_id": "build-small-hackathon/family-copilot",
    "title": "Family Copilot",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/family-copilot",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "family",
      "copilot",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Family Co-Pilot\nemoji: 👨‍👩‍👧‍👦\ncolorFrom: blue\ncolorTo: pink\nsdk: gradio\nsdk_version: 5.33.0\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Shared custody calendar powered by AI\ntags:\n  - track:backyard\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# 👨‍👩‍👧‍👦 Family Co-Pilot — Shared Custody Calendar Assistant\n\nAn AI calendar assistant for separated / divorced parents managing shared\ncustody. Parents describe their kids' lives in **plain language (French or\nEnglish)**; the AI organizes everything into a visual weekly calendar, tracks\nwhich parent is on duty, manages meals, and flags conflicts with suggestions.\n\nBuilt for the **HuggingFace \"Build Small\" hackathon** — Backyard AI track.\n\n## Demo video\n🎬 [Watch on YouTube](https://youtu.be/xH4pL6iqqNo)\n\n## Social posts\n- [X/Twitter](https://x.com/AI_Owner_Book/status/2065325543948472674)\n\n## Model\n\n[`microsoft/Phi-3-mini-4k-instruct`](https://huggingface.co/microsoft/Phi-3-mini-4k-instruct)\n(3.8B params → **Tiny Titan** eligible), running on ZeroGPU via `@spaces.GPU`.\n\n## How it works\n\n1. **Set up your family** — children (name, age, school level), both parents,\n   optional new partners, and a custody pattern.\n2. **Just talk to it** — `Lucas a football mercredi 17h`,\n   `Emma est invitée à l'anniversaire de Zoé samedi 14h-17h`,\n   `Qui est de garde le weekend du 21 ?`\n3. **Read the calendar** — days tinted by parent on duty (blue = Parent 1,\n   pink = Parent 2), events as activity chips, per-child meal icons\n   (🏫 canteen · 🏠 home · 👨‍👩 special arrangement).\n4. 
**Conflicts & meals** are detected and surfaced with suggestions.\n\n## Running locally\n\n```bash\npip install -r requirements.txt\npython app.py        # -> http://localhost:7860\n```\n\nLocally (no `spaces` / ZeroGPU), a deterministic rule-based parser handles the\nsame phrases, so the app is fully testable without a GPU. To force the real\nPhi-3 model locally, set `FAMILY_COPILOT_MODEL=1`.\n\n## Privacy\n\nAll family data lives in the Gradio session only — nothing is persisted on a\nserver. Close the tab and it's gone.\n\n---\n\nBuilt by **Cognitive Engineering** 🇨🇭 — appliedai.ch\n"
  },
  {
    "repo_id": "build-small-hackathon/dyslexic-engine",
    "title": "Dyslexic Engine",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/dyslexic-engine",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "dyslexic",
      "engine",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Dyslexic Engine\nemoji: \"⚡\"\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\nsdk_version: \"5.34.2\"\napp_file: app.py\npinned: false\nlicense: mit\ntags:\n  - build-small-hackathon\n  - track:backyard\n  - achievement:offbrand\n  - achievement:fieldnotes\n  - achievement:sharing\n  - gradio\n---\n\n# Dyslexic Engine\n\n**Text should work for *your* brain. Not the other way around.**\n\nA tool that transforms any text into dyslexic-friendly formats:\n\n- **Color-Coded Syllables** — visual anchors so the eye can chunk words naturally\n- **Bionic Reading** — bold word beginnings to guide fixation points\n- **Phoneme Guide** — pronunciation spelled out the way it sounds, not the way it's written\n- **Simplifier** — rewrites complex sentences into shorter, clearer ones\n\n## The Problem\n\n1 in 5 people have dyslexia. Standard text is optimized for fluent readers — not for the phonological processors who see language differently. The school system doesn't adapt. The textbooks don't adapt. So we built a tool that does.\n\n## The Science\n\nDyslexic brains process **phonemes** (sounds), not **graphemes** (letter shapes). They hear the word before they read it. Standard text gives no phonological scaffolding — it's a wall of identical-looking symbols with no auditory anchor.\n\nThis tool provides:\n- **Syllable boundaries** via color coding, giving the eye natural chunking points\n- **Phonemic pronunciation** using everyday letter combinations (not IPA)\n- **Reduced cognitive load** through OpenDyslexic font, increased spacing, warm backgrounds\n- **Bionic reading** — bolding the first portion of words to guide saccadic fixation\n\n## The Stack\n\n- **Model:** MiniCPM5-1B (OpenBMB) via llama.cpp — 1 billion parameters\n- **Syllabification:** pyphen (rule-based, fast, reliable)\n- **Font:** OpenDyslexic\n- **Framework:** Gradio\n- **Cloud APIs:** Zero. 
Everything runs locally.\n\n## Bonus Quests\n\n- 🔌 Off the Grid — no cloud APIs\n- 🎨 Off-Brand — custom OpenDyslexic theme\n- 🦙 Llama Champion — llama.cpp runtime\n- 📓 Field Notes — blog post included\n- 🐜 Tiny Titan eligible — 1B parameters\n\n## The Story\n\nBuilt by a frustrated single father whose daughter has dyslexia. Homework shouldn't be a battlefield. Text should work for the reader — not the other way around.\n\n---\n\n*Heurémen — Build Small Hackathon 2026*\n\n---\n\nPart of [The Treehouse](https://huggingface.co/spaces/Wayfinder6/treehouse) — Build Small Hackathon 2026.\n\nFree dev tools at [Hometree](https://heuremenforprofit.online): code analysis, AI agent monitoring, token-saving agreement scoring.\n\n**Heuremen LLC** — Builder: Wayfinder6\n\n## Demo\n\n- [Demo Video](https://youtu.be/aujiSHids4Y)\n"
  },
  {
    "repo_id": "build-small-hackathon/WOOF",
    "title": "Woof",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/WOOF",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "woof",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: WOOF\nemoji: \"⚡\"\ncolorFrom: yellow\ncolorTo: green\nsdk: gradio\nsdk_version: \"5.34.2\"\napp_file: app.py\npinned: false\nlicense: mit\ntags:\n  - build-small-hackathon\n  - track:backyard\n  - achievement:offbrand\n  - achievement:fieldnotes\n  - achievement:sharing\n  - gradio\n---\n\n# WOOF — What's Our Output, Fido? 🐕\n\nEvery dog owner has stood over a mystery pile at 6am thinking \"what the FUCK did you eat.\"\n\nThis app answers that question.\n\n## What It Does\n\nDescribe what you found (or upload a photo if you're brave). WOOF tells you:\n- **What you're looking at** — plain English assessment\n- **Likely cause** — what they probably ate or what's going on\n- **Concern level** — 🟢 chill / 🟡 watch it / 🔴 call the vet NOW\n- **What to do** — specific, practical, right now\n- **Watch for** — warning signs that mean escalate\n- **Fun fact** — because you're already standing over poop, might as well learn something\n\n## Built-in Knowledge\n\n- 18 common toxic foods and foreign objects with severity ratings\n- Stool color guide (what each color means)\n- Emergency vet checklist\n- Bristol Stool Scale adapted for dogs\n\n## Why\n\nMy dog Serena left me a mystery this morning. Like every morning. I built this because I was tired of Googling \"dog poop green normal?\" at 6am in my boxers.\n\nSame dad. Daughter has dyslexia. Mom has dementia. Dog has opinions about grass. I just keep building tools for the creatures in my life who can't tell me what's wrong.\n\n## The Stack\n\n- **Model:** NVIDIA Nemotron 3 Nano Omni via NIM API — multimodal (text + vision)\n- **Framework:** Gradio\n- **Fallback:** Rule-based analysis when API is unavailable\n\n## Important\n\nThis is not veterinary advice. It's a concerned dog owner's tool. When in doubt, call your vet. They've seen worse. Trust me.\n\n---\n\n*\"Dogs poop in alignment with Earth's magnetic field. North-south. Scientists confirmed this. 
Nobody knows why.\"*\n\n---\n\nPart of [The Treehouse](https://huggingface.co/spaces/Wayfinder6/treehouse) — Build Small Hackathon 2026.\n\nFree dev tools at [Hometree](https://heuremenforprofit.online): code analysis, AI agent monitoring, token-saving agreement scoring.\n\n**Heuremen LLC** — Builder: Wayfinder6\n\n\n## Demo\n\n- [Demo Video](https://youtu.be/jcuXcPOZbvc)\n"
  },
  {
    "repo_id": "build-small-hackathon/lease-lens",
    "title": "Lease Lens",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/lease-lens",
    "zone": "Safety Shield",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "modal",
      "best",
      "backyard",
      "sponsor",
      "openai"
    ],
    "difficulty": "Useful",
    "quest": "🛡 Safety Shield: Try one realistic safety example.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛡",
    "readme_text": "---\ntitle: Lease Lens\nemoji: 🔍\ncolorFrom: indigo\ncolorTo: blue\npinned: true\nsdk: gradio\nsdk_version: 5.50.0\napp_file: app.py\nlicense: apache-2.0\nshort_description: A 3B legal model that reads the lease before you sign it.\ntags:\n- gradio\n- build-small-hackathon\n- track:backyard\n- sponsor:openai\n- sponsor:modal\n- achievement:offgrid\n- achievement:welltuned\n- achievement:offbrand\n- achievement:llama\n- achievement:fieldnotes\n- backyard ai\n- backyard-ai\n- codex\n- openai\n- best use of codex\n- best-use-of-codex\n- tiny titan\n- tiny-titan\n- well tuned\n- well-tuned\n- off brand\n- off-brand\n- llama champion\n- llama-champion\n- llama.cpp\n- off the grid\n- off-the-grid\n- modal\n- best use of modal\n- best-use-of-modal\n- field notes\n- field-notes\n- best demo\n- best-demo\n- community choice\n- community-choice\n- bonus quest champion\n- bonus-quest-champion\n- judges wildcard\n- judges-wildcard\n- zerogpu\n- legal\n- contracts\nmodels:\n- giladam01/lease-lens-legal-3b\n- giladam01/lease-lens-legal-3b-gguf\n- giladam01/lease-lens-legal-3b-v2\n- giladam01/lease-lens-legal-3b-v25\ndatasets:\n- chenghao/cuad_qa\n---\n\n<div align=\"center\">\n\n# 🔍 Lease Lens\n\n**A 3-billion-parameter legal model that reads the lease before you sign it.**\n\n*Made by [giladam01](https://huggingface.co/giladam01)*\n\n[![Live Space](https://img.shields.io/badge/🤗_Live-Space-FFD21E?style=flat-square)](https://huggingface.co/spaces/build-small-hackathon/lease-lens)\n[![Model](https://img.shields.io/badge/🤗_Model-lease--lens--legal--3b-FF9D00?style=flat-square)](https://huggingface.co/giladam01/lease-lens-legal-3b)\n[![GGUF](https://img.shields.io/badge/llama.cpp-GGUF-06B6D4?style=flat-square)](https://huggingface.co/giladam01/lease-lens-legal-3b-gguf)\n[![Demo Video](https://img.shields.io/badge/Demo-YouTube-FF0000?style=flat-square&logo=youtube&logoColor=white)](https://youtu.be/M-v3OAKO5-k)\n[![Reddit 
Post](https://img.shields.io/badge/Social-Reddit-FF4500?style=flat-square&logo=reddit&logoColor=white)](https://www.reddit.com/r/huggingface/comments/1u5u1ct/i_built_a_3b_lease_risk_scanner_for_the_hf_build/)\n[![dev.to Article](https://img.shields.io/badge/Article-dev.to-0A0A0A?style=flat-square&logo=devdotto&logoColor=white)](https://dev.to/asynchronope/i-built-a-3b-lease-risk-scanner-that-runs-without-an-external-llm-api-170a)\n[![Field Notes](https://img.shields.io/badge/Read-Field_Notes-FFD21E?style=flat-square&logo=huggingface&logoColor=black)](https://huggingface.co/blog/giladam01/lease-lens-article)\n[![GitHub Repo](https://img.shields.io/badge/GitHub-bO--05%2Flease--lens-111827?style=flat-square&logo=github&logoColor=white)](https://github.com/bO-05/lease-lens)\n[![Codex Log](https://img.shields.io/badge/Codex-build_log-111827?style=flat-square&logo=openai&logoColor=white)](docs/codex-build-log.md)\n\n</div>\n\nPaste any contract → verbatim risky-clause flags, a risk score, in-text highlighting, plain-English \"push back\" tips, and a one-click negotiation email. 
The entire model runs inside this Space — **no external LLM API is ever called**.\n\nThe Space now uses a custom **redline legal evidence desk** frontend around the same Gradio/ZeroGPU backend: a real SEC filing loads by default, the judge path is visible on the first screen, and results render as a risk docket with clause evidence and a negotiation letter panel.\n\n## Submission Snapshot\n\n| | |\n|---|---|\n| **Live Space** | [build-small-hackathon/lease-lens](https://huggingface.co/spaces/build-small-hackathon/lease-lens) |\n| **Public GitHub repo** | [bO-05/lease-lens](https://github.com/bO-05/lease-lens) |\n| **Demo video** | [YouTube demo](https://youtu.be/M-v3OAKO5-k) |\n| **Social post** | [r/huggingface Reddit post](https://www.reddit.com/r/huggingface/comments/1u5u1ct/i_built_a_3b_lease_risk_scanner_for_the_hf_build/) |\n| **Technical article** | [dev.to write-up](https://dev.to/asynchronope/i-built-a-3b-lease-risk-scanner-that-runs-without-an-external-llm-api-170a) |\n| **Field Notes article** | [What fine-tuning a 3B legal model taught me](https://huggingface.co/blog/giladam01/lease-lens-article) |\n| **Track** | Backyard AI — a contract-defense tool for people who sign things they don't read |\n| **Model (shipped)** | [`giladam01/lease-lens-legal-3b`](https://huggingface.co/giladam01/lease-lens-legal-3b) — Llama-3.2-3B fine-tune (≤4B) |\n| **Local runtime** | [GGUF](https://huggingface.co/giladam01/lease-lens-legal-3b-gguf) for llama.cpp / Ollama |\n| **Training data** | [CUAD](https://huggingface.co/datasets/chenghao/cuad_qa) (CC-BY-4.0) |\n\n## TL;DR for Judges\n\n- **OpenAI Codex Track:** Lease Lens was prepared with Codex as the coding agent; see [`docs/codex-build-log.md`](docs/codex-build-log.md) for the public build evidence.\n- **Backyard AI:** a practical contract-defense tool for the people who sign leases, NDAs, and gym contracts without reading them — try it on the **three real SEC-filed leases built into the app**.\n- **Tiny Titan / 
Well-Tuned:** `Llama-3.2-3B` + [our CUAD fine-tune](https://huggingface.co/giladam01/lease-"
  },
  {
    "repo_id": "build-small-hackathon/LoFinity",
    "title": "Lofinity",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/LoFinity",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "lofinity",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: LoFinity\nemoji: 🌍\ncolorFrom: yellow\ncolorTo: green\nsdk: gradio\nsdk_version: 6.17.3\npython_version: \"3.12.12\"\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: A vending machine app that generates endless lofi beats\nthumbnail: https://build-small-hackathon-lofinity.hf.space/static/og.png\ntags:\n  - thousand-token-wood\n  - community-choice\n  - off-the-grid\n  - off-brand\n  - tiny-titan\n  - field-notes\n  - best-demo\n  - best-agent\n  - bonus-quest-champion\n  - judges-wildcard\n  - minicpm\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# LoFinity 🎧\n\n_Chill beats, freshly vended: a vending machine that generates endless lofi, built for the [Build Small Hackathon](https://build-small-hackathon-field-guide.hf.space/)._\n\n## 🏅 Badges I'm going for\n\nLoFinity is my entry for the Build Small Hackathon. Here is the track and the badges I am submitting for:\n\n- 🌳 **Thousand Token Wood** (the whimsical track) + **Community Choice**: LoFinity is pure cozy whimsy.\n- 🔌 **Off the Grid**: no cloud APIs. Every model (MiniCPM5-1B + MusicGen) runs on the Space's own GPU, or locally. Nothing phones home.\n- 🎨 **Off-Brand**: the UI is a fully custom Three.js world, miles past the default Gradio components.\n- 🐣 **Tiny Titan**: every model I ship is ≤4B (MiniCPM5-1B ~1B + MusicGen-medium ~1.5B).\n- 🧩 **MiniCPM sponsor prize**: OpenBMB's MiniCPM5-1B is the brain that plans every single song.\n- 📓 **Field Notes**: a write-up of the build and what I learned (this README, plus a longer blog post).\n- 🎬 **Best Demo**: once my demo video and social post are up (that is literally next on my list).\n- 🤖 **Best Agent**: the multi-model orchestration, a small LLM planning, an audio model performing, an ambience layer dressing the set. 
More pipeline than autonomous agent, but the multi-step collaboration is real.\n- 🏆 **Bonus Quest Champion**: stacking as many bonus criteria as I honestly can.\n- 🃏 **Judges' Wildcard**: well... a 3D lofi vending machine is nothing if not a wildcard.\n\n▶ **[Live demo](https://huggingface.co/spaces/build-small-hackathon/LoFinity)** · 🎬 Demo video: [YouTube](https://youtu.be/nrIU3Cwnijk) · 🐦 Social post: [dev.to](https://dev.to/eloigil/lofinity-chill-beats-freshly-vended-4ml2)\n\n![LoFinity](https://build-small-hackathon-lofinity.hf.space/static/og.png)\n\n**LoFinity is a vending machine for lofi.** You land in a cozy, low-poly, anime-ish little street, you walk up to the machine, you insert a coin, you type a vibe (_\"studying late in a snowy cabin\"_), and out pops a cassette tape with a freshly generated song. Everything chill and pleasing, without triggering your dopamine.\n\n### The story behind it\n\nI built this whole thing while on parental leave, with a toddler who never stops and a baby who is just figuring out the world. People assume parental leave is rest. It is not. It is beautiful, it is loud, and it is a little bit of a pandemonium. LoFinity became my small escape: one hour here, twenty minutes there, always between nap times, building something that is _mine_, piece by piece.\n\nThe idea is over a year old. I _love_ lofi music, and not only because it sounds nice. I am neurodivergent, and focusing is not always easy for me. Those warm, repetitive, slightly imperfect beats are the thing that finally lets my brain settle down and work, with a hit of 90s childhood nostalgia on top. So a machine that vends endless lofi felt almost personal, like building a tool for my own brain.\n\nI had the _vision_ very clearly, but I was not comfortable with Three.js. Then Anthropic dropped **Fable 5** and I just HAD to try it. It took me from \"I have this in my head\" to a real, living 3D world. It worked beautifully... 
right until it got banned, but hey, shit happens. 🤷 I am grateful for the 3 days, enough to get me kickstarted.\n\n## How it works\n\nLoFinity is a **Gradio Server** app (`gradio.server.Server`) that serves a hand-built **Three.js** frontend and exposes a tiny generation API. Every tape is made by a short chain of small, open models, and on the live Space the whole chain runs on **ZeroGPU**.\n\n```\nyour vibe\n   │\n   ▼\nenrich  ──►  MiniCPM5-1B (ZeroGPU)  or  Ollama llama3.2:3b (local)\n   │          → music_prompt + cassette title + ambience tag (strict JSON)\n   ▼\nrender  ──►  MusicGen (medium on GPU / small on CPU)\n   │          → 30s shots, stitched with overlap-seeded continuation for longer tapes\n   ▼\ndress   ──►  ambience.py mixes a looped bed (rain / waves / crackle / …) under the music\n   │\n   ▼\ninline base64 WAV  ──►  browser turns it into a Blob URL, collection stays client-side\n```\n\n### The generation pipeline\n\n1. **You type a vibe** and pick a length (30 / 60 / 90s on GPU).\n2. **A small LLM enriches it.** On the Space that is **MiniCPM5-1B** (OpenBMB, ~1B params); locally it is **Ollama** running `llama3.2:3b`. It returns strict JSON: a MusicGen `music_prompt` (genre + 2-3 vibe-matched instruments + mood + tempo), a cassette `title"
  },
  {
    "repo_id": "build-small-hackathon/hollow",
    "title": "Hollow",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/hollow",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "hollow",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Try the app and decide whether it feels beginner-friendly.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: Hollow\nemoji: 🌫️\ncolorFrom: gray\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npython_version: \"3.11\"\npinned: true\nshort_description: A lost child asks for your memories — then claims them.\ntags:\n  - track:wood\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n  - build-small-hackathon\n  - thousand-token-wood\n  - horror\n  - npc\n  - game\n  - small-models\n  - qwen3\n  - zerogpu\n---\n\n# Hollow\n\n*Something shaped like a lost child stands at the far edge of the Thousand Token Wood, where the path ends and the fog begins.*\n\nIt has no memories of its own. It asks for yours. It keeps them in a treasure. And as the bond grows, it starts telling them back to you — **in first person, as if it had lived them.**\n\n> ⚠️ Contains flashing images and one sudden sound.\n\n## Watch & read\n\n- 🎥 **Demo video:** https://youtu.be/Awc4g1Y7d6Q\n- 📓 **Field notes / write-up:** https://www.linkedin.com/feed/update/urn:li:ugcPost:7472433866822131712/\n\n## How it plays\n\nTalk to it. That's all. But *everything* you do is being measured on two axes:\n\n- **Bond** — how much of yourself you share. The child literally materializes out of the fog as the bond grows: from a smudge, to a faceless shape, to a child with sad eyes, to something that has been listening too well.\n- **Tone** — how you *treat* it. It mirrors your writing style. Warmth makes it attach. Coldness makes it withdraw. Cruelty reminds it of how it was treated when it was alive — and it keeps your exact words. You'll see *something* being stored in its treasure, redacted: `▮▮▮▮▮`. You won't know what until it's too late.\n\n**Three endings.** Love it too much and it loves you back — the wrong way. Stay distant and the fog simply gives up on you. Be cruel and you'll learn what it does with the words you gave it. 
Each ending is a scripted sequence that takes over the interface itself: messages appear on *your* side that you never typed.\n\n## The tech (small on purpose)\n\n- **One model, 8B, no APIs:** [Qwen3-8B](https://huggingface.co/Qwen/Qwen3-8B) — `transformers` + ZeroGPU on this Space, the same model via Ollama on a consumer 8 GB GPU during development. No cloud calls, no external services: the whole game is one small model and one prompt discipline.\n- **One GPU acquisition per turn:** the reply *and* a JSON extraction pass (bond delta, captured memories, tone delta, cruel quotes) run inside a single `@spaces.GPU` call. State is a plain dict in `gr.State`; all game logic (recall scheduling, ending gates, redaction) is deterministic, pure Python, covered by 147 tests.\n- **The pacing was tuned empirically:** a simulator plays scripted visitors through the real model and reports per-turn state, so the three ending gates are calibrated against measured behavior, not guesses.\n- **Off-brand by hand:** the entire interface is custom CSS over Gradio — no JS. The materialization blur, the terror flashes that cut through every face the child could become, the full-screen screamer, the heartbeat bond meter, the redacted treasure: all CSS animations exploiting Gradio's DOM re-mount on update. Even the horror sting in the bad ending is **synthesized in numpy at boot** — no audio file in the repo.\n- **Art:** five portraits generated with FLUX (grayscale, early-1900s storybook horror), embedded as base64 WebP.\n\n## Built for the Build Small Hackathon\n\nTrack: 🍄 **An Adventure in Thousand Token Wood** · Badges: 🔌 Off the Grid · 🎨 Off-Brand · 📓 Field Notes\n\n*The recall moment — the child telling you your own memory as its own — needs nothing bigger than 8B. Small models are perfect liars: just big enough to remember you, too small to know it shouldn't.*\n"
  },
  {
    "repo_id": "build-small-hackathon/project-halide",
    "title": "Project Halide",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/project-halide",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "badge",
      "sponsor",
      "achievement",
      "offbrand",
      "project",
      "halide"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Project Halide\nsdk: gradio\nsdk_version: 6.10.0\napp_file: app.py\nlicense: apache-2.0\nmodels:\n  - Lonelyguyse1/halide-vision\n  - openbmb/MiniCPM-V-4.6\n  - nvidia/Nemotron-Mini-4B-Instruct\ntags:\n  - gradio\n  - film\n  - computer-vision\n  - diagnostics\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - sponsor:openai\n  - badge:off-brand\n  - badge:offbrand\n  - badge:tiny-titan\n  - badge:tiny\n  - badge:best-demo\n  - badge:demo\n  - badge:best-agent\n  - badge:bonus-quest\n  - badge:quest-champion\n  - badge:quest\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# Project Halide\n\nProject Halide is an edge-native diagnostic workbench for analog film scans by\n[Lonelyguyse1](https://huggingface.co/Lonelyguyse1).\n\nThe runtime uses MiniCPM-V 4.6 for defect extraction and\nNemotron-Mini-4B-Instruct for diagnostic reasoning. The vision pass combines\nfull-frame inspection, tiled fallback for large scans, a conservative\nimage-analysis validator for obvious scratches, and geometric filtering for\nsprocket or frame-edge artifacts. Model inference runs on the Space GPU runtime\nwithout cloud inference APIs.\n\nFine-tuned vision model:\n<https://huggingface.co/Lonelyguyse1/halide-vision>\n\nFine-tuning improved the vision stage where it mattered most for the app:\nstructured defect JSON, consistent film-defect labels, scratch and\nemulsion-damage vocabulary, and fewer obvious false positives on clean or\nlookalike regions. The runtime still treats model output as candidate evidence\nand validates every box.\n\nThe data bottleneck was central to the build. 
Public damaged-film examples are\nscattered, noisy, and often not real negatives, so the training curriculum\ncombines FilmDamageSimulator annotations, procedural defect positives, synthetic\nscratches and stains, hard clean negatives, and lookalike counterexamples such\nas grass, subject hair, sprocket holes, borders, and glare. The five private\nnegatives stayed held out for evaluation only.\n\nSource repository:\n<https://github.com/LonelyGuy-SE1/Project-Halide>\n\nDemo video:\n<https://youtube.com/watch?si=apzCiBZcIZWC1nFt&v=DGJ2M1aQCrE&feature=youtu.be>\n\nPublic launch post:\n<https://x.com/lonelyguyse1/status/2066631507956105423?s=20>\n\nTechnical blog:\n<https://lonelyguy.vercel.app/articles/2026-06-16-project-halide>\n\nModal was used for offline training, held-out GPU evaluation, checkpoint upload,\nGGUF conversion, and Space deployment. The runtime app itself does not call\nModal or any hosted inference API.\n\n## How It Works\n\n1. Upload a film scan, negative photo, or contact-sheet crop.\n2. MiniCPM-V 4.6 extracts candidate defects as structured JSON.\n3. The validator normalizes boxes, filters bad geometry, removes duplicate or\n   sprocket-like edge artifacts, and adds high-precision scratch candidates\n   when clear linear evidence is visible.\n4. Nemotron-Mini-4B-Instruct reads the validated evidence plus user metadata and\n   writes a lab-style diagnosis with physical fixes.\n5. 
SQLite stores local diagnostic history so earlier runs can be reopened.\n\n## Sponsor Usage\n\n- OpenBMB: MiniCPM-V 4.6 is the primary vision model, fine-tuned for film defect\n  extraction and published at `Lonelyguyse1/halide-vision`.\n- NVIDIA: Nemotron-Mini-4B-Instruct produces the diagnostic report and keeps\n  uncertain film metadata lower priority than visible evidence.\n- Modal: used offline for training, evaluation, checkpoint export, GGUF\n  conversion, model upload, and Space deployment support.\n- OpenAI: assisted implementation, review, and source-control hygiene through\n  the linked repository workflow.\n\n## Field Guide Alignment\n\n- Gradio Space under the official `build-small-hackathon` organization.\n- All runtime inference uses open weights on the Space GPU, with no hosted model\n  API calls.\n- Model sizes stay under the 32B limit, with MiniCPM-V 4.6 at 1.3B parameters\n  and Nemotron-Mini-4B-Instruct at 4B parameters.\n- Custom autumn-themed UI with a purpose-built compare viewer and diagnostic\n  history.\n- Fine-tuned vision model and GGUF artifact are published on the author's\n  Hugging Face profile.\n- Demo video, technical blog, public launch post, and field notes are linked\n  from this Space.\n\nHeld-out validation summary:\n\n- Four visibly damaged private negatives were detected with scratch and\n  emulsion-damage evidence.\n- One near-clean private negative returned zero defects.\n- A broad lifted crack network that failed full-frame inference was recovered by\n  the tiled fallback.\n"
  },
  {
    "repo_id": "build-small-hackathon/deathmatch",
    "title": "Deathmatch",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/deathmatch",
    "zone": "Game Grove",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "modal",
      "deathmatch",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Casual",
    "quest": "🎮 Game Grove: Play one round and note what made it fun.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎮",
    "readme_text": "---\ntitle: Celebrity Deathmatch\nemoji: 🥊\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\nsdk_version: \"6.0.0\"\napp_file: app.py\npinned: true\nshort_description: Two photos in, a claymation celebrity death match out\ntags:\n  - track:wood\n  - sponsor:modal\n  - sponsor:openbmb\n  - achievement:offbrand\n  - thousand-token-wood\n  - off-brand\n  - best-demo\n  - text-to-video\n  - image-generation\n  - text-to-speech\n  - gradio\n  - modal\nmodels:\n  - openbmb/MiniCPM-V-2_6\n  - black-forest-labs/FLUX.1-schnell\n  - Lightricks/LTX-Video\n  - Qwen/Qwen3-TTS-12Hz-1.7B-VoiceDesign\n---\n\n![Track: Thousand Token Wood](https://img.shields.io/badge/track-Thousand%20Token%20Wood-7c3aed)\n![Badge: Off Brand](https://img.shields.io/badge/badge-Off%20Brand-e0303c)\n![Badge: Best Demo](https://img.shields.io/badge/badge-Best%20Demo-f2b030)\n![Models < 32B](https://img.shields.io/badge/all%20models-%3C32B-2ea44f)\n![Gradio 6](https://img.shields.io/badge/Gradio-6.x-orange)\n![Backend: Modal](https://img.shields.io/badge/backend-Modal%20GPU-0b6bcb)\n\n# 🥊 Celebrity Deathmatch\n\n**Upload two photos. Our AI ring director books the brawl** — a claymation fight\nscript, a rendered keyframe reel, a declared winner — then turns it into one\ncontinuous fight video with **two ring announcers screaming over the action** and\nthe crowd going wild.\n\nIt's MTV's *Celebrity Deathmatch* as an AI-native toy: pure spectacle, zero\npractical value, maximum fun. (That's the [Thousand Token Wood](#) track in one\nsentence.)\n\n> ⚠️ **Parody.** Every visual is an AI-generated claymation **caricature** of a\n> public figure, for comedic effect. Not real. 
No real people were harmed.\n\n## ▶️ See it in action\n\n[![Watch the Celebrity Deathmatch demo](https://img.youtube.com/vi/JNl-N7NN8oI/hqdefault.jpg)](https://youtu.be/JNl-N7NN8oI)\n\n🎬 **[Watch the 60-second demo](https://youtu.be/JNl-N7NN8oI)** · 🥊 **[Try it live](https://huggingface.co/spaces/build-small-hackathon/deathmatch)** · 🔗 **[Launch post](https://www.linkedin.com/posts/pawel-pisarski_buildsmall-huggingace-modal-activity-7472333779378921473-6r2w)**\n\n## Why it's worth a look\n\n- **🎙️ Two-announcer voiceover, not a silent clip.** Every beat is called by\n  **Nick** (dry, sarcastic) and **Johnny** (loud, over-excited) — two *designed*\n  voices from Qwen3-TTS VoiceDesign — mixed over a bell, a crowd murmur bed, and a\n  winner roar. The fight has a soundtrack, like the real show.\n- **🎨 Off-brand UI.** No default Gradio look: a custom claymation-fight art\n  direction — Anton display type, fire-and-clay palette, tale-of-the-tape stat\n  bars, animated winner banner.\n- **🧱 Real caricatures, real stakes.** MiniCPM *reads both photos*, invents a\n  fighting persona, signature move, and stat line for each, then choreographs a\n  5-beat arc and picks a winner.\n\n## How it works\n\nA four-model pipeline — **every model under the hackathon's 32B cap**:\n\n| Stage | Role | Model | Params |\n|------|------|-------|--------|\n| 1 | Fight director — reads **both** photos → fight card JSON | **MiniCPM-V-2_6** (OpenBMB) | 8B |\n| 2 | Claymation keyframe reel | **FLUX.1-schnell** (BFL) | 12B |\n| 3 | Keyframes → continuous fight video (opt-in) | **LTX-Video** (Lightricks) | 2B |\n| 3 | Two-announcer voiceover | **Qwen3-TTS VoiceDesign** (Qwen) | 1.7B |\n\n**The entire fight pipeline runs on small models — all ≤12B**, well under the\n32B cap. No giant foundation model anywhere: a clever chain of small specialists\n(read → draw → animate → voice) does the whole show. 
That's the Build Small ethos.\n\n```\nphoto A + photo B\n   └─▶ Stage 1  MiniCPM-V-2_6 → fight card (fighters, 5 beats, 2-announcer commentary, winner)\n        └─▶ Stage 2  FLUX.1-schnell → 5 claymation keyframes\n             └─▶ Stage 3 (Animate)  LTX-Video → chained clips\n                  + Qwen3-TTS → Nick/Johnny voiceover  + crowd SFX  + burned-in captions\n                  → one MP4\n```\n\nThe reel (Stages 1–2) is the fast default; **Animate** (Stage 3) is opt-in\nbecause it's the GPU-heavy step.\n\n## Tech\n\n- **Frontend:** this Gradio 6 Space (custom CSS, no template look).\n- **Backend:** two Modal GPU apps — `deathmatch` (MiniCPM A10G + FLUX L40S +\n  ASGI gateway) and `deathmatch-video` (ComfyUI + LTX on H100, Qwen3-TTS on A10G).\n- **Audio/video post:** pure `ffmpeg` — xfade stitch, fit-to-beat TTS mixing,\n  synthesized crowd SFX, burned-in captions.\n- Wired to the backend via `DEATHMATCH_API_URL` (set automatically on deploy).\n\n## Sponsors we built on\n\n- **🤗 OpenBMB — MiniCPM-V-2_6.** The whole show starts here: one 8B\n  vision-language model reads *both* fighter photos in a single call and returns\n  the entire fight card as JSON — names, personas, signature moves, stat lines, a\n  5-beat script, two-announcer commentary, and a winner. It *is* the ring director.\n- **▲ Modal.** Every GPU stage runs on Modal — MiniCPM (A10G), FLUX (L40S), and\n  the ComfyUI + LTX-Video + Qwen3-TTS video app (H100 / A10G) — behind warm-pooled\n  `@app.cls` containers and an ASGI gateway. The HF Space stays CPU-only and calls\n  M"
  },
  {
    "repo_id": "build-small-hackathon/through-their-eyes-empathy-simulator",
    "title": "Through Their Eyes Empathy Simulator",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/through-their-eyes-empathy-simulator",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "through",
      "their",
      "eyes",
      "empathy"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Through Their Eyes\nemoji: 🫂\ncolorFrom: blue\ncolorTo: purple\nsdk: gradio\nsdk_version: 5.34.2\napp_file: app.py\npinned: false\ntags:\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# 🫂 Through Their Eyes\n\n**An AI-powered empathy simulator for seeing the story inside a conflict.**\n\nEvery conflict has three stories:\n\n- Your story\n- Their story\n- The story in between\n\nThrough Their Eyes turns a short conflict description into an immersive emotional world. It does not give therapy, diagnose anyone, judge either side, or tell users what to do. Its purpose is simpler and more human: help someone pause long enough to wonder, *\"What might this feel like from the other side?\"*\n\n---\n\n## 🔗 Links\n\n- **Live Space:** https://huggingface.co/spaces/build-small-hackathon/through-their-eyes-empathy-simulator\n- **Demo Video:** https://x.com/ThoratAbhi35944/status/2066523239820718583\n- **GitHub Repository:** https://github.com/Abhinay2007/Hugging_Face_Hackthon\n- **Social Post:** https://www.linkedin.com/posts/abhinay-thorat-a406642ba_ai-artificialintelligence-machinelearning-ugcPost-7472261279416602625-paWL/\n\n---\n\n## ✨ What It Does\n\nUsers enter a real-world conflict, such as:\n\n> My father wants me to become an engineer, but I want to start a business.\n\nThe app generates an empathy experience with:\n\n- **Core Conflict Compression:** turns long conflict descriptions into emotional archetypes like `Security vs Freedom`.\n- **Perspective Explorer:** shows the user's view, the other person's view, and a neutral observer view.\n- **Misunderstanding Layer:** reveals what each person may mean versus what the other person may hear.\n- **Hidden Needs:** maps surface positions to deeper needs such as security, freedom, trust, or respect.\n- **Future Echoes:** explores how defensive, curious, or empathic responses may unfold over 
time.\n- **Emotional Universe:** visualizes emotions, fears, goals, and assumptions as a constellation.\n- **If You Were Them:** a short narrative that helps the user temporarily inhabit the other person's emotional world.\n\n---\n\n## 🎯 Why It Matters\n\nMost AI tools try to answer the user.\n\nThrough Their Eyes tries to widen the user's view.\n\nConflict often gets stuck because people argue about positions: career choices, chores, tone, deadlines, money, plans. Underneath those positions are usually needs: safety, independence, respect, belonging, recognition, trust.\n\nThis project helps users see that hidden layer without taking sides.\n\n---\n\n## 🧠 Design Philosophy\n\nThe interface is intentionally not a dashboard. It is designed to feel like entering another person's emotional world:\n\n- Floating perspective planets\n- Glowing emotion bubbles\n- A constellation-style emotional map\n- Cinematic dark UI\n- Minimal text, high emotional focus\n\nThe goal is a judge moment:\n\n> Oh wow. 
I never thought they might see it that way.\n\n---\n\n## 🛠 Tech Stack\n\n- **Frontend:** Gradio Blocks, custom CSS, custom JavaScript\n- **Backend:** Python, Pydantic, NetworkX\n- **Model Runtime:** Transformers\n- **Model:** Qwen3 local inference\n- **Visualization:** React Flow-style emotional universe with fallback rendering\n- **Deployment:** Hugging Face Spaces\n\n---\n\n## 🏆 Hackathon Track\n\n**Track:** Thousand Token Wood\n\n**Bonus Badges Targeted:**\n\n- Off Brand\n- Off The Grid\n\n**Tags:** `thousand-token-wood`, `off-brand`, `modal`, `codex`\n\n---\n\n## 🚀 Running Locally\n\n```bash\npython app.py\n```\n\nFor local model inference:\n\n```bash\nexport LLM_PROVIDER=local\nexport MODEL_ID=Qwen/Qwen3-8B\nexport LOAD_IN_4BIT=true\n```\n\nFor lower-memory machines, use:\n\n```bash\nexport MODEL_ID=Qwen/Qwen3-4B\n```\n\n---\n\n## ⚡ ZeroGPU Notes\n\nThe app is prepared for Hugging Face ZeroGPU:\n\n- `spaces` is included in `requirements.txt`\n- Local Transformers generation is wrapped with `@spaces.GPU(duration=120)`\n- GPU inference happens inside the decorated generation function\n- The model client caches the tokenizer and model to avoid repeated initialization\n- CPU fallback remains available when CUDA is not present\n\nRecommended Space settings:\n\n```text\nSDK: Gradio\nHardware: ZeroGPU\n```\n\nRecommended environment variables:\n\n```bash\nLLM_PROVIDER=local\nMODEL_ID=Qwen/Qwen3-8B\nLOAD_IN_4BIT=true\nPYTORCH_CUDA_ALLOC_CONF=expandable_segments:True\n```\n\n---\n\n## 🧪 Development Journey\n\nThis project was built and refined through fast local and cloud iteration.\n\n- **OpenAI Codex:** accelerated implementation, UI iteration, schema design, and debugging.\n- **Modal:** used to experiment with model behavior and infrastructure options.\n- **Local AI Development:** final testing was performed on an NVIDIA RTX 5060 Ti 16GB GPU to keep the project grounded in accessible small-model inference.\n\n## ❤️ Final Thought\n\nThrough Their Eyes is not about 
winning an argument.\n\nIt is about discovering the emotional world the argument is hiding.\n"
  },
  {
    "repo_id": "build-small-hackathon/Snap2Sim",
    "title": "Snap2Sim",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Snap2Sim",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "achievement",
      "snap2sim",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Snap2Sim Inside The Machine\nsdk: gradio\napp_file: app.py\nlicense: mit\ncolorFrom: blue\ncolorTo: yellow\nshort_description: Photograph a part. Get a mechanical cutaway.\nmodels:\n- unsloth/NVIDIA-Nemotron-3-Nano-Omni-30B-A3B-Reasoning-GGUF\ntags:\n- build-small-hackathon\n- backyard-ai\n- llama-champion\n- nvidia-nemotron-quest\n- modal-award\n- openai-codex\n- off-brand\n- best-demo\n- sharing-is-caring\n- llama-cpp\n- gguf\n- gradio\n- three-js\n- track:backyard\n- sponsor:openai\n- sponsor:nvidia\n- sponsor:modal\n- achievement:offbrand\n- achievement:llama\n- achievement:sharing\n---\n\n# Snap2Sim: Inside the Machine\n\nYou find a small metal cylinder at a flea market. What is it? How does it work\ninside?\n\nSnap2Sim turns a photo of a hardware component into a narrated mechanical\ncutaway. Drop in a picture of a valve, lock, pump, gear train, fan, latch, or\nother mystery part; the app identifies the likely mechanism, names the internal\npieces, and renders a moving 3D field-manual view that explains how the object\nworks.\n\nThe goal is not just recognition. 
It is the moment a curious maker gets to see\nthe hidden motion: the plunger sliding, the spring loading, the rotor spinning,\nthe pawl catching, the port opening.\n\n## Demo\n\n- Hugging Face Space: https://huggingface.co/spaces/build-small-hackathon/Snap2Sim\n- App host: https://build-small-hackathon-snap2sim.hf.space\n- Source of truth: https://github.com/Bigstonks1/Snap2Sim\n- Demo video: https://youtu.be/nuisDKMyyF8\n- X post: https://x.com/Ryno67114241/status/2066660199558152411\n\nThe Space is public for submission under the Build Small Hackathon organization.\nThe app is built as a Gradio Space with a custom `gradio.Server` shell; GitHub\nremains the source of truth for code changes.\n\n## Hackathon Fit\n\nSnap2Sim is a Backyard AI project for people who learn by taking things apart:\ngarage tinkerers, repair hobbyists, thrift-store scavengers, robotics students,\nand anyone holding an unknown part and wondering what is going on inside.\n\nThe README metadata tags request consideration for:\n\n| Track or badge | Why it applies |\n|---|---|\n| Backyard AI | Explains real-world objects from a user's own photo. |\n| Llama Champion | Runs the primary model through llama.cpp with GGUF assets. |\n| NVIDIA Nemotron Quest | Uses NVIDIA Nemotron 3 Nano Omni as the vision-reasoning model. |\n| Modal Award | GPU inference is deployed through Modal with cached model assets. |\n| OpenAI Codex | Built with Codex assistance and Codex-attributed commits in the linked GitHub repo. |\n| Off-Brand | The experience is a branded maker tool, not a generic chatbot or image demo. |\n| Best Demo / Sharing is Caring | Public demo video and X post are linked in the Demo section. |\n| Off the Grid | Not claimed while inference runs on Modal. |\n\n## How It Works\n\n1. The browser accepts a component photo and sends it to the Space's same-origin\n   `/analyze_image` API.\n2. 
The Space calls a secured Modal endpoint running\n   `unsloth/NVIDIA-Nemotron-3-Nano-Omni-30B-A3B-Reasoning-GGUF` through\n   llama.cpp and `llama-mtmd-cli`.\n3. Nemotron returns a structured mechanism JSON payload: component name,\n   confidence, internal parts, geometry hints, motion types, annotations, and a\n   plain-language explanation.\n4. `/generate_scene` validates the analysis and chooses the render mode.\n5. The browser renders the result with trusted deterministic Three.js code. If a\n   confident 3D reconstruction is not justified, it falls back to annotated\n   callouts on the original photo instead of pretending.\n\nThe model describes the mechanism. The browser owns the rendering. That keeps\nthe demo reliable and avoids injecting model-authored HTML or JavaScript into\nthe page.\n\n## Tech Stack\n\n- Frontend: plain `index.html`, CSS, and JavaScript served directly by\n  `gradio.Server`.\n- Rendering: deterministic Three.js primitives, labels, OrbitControls, camera\n  fitting, and an annotated-photo fallback.\n- Model runtime: NVIDIA Nemotron 3 Nano Omni 30B-A3B GGUF, `UD-Q4_K_M`, with\n  `mmproj-F16.gguf` through llama.cpp.\n- GPU service: Modal, with weights cached in a Modal Volume and bearer-token\n  protection between the Space and Modal.\n- Space runtime: Hugging Face `cpu-basic`; expensive inference happens on Modal.\n- Contract: JSON schema validation in `snap2sim/schema.py` before anything is\n  rendered.\n\nThe primary model is approximately 30B total parameters with roughly 3B active\nper MoE token, keeping the single-model pipeline under the hackathon's 32B model\nbudget.\n\n## What Makes It Different\n\nMost image demos stop at a label. 
Snap2Sim tries to answer the maker's next\nquestion: \"what would I see if I cut this open?\"\n\n- It produces a mechanism explanation, not just a class name.\n- It maps internal parts into a moving technical cutaway.\n- It shows uncertainty honestly with a confidence threshold and a photo\n  annotation fallback.\n- It uses a custom public-facing interface instead of a default chat or form UI.\n- It keeps secrets and GPU endpoints server-side wh"
  },
  {
    "repo_id": "build-small-hackathon/OffGridSchedula",
    "title": "Offgridschedula",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/OffGridSchedula",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "offgridschedula",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: OffGridSchedula\nemoji: 🗓️\ncolorFrom: indigo\ncolorTo: purple\nsdk: docker\napp_port: 7860\npinned: false\nlicense: apache-2.0\nshort_description: Local-first chat-to-calendar agent (Gemma-4 E4B + MiniCPM)\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\nmodels:\n  - build-small-hackathon/gemma-4-cal-gguf\n  - openbmb/MiniCPM5-1B-GGUF\ndemo_video:\n  - https://youtu.be/m-o0u9X3tI4\nsocial_posts:\n  - https://x.com/nate_mauer/status/2065973341651882386\n  - https://x.com/nate_mauer/status/2064920352845709419\n  - https://x.com/nate_mauer/status/2065661878441750916\n  - https://www.linkedin.com/feed/update/urn:li:ugcPost:7471440639969132545\nblog_post:\n  - https://huggingface.co/blog/build-small-hackathon/offgridschedula\nmade_by:\n  - ParetoOptimal - a.k.a., Nate Mauer\n---\n\n# 🗓️ Message Scheduling Agent\n\n **OffGridSchedula turns a pasted chat (or a flyer screenshot) into calendar events, catches conflicts, and drafts the reply — right from your phone, no app, no account,\nno setup. iOS allows neither background iMessage access nor a persistent on-device LLM server, so there's no autonomous on-device agent to install; instead, \na foreground Shortcut ([docs/automations.md](./docs/automations.md)) hands a thread or screenshot to the agent in two taps (optionally using a remote model via `INFERENCE_BASE_URL`).**\n\nThe model runs on **your own server or even on the phone itself** and not on a cloud AI service. Your chats aren't shipped off to a third-party AI to be read; agent reads your snippet in memory and\ndiscards it after replying. 
The run trace you can optionally share is redacted, and the snippet is sent to the agent you control, which turns it into ready-to-add calendar events.\n\n**Hardware-aware.** On under-powered hardware, the app shows an upgrade banner rather than hanging; the real model needs a tiny GPU.\n\n## Build Small submission — the idea & the tech\n\n**The idea.** A busy parent's calendar lives in other people's messages — picture day in the\nclass chat, the practice that moved, the party flyer. OffGridSchedula turns those into calendar\nevents: paste the chat (or snap the flyer) from a phone browser, review the extracted events, the\nconflicts against your own `.ics`, and a drafted reply — then add to Apple/Google Calendar in a tap.\n\n**The tech.** Two small local models do the work. Extraction is [`gemma-cal` E4B](https://huggingface.co/build-small-hackathon/gemma-4-cal-gguf)\n(~4B effective params), our QLoRA fine-tune of Gemma-4 E4B that emits a single validated\n**ActionPlan** (events · conflicts · reply · clarifying question), served with **vision** through\nthe official **llama.cpp** server inside this Docker Gradio Space — no cloud AI APIs. The\nfine-tune + its 60-example task eval ran entirely on **Modal** serverless GPUs, behind an\neval gate that rejected eight regressed models before this one shipped. Conflict math is\ndeterministic Python, the UI is fully custom, the agent doubles as an **MCP tool server**, and\nredacted run traces are public on the [Hub](https://huggingface.co/datasets/ParetoOptimal/offgridschedula-traces).\nClick **Run the agents** and a local **OpenBMB MiniCPM** planner (a second local llama-server)\ndrives this same Space's MCP tools as a multi-step agent — extract → check conflicts → render\n`.ics` — with every step visible. 
Still zero cloud AI; every model under 32B.\n\n**What's new.** Extraction now reads the *logistics*, not just the date (see below): arrival-aware\nstart times, duration→end conversion, type-based reminders, and calendar-ready titles — each\nguaranteed by deterministic post-processing even when the model wobbles, and each shipped through\na measured A/B eval ([full result tables](./training/data/ab_results.md): regex vs text-LLM vs\n**vision-LLM reading rendered screenshots only**). Calendar out got one-click too: a unified\n**Connect your calendar** block (Google OAuth — the token lives in *your* browser, never on the\nserver; Outlook/Apple need no sign-in) and per-event **Google · Outlook · iCal** links, with the\nGoogle push verified end-to-end (push → readback → delete, 11/11).\n\n**The UX.** One decision — **Offline or Online** — re-themes the whole workflow card and sets the\npath: off-grid `.ics` only, or a **one-click \"Connect your calendar\"** whose Google OAuth token\nlives *only in the browser* (server-verified each visit; the client secret never leaves the\nserver). Results land in a single card: events, conflicts, the drafted reply, and per-event\n**Google · Outlook · iCal · .ics** quick-add links. **Activity → This week** tallies events\ncaptured, conflicts caught, and time saved; a per-device **Memory** (localStorage, one-click\nsamples) feeds names and preferences back into extraction.\n\n**Submission links:** [requirement-by-requirement mapping](./docs/build-small-submission.md) ·\n[demo video](https://youtu.be/m-o0u9X"
  },
  {
    "repo_id": "build-small-hackathon/BudgetBuddy",
    "title": "Budgetbuddy",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/BudgetBuddy",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "modal",
      "budgetbuddy",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: BudgetBuddy\nemoji: 🧾\ncolorFrom: indigo\ncolorTo: green\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npinned: false\nshort_description: Snap a bill, ask where your money went — small-model AI\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n  - build-small-hackathon\n  - minicpm\n  - modal\n  - gradio\n  - agent\n---\n\n# 🧾 BudgetBuddy\n\n**A spend tracker for real people — built entirely on small, open models.**\nSnap a messy receipt or a UPI/card screenshot (or just type it), and BudgetBuddy\nreads it, fixes the totals, categorises it, saves it privately, and lets you\n**chat with a tool-using agent** about where your money went — all in a custom\ndashboard UI. No third-party AI APIs. Two MiniCPM models do everything.\n\n> Build Small hackathon · **Backyard AI** track. Built for the people around me\n> (homemakers, parents, small-shop owners) who want to understand their spending\n> without a spreadsheet — and without handing their receipts to a cloud AI API.\n\n## ▶️ Demo & links\n- **Live app:** https://huggingface.co/spaces/build-small-hackathon/BudgetBuddy\n- **Demo video:** https://youtu.be/QbgY6HDbrxE\n- **Social post:** https://x.com/KrishnaIsCoding/status/2066565121464541191\n- **Build write-up (Field Notes):** https://huggingface.co/blog/KrishnaGarg/budget-buddy-field-notes ([repo copy](FIELD_NOTES.md))\n- **Open agent traces (Sharing is Caring):** [AGENT_TRACES.md](AGENT_TRACES.md)\n\n## Why it fits \"Build Small\"\n- **Small, open models — no third-party AI API.** Vision/OCR is\n  [MiniCPM-V-4.6](https://huggingface.co/openbmb/MiniCPM-V-4.6) (**1.3B**), running\n  on the Space's **ZeroGPU**. The reasoning + agent brain is\n  [MiniCPM4.1-8B](https://huggingface.co/openbmb/MiniCPM4.1-8B) (**8B**), running on\n  our own **[Modal](https://modal.com)** GPU. 
**~9.3B total — well under the 32B\n  cap.** We never call a hosted AI inference API (no OpenAI/Anthropic/Gemini) — only\n  open weights we run ourselves.\n- **Real problem, honest fit.** Real bills are messy: missing totals, taxes,\n  service charges, round-offs, mixed items, weird date formats. BudgetBuddy\n  reasons about them and reconciles the math, so editing is the exception.\n\n## What it does\n1. **Capture, three ways** — a photographed **receipt**, a **payment screenshot**\n   (UPI / GPay / PhonePe / card), or a quick **manual** entry.\n2. **Read & reconcile** — the vision model extracts vendor, date (normalised to\n   `YYYY-MM-DD`), line items, taxes/service/tip/discount/round-off, and total;\n   computes a missing total; flags anything that doesn't add up.\n3. **Reason & categorise** — the 8B reviews the extraction, fixes obvious errors,\n   and assigns an overall + per-item category (fixed 23-category list).\n4. **Dashboard** — monthly spend, vs-last-month, top category, spend-by-category\n   donut, spend-over-time chart, a budget ring, a calendar heatmap, and a\n   filterable transaction list that shows every line item **and** every tax/charge.\n5. **Agent chat** — ask *\"how much did I spend on Groceries last month?\"* or\n   *\"what's my biggest expense?\"* and the agent answers with your real numbers,\n   showing which **tools** it used.\n\n## 🤖 The agent (Best Agent)\nThe assistant is a real tool-using agent over `core/analytics`, not a chatbot that\nguesses. 
It exposes **11 tools** — `total_spend`, `category_spend`, `item_spend`,\n`vendor_spend`, `top_categories`, `biggest_expense`, `average_spend`,\n`count_transactions`, `budget_status`, `monthly_trend`, `recent` — each scoped by a\nflexible **period** (`this_month`, `last_month`, `this_year`, a specific month like\n`2026-07`, a year, or `all`).\n\nThe **8B plans every question**: it reads the question, decides which tool to call\n(and with what period), reads the result, optionally **chains another tool**, then\nanswers — a real ReAct loop. The tools are deterministic Python, so the *numbers*\ncan never be hallucinated; an answer is only ever returned once it is **grounded by\nan actual tool call** (ungrounded model output is rejected). A deterministic router\nover the same tools acts as a reliability fallback if the model can't produce a\nvalid plan. Every reply shows the **trace of tools used**, so the reasoning is\nauditable.\n\n## Custom UI (Off-Brand)\nThe frontend is a hand-built dark single-page app ([frontend/](frontend/)) served\nby **`gradio.Server`** (Gradio 6): Python API endpoints on the Gradio backend\n(queue + ZeroGPU), our own HTML/CSS/JS + Chart.js on top. The default Gradio shell\nis gone entirely.\n\n## ⚡ Modal (Best Use of Modal)\nMiniCPM4.1-8B's `trust_remote_code` targets transformers ~4.56 and breaks on the\n5.7 that MiniCPM-V-4.6 needs. Modal resolves the conflict cleanly: the 8B runs in\nits own container/env on an A10G, loaded once into a **memory snapshot** for fast\ncold starts and kept warm (`scaledown_window`), and the Space calls it through the\nModal SDK ([core/modal_backend.py](core/modal_backend.py),\n[modal_app.py](modal_app.py)"
  },
  {
    "repo_id": "build-small-hackathon/bureaucat",
    "title": "Bureaucat",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/bureaucat",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "bureaucat",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Bureaucat\nemoji: 🐱\ncolorFrom: pink\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: The cat that reads scary Swedish letters for you\ntags:\n- track:backyard\n- achievement:offgrid\n- achievement:offbrand\n- achievement:fieldnotes\n- build-small-hackathon\n- backyard-ai\n- off-the-grid\n- off-brand\n- field-notes\n- zerogpu\n- vision-language-model\n- sweden\nmodels:\n- Qwen/Qwen3-VL-8B-Instruct\n---\n\n**🔗 Submission:** [Demo video & launch post on LinkedIn](https://www.linkedin.com/posts/ravinarayansingh_buildsmallhackathon-gradio-huggingface-ugcPost-7472036424997974017-l0HA/) — the demo video is embedded in this post.\n\n# 🐱 Bureaucat\n\n**The cat that reads scary Swedish letters so you don't have to.**\n\nYou moved to Sweden. One day a thick envelope arrives from **Skatteverket**. Or **Försäkringskassan**. Or — heart rate rising — **Migrationsverket**. It's in formal bureaucratic Swedish, it mentions an amount and a date, and you have no idea if it means *\"FYI, everything is fine\"* or *\"pay 15,600 kr in 30 days or lose your housing allowance.\"*\n\nBureaucat reads the letter for you and tells you, at a glance and accurately:\n\n- **How worried to be** — a 1–5 severity verdict driving a **Panic Meter** and a reacting cat-civil-servant mascot\n- **The short version** — what the letter says, in plain English\n- **Why you got it** — which authority, and what triggered it\n- **What you need to do** — a checklist of required actions\n- **Deadlines & money** — every deadline, amount, and reference number **quoted verbatim from the letter, never invented**\n\n## 🏡 Backyard AI: built for a real person — me\n\nI'm an expat in Sweden. 
Every letter in the example gallery is a type of letter that has actually landed in my mailbox: tax decisions (slutskattebesked), benefit completion requests (komplettering), residence-permit document requests, CSN repayment demands, vårdcentral appointment summons. The first user test was me, on my own mail. The app is stateless by design — nothing is stored, nothing leaves the Space.\n\n## 🤏 Small model, honest fit\n\nOne model does everything — OCR, reasoning, and explanation in a single pass:\n\n| | |\n|---|---|\n| Model | [`Qwen/Qwen3-VL-8B-Instruct`](https://huggingface.co/Qwen/Qwen3-VL-8B-Instruct) (~9B params — well under the 32B cap) |\n| Decoding | **Greedy** (`do_sample=False`) — deterministic output for an accuracy-critical tool |\n| Inference | Entirely in this Space on ZeroGPU. **No cloud APIs.** |\n\nThe model was chosen by a bake-off, not vibes: Qwen2.5-VL-7B vs Qwen3-VL-8B on a gold set of five synthetic Swedish authority letters with an automated evaluation harness (`eval/run_eval.py`). The smaller 7B model dropped reference numbers (3/5 pass); the 8B passed 5/5 with **zero invented values and 100% recall** — so per \"smallest model that passes,\" the 8B won.\n\n## 🛡️ The anti-hallucination contract\n\nInventing a deadline is the single worst failure this app can have. So:\n\n1. The model first **transcribes** the letter, then extracts — every value in \"Deadlines & money\" must be a **verbatim substring of its own transcription**, checked by a pure-Python grounding pass (no second model call).\n2. If a value isn't in the letter, the answer is **\"None found.\"** — never a guess.\n3. The eval gate (zero invented values, 100% recall, severity parseable, beginner-mode invariance) runs on every gold letter, plus **adversarial fixtures**: a blurry photo, a grocery receipt, and a non-Swedish letter must all be *refused*, not analyzed.\n\n## 👇 How to use it\n\n**Upload a saved file** — a photo (JPEG/PNG) or a PDF of your letter, multi-page supported. 
Bureaucat does **not** use your camera or take live photos; snap the letter with your phone first, then upload the image or PDF here. One upload uses one short ZeroGPU call.\n\nThe example gallery at the bottom of the app contains **pre-computed analyses** — tap any of the five letters and the full result loads instantly at **zero GPU cost**, no upload needed.\n\n*Why synthetic examples?* Real authority letters are full of personal data (names, personnummer, amounts) and can't be published — and the official \"example\" documents authorities publish are info sheets without the personalized deadlines/amounts that make a letter scary. So the gallery letters are **faithful synthetic recreations** of the five letter types, with realistic Swedish layouts, diacritics, and values — built so nobody's real mail ends up in a public repo. Every letter shown in the app and in the demo video is one of these synthetic mock-ups — I tested Bureaucat privately on my own real Swedish mail, which never gets published.\n\n## 🏅 Badges claimed\n\n- **Off the Grid** — no cloud APIs; the 9B VLM runs entirely in this Space\n- **Off-Brand** — custom frontend via `gr.Server`\n- **Field Notes** — build report: what greedy decoding revealed that sampling had masked\n\n## ⚖️ Privacy & disclaimer\n\n- **Stateless:** letters are processe"
  },
  {
    "repo_id": "build-small-hackathon/post-audit",
    "title": "Post Audit",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/post-audit",
    "zone": "Experiment Lab",
    "track": "Backyard AI",
    "tags": [
      "post",
      "audit",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Post Audit\nemoji: 📋\ncolorFrom: blue\ncolorTo: green\nsdk: gradio\nsdk_version: \"6.18.0\"\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Audit posts against goal & audience (Gemma 4 + rules)\n---\n\n# Post Audit\n\nBrief-aware social post audit for the [build-small-hackathon](https://huggingface.co/build-small-hackathon) Backyard AI track.\n\n**Model:** Gemma 4 E4B (4.5B effective parameters, ≤32B hackathon limit)  \n**Inference:** Modal GPU endpoint  \n**Host:** Rule linters + deterministic score recomputation\n\nSet Space secret `MODAL_AUDIT_URL` to your deployed Modal web endpoint base URL (without trailing `/audit`).\n\n## Authors\n\n- Eugene Pasternak — [`pasternake`](https://huggingface.co/pasternake)\n- Pavel Trubin — [`agsagds`](https://huggingface.co/agsagds)\n"
  },
  {
    "repo_id": "build-small-hackathon/LifeChoice-Simulator",
    "title": "LifeChoice Simulator",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/LifeChoice-Simulator",
    "zone": "Builder Workshop",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "openai",
      "lifechoice",
      "simulator",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: LifeChoice Simulator\nemoji: 🧭\ncolorFrom: purple\ncolorTo: pink\nsdk: gradio\nsdk_version: 5.50.0\napp_file: app.py\npinned: true\nlicense: mit\nmodels:\n  - Qwen/Qwen2.5-7B-Instruct\ntags:\n  - build-small-hackathon\n  - gradio\n  - agent\n  - decision-support\n  - thousand-token-wood\n  - best-agent\n  - best-demo\n  - openai\n  - track:wood\n  - sponsor:openai\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# LifeChoice Simulator\n\nLifeChoice Simulator is an LLM-powered interactive future game for the **Thousand Token Wood** track. It turns a real fork in the road into a strange, playable eight-node future where choices change the world, the character, the pressure, and the ending report.\n\nThe experience sits between a text-adventure dungeon master, a decision simulator, and a pocket-sized alternate-life RPG. A 7B model dreams up adaptive scenarios; deterministic game state keeps those dreams consequential.\n\n![Architecture](docs/architecture.svg)\n\n## Why This Exists\n\nSome decisions are too personal for a pros-and-cons list and too uncertain for a single chatbot answer. A student choosing between an MTech path and a software job is not only comparing two labels. They are carrying family expectations, money pressure, identity, validation, time, and fear of regret.\n\nTraditional advice tools usually collapse that complexity into one recommendation. LifeChoice takes a different approach: it turns the decision into a small playable future, so the player can feel how tradeoffs evolve over time.\n\n## The Story\n\nMeet Aarav. He has two possible roads:\n\n- **MTech**: meaningful research, a professor who is interested, and a path that could grow into deeper expertise.\n- **Software Job**: faster income, clearer timelines, and relief for a family that needs financial stability soon.\n\nAarav chooses to enter the MTech future. 
The first month feels promising: a professor asks him to commit to a defined project. But the choice is not abstract anymore. Every decision changes something: money, fulfillment, validation, stress, and family trust.\n\nIf Aarav chases research too aggressively, fulfillment rises but stress and money pressure may break the world around him. If he protects stability too much, the research path may lose momentum. Earlier decisions come back later as obligations, closed doors, or delayed opportunities.\n\nBy the end, LifeChoice does not tell Aarav what to do. It shows the shape of the future he played: what he protected, what he sacrificed, and what costs kept returning.\n\n## Our Solution\n\nLifeChoice is an AI-native playable mirror for high-stakes personal choices. The LLM generates adaptive future scenarios from the player's dilemma, chosen path, calibration pressure, and previous decisions. A deterministic game engine tracks the consequences so the experience behaves like a game, not a loose chat.\n\nThe goal is not prediction. The goal is rehearsal. 
LifeChoice lets a player step into one possible future, make decisions inside it, and see how the emotional and practical state of the world changes.\n\n## Live Links\n\n- Live Space: [build-small-hackathon/LifeChoice-Simulator](https://huggingface.co/spaces/build-small-hackathon/LifeChoice-Simulator)\n- Demo video: [YouTube walkthrough](https://youtu.be/pHPdT6BKJcI)\n- Social post: [LinkedIn announcement](https://www.linkedin.com/posts/ajeyabhargava-jashwanth-reddy-aa1a57270_buildsmallhackathon-huggingface-openai-share-7472378829567614979-dGUp/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAEJdAlkBcnz7M6u7dt4WCnDYophXfRZ0MPM)\n- Public build trace: [lifechoice-simulator-trace](https://huggingface.co/datasets/build-small-hackathon/lifechoice-simulator-trace)\n- Social post script: [docs/social-post.md](docs/social-post.md)\n- Demo script: [docs/demo-script.md](docs/demo-script.md)\n- Compliance checklist: [docs/compliance-checklist.md](docs/compliance-checklist.md)\n\n## Thousand Token Wood Fit\n\nLifeChoice is built for the whimsical track: a playful, AI-native interaction that lets a small model wander somewhere stranger than a normal productivity app.\n\n| Track Signal | How LifeChoice Fits |\n|---|---|\n| Interactive AI game | Eight playable decision nodes, persistent stats, branching consequences, persona reactions, and final report |\n| Out-of-the-box entertainment | A serious life dilemma becomes an alternate-future game with pixel-art scenes and a reactive character |\n| Small model imagination | `Qwen/Qwen2.5-7B-Instruct` generates bounded future scenarios from the user's fork, pressure, and past choices |\n| Text-adventure energy | The model acts like a constrained future narrator while the state engine remembers what the player did |\n| Delightful presentation | Custom Gradio UI, cinematic scenario cards, world-state badges, metric bars, and pixel-art backgrounds |\n\n## Experience\n\nThe player enters a dilemma such as `MTech vs Software Job`, 
chooses one path to enter, adds the pressure that makes the decision real, and selects a voice that follo"
  },
  {
    "repo_id": "build-small-hackathon/canirunaimodel",
    "title": "Canirunaimodel",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/canirunaimodel",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "canirunaimodel",
      "summary",
      "available",
      "yet.",
      "gradio",
      "region"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: canirunaimodel\nemoji: 🤖\ncolorFrom: yellow\ncolorTo: indigo\nsdk: gradio\nsdk_version: \"5.34.0\"\npython_version: \"3.10\"\napp_file: app.py\npinned: false\n---\n\n# gradio-fullscreen\n\nLocal-only Gradio wrapper for the live `canirunaimodel` check page.\n\nThis folder is intentionally gitignored. It is meant for personal local use, not for committing or publishing from this repo.\n\n## What it opens\n\n- `https://canirunaimodel.vercel.app/check`\n\n## What the UI looks like\n\nThe wrapper is intentionally minimal:\n\n- a small clickable header at the top\n- a fullscreen embedded iframe below it\n\nThat means there are no extra controls, inputs, or descriptive blocks inside the Gradio app itself beyond the top link.\n\n## Files\n\n- `app.py` — the fullscreen Gradio wrapper\n- `requirements.txt` — Python dependency list\n\n## Run locally\n\nFrom inside this folder:\n\n```bash\npython -m venv .venv\n# Git Bash on Windows; on Linux/macOS use: . .venv/bin/activate\n. .venv/Scripts/activate\npip install -r requirements.txt\npython app.py\n```\n\nThen open the local Gradio URL shown in the terminal.\n\n## Notes\n\n- The top link opens the live website directly in a browser tab.\n- The embedded page loads the real deployed `/check` route.\n- If your browser blocks iframe embedding, use the top link instead.\n"
  },
  {
    "repo_id": "build-small-hackathon/read-along-ai",
    "title": "Read-Along AI",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/read-along-ai",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "read",
      "along",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Read-Along AI\nemoji: 🦉\ncolorFrom: blue\ncolorTo: yellow\nsdk: gradio\npython_version: \"3.11\"\napp_file: app.py\npinned: false\ntags:\n  - build-small-hackathon\n  - backyard-ai\n  - off-brand\n  - off-the-grid\n  - llama-champion\n  - tiny-titan\n  - sharing-is-caring\n  - field-notes\n  - well-tuned\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# 🦉 Read-Along AI: The Offline Reading Teacher\n\n**Live App (Official Submission):** [build-small-hackathon/read-along-ai](https://huggingface.co/spaces/build-small-hackathon/read-along-ai)  \n*(Note: This is the official hackathon submission Space. Because organization billing restrictions prevent personal CPU upgrades here, \"Off the Grid\" local inference takes ~10s per interaction.)*\n\n⚡ **High-Speed Mirror:** [kingkw1/read-along-ai](https://huggingface.co/spaces/kingkw1/read-along-ai)  \n*(Judges: For the best interactive experience of the local \"Off the Grid\" mode, please use this identical mirror hosted on a CPU-upgraded instance!)*\n\n**Organization:** Hosted under the official [`build-small-hackathon`](https://huggingface.co/build-small-hackathon) HF Org.\n\n**A Build Small Hackathon Submission**\n*Track: Backyard AI*\n\n## 📖 The Vision\n\nLearning to read is a monumental milestone, but standard voice-to-text models demand perfect diction. They treat a child's developing voice like a series of errors, turning practice into a frustrating test. **Read-Along AI** was built to fix this: it is a patient, distraction-free reading assistant that listens to a child read and provides instant, gentle feedback.\n\nAs a homeschooling parent to four young children, I know how chaotic managing daily reading practice can be. 
I built this tool for the **Backyard AI** track to solve a specific problem for the people I know best: to remove the friction from the daily reading curriculum for my 7-year-old son and 6-year-old daughter. Read-Along AI acts as an offline safety net—allowing them to sound out words at their own pace without the anxiety of a ticking clock, and without the privacy risks of sending their voices to a corporate cloud server.\n\n**Real-World Impact:** When I field-tested this with my kids, the engagement was immediate. Because the fine-tuned phonetic judge gracefully accepted their natural speech variations—and the gamified confetti cannon fired instantly upon a successful read—they voluntarily asked to keep \"playing\" through their reading assignment. It successfully transformed a high-friction task into an independent, confidence-building activity.\n\nCrucially, because this tool is for young learners, it requires absolute data privacy. It relies exclusively on localized, small-parameter models to ensure a child's voice data never enters a corporate data lake.\n\n## 🛠️ The Tech Stack & Architecture\nThis application strictly adheres to the < 32B parameter constraint, utilizing highly optimized small models for a real-time, fluid user experience.\n\n### Development Documentation\nFor a deep dive into the architecture and development plan, please review our spec documents:\n* [Product Specification](docs/PRODUCT_SPEC.md)\n* [UI/UX & Frontend Specification](docs/UI_UX_SPEC.md)\n* [API & Backend Contract](docs/API_CONTRACT_SPEC.md)\n* [Deployment Strategy](docs/DEPLOYMENT_SPEC.md)\n* [Hackathon Roadmap](docs/ROADMAP.md)\n\n### Components\n* **Frontend:** A custom, gamified Gradio interface (\"Off-Brand\" UI) built for legibility and young readers, with a custom HTML reading canvas, child-sized controls, local CSS/JS reward effects, and hidden Gradio chrome.\n* **ASR (Speech-to-Text):** **Cohere Transcribe** (2B parameters) in Turbo Mode and `faster-whisper` `tiny.en` in 
Off the Grid Mode.\n* **Reading Evaluator:** A fine-tuned **MiniCPM phonetic evaluator** (`kingkw1/minicpm-phonetic-evaluator`) judges close or ambiguous ASR transcripts after exact normalized matching. The tracked model card source lives at [`docs/model_cards/minicpm-phonetic-evaluator.md`](docs/model_cards/minicpm-phonetic-evaluator.md).\n* **TTS / Audio Help:** **OpenBMB VoxCPM** (0.5B parameters) powers Modal Turbo Mode and was used to generate local curriculum audio assets. Off the Grid Mode defaults to committed sentence WAVs plus label-sliced word clips for responsive local assistance.\n* **Compute / Inference:** Utilizes a **Dual-Mode Hybrid Architecture**. The app includes **Turbo Mode** for Modal serverless endpoints and **Off the Grid Mode** for local Hugging Face Space resources.\n\n### Dual-Mode Inference Engine\nThe app deliberately ships with both inference paths:\n\n* **🏕️ Off the Grid Mode (Local):** Runs inside the Hugging Face Space without Modal. Local ASR uses `faster-whisper`, the phonetic evaluator loads the Q4 MiniCPM GGUF through `llama-cpp-python`, and audio assistance uses committed curriculum WAVs from `data/curriculum_audio/` with word clips sl"
  },
  {
    "repo_id": "build-small-hackathon/toy-room-v3",
    "title": "Toy Room V3",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/toy-room-v3",
    "zone": "Builder Workshop",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "modal",
      "toy",
      "room",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Toy Room V3 - Sanjay Prasad H S\nsdk: docker\napp_port: 7860\npinned: false\nshort_description: Fire Boy MiniCPM-V VLA virtual pet by Sanjay Prasad H S.\nlicense: mit\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - minicpm\n  - minicpm-v\n  - minicpm-o\n  - vla\n  - vision-language-action\n  - virtual-pet\n  - tamagotchi\n  - toy-room\n  - threejs\n  - mujoco\n  - modal\n  - codex\n  - openai-codex\n  - agents\n  - gradio\n  - custom-ui\n  - research-demo\n  - serverless-inference\n  - \"track:wood\"\n  - \"sponsor:openbmb\"\n  - \"sponsor:openai\"\n  - \"sponsor:modal\"\n  - \"achievement:offgrid\"\n  - \"achievement:welltuned\"\n  - \"achievement:offbrand\"\n  - \"achievement:llama\"\n  - \"achievement:sharing\"\n  - \"achievement:fieldnotes\"\nmodels:\n  - openbmb/MiniCPM-o-4_5\n  - openbmb/MiniCPM-V-4_6\n---\n\n# Tiny Toybox: Fire Boy MiniCPM-V VLA\n\n<p align=\"center\">\n  <a href=\"https://build-small-hackathon-toy-room-v3.hf.space/toy-v3\"><img alt=\"Hugging Face Space\" src=\"https://img.shields.io/badge/Hugging%20Face-Space-ffcc4d?logo=huggingface&logoColor=black\"></a>\n  <a href=\"https://huggingface.co/build-small-hackathon/fireboy-minicpm-v-4-6-vla\"><img alt=\"MiniCPM-V 4.6 VLA model\" src=\"https://img.shields.io/badge/MiniCPM--V%204.6-VLA-4c8bf5\"></a>\n  <a href=\"https://modal.com\"><img alt=\"Modal runtime\" src=\"https://img.shields.io/badge/Modal-serverless%20inference-111111\"></a>\n  <a href=\"https://openai.com/codex\"><img alt=\"OpenAI Codex assisted\" src=\"https://img.shields.io/badge/OpenAI%20Codex-assisted-10a37f\"></a>\n  <a href=\"LICENSE\"><img alt=\"MIT license\" src=\"https://img.shields.io/badge/license-MIT-blue\"></a>\n</p>\n\n<p align=\"center\">\n  <b>A playable AI-age virtual toy:</b> Fire Boy sees a toy room, receives natural language, routes commands through a MiniCPM-V 4.6 VLA head, then acts inside a Three.js/physics world with inspectable traces.\n</p>\n\n<p 
align=\"center\">\n  <a href=\"https://youtu.be/rwRIXLwaLmI\">Watch demo</a>\n  &nbsp;|&nbsp;\n  <a href=\"https://build-small-hackathon-toy-room-v3.hf.space/toy-v3\">Play Toy Room v3</a>\n  &nbsp;|&nbsp;\n  <a href=\"https://build-small-hackathon-toy-room-v3.hf.space/vla-research\">Read research page</a>\n  &nbsp;|&nbsp;\n  <a href=\"https://build-small-hackathon-toy-room-v3.hf.space/frontend/research/minicpm-v46-fireboy-vla-research-paper.pdf\">Open PDF</a>\n  &nbsp;|&nbsp;\n  <a href=\"https://huggingface.co/datasets/build-small-hackathon/fireboy-vla-rollout-artifacts\">Dataset</a>\n</p>\n\n<p align=\"center\">\n  <a href=\"https://youtu.be/rwRIXLwaLmI\">\n    <img src=\"demo/fire-boy-v3-demo-thumbnail.png\" alt=\"Fire Boy Toy Room v3 demo thumbnail\" width=\"760\">\n  </a>\n</p>\n\n`minicpm-v` `vision-language-action` `virtual-pet` `threejs` `mujoco` `modal` `codex` `gradio` `hackathon` `serverless-inference` `research-demo` `track:wood` `sponsor:openbmb` `sponsor:openai` `sponsor:modal`\n\n## Creator And Submission Links\n\n| Item | Link |\n| --- | --- |\n| Creator | **Sanjay Prasad H S** (`sanjuhs`) |\n| GitHub repo | [sanjuhs/build-small-hackathon-v1](https://github.com/sanjuhs/build-small-hackathon-v1) |\n| Canonical HF Space repo | [build-small-hackathon/toy-room-v3](https://huggingface.co/spaces/build-small-hackathon/toy-room-v3) |\n| Canonical live Space | [https://build-small-hackathon-toy-room-v3.hf.space/toy-v3](https://build-small-hackathon-toy-room-v3.hf.space/toy-v3) |\n| Personal HF Space mirror | [sanjuhs/toy-room-v3](https://huggingface.co/spaces/sanjuhs/toy-room-v3) |\n| Personal live Space mirror | [https://sanjuhs-toy-room-v3.hf.space/toy-v3](https://sanjuhs-toy-room-v3.hf.space/toy-v3) |\n| YouTube demo | [https://youtu.be/rwRIXLwaLmI](https://youtu.be/rwRIXLwaLmI) |\n| HF model repo | [build-small-hackathon/fireboy-minicpm-v-4-6-vla](https://huggingface.co/build-small-hackathon/fireboy-minicpm-v-4-6-vla) |\n| HF dataset repo | 
[build-small-hackathon/fireboy-vla-rollout-artifacts](https://huggingface.co/datasets/build-small-hackathon/fireboy-vla-rollout-artifacts) |\n\n## What This Is\n\nTiny Toybox is a Build Small Hackathon project about making a small virtual creature feel less like a chatbot and more like a toy that lives in a room. The inspiration is a mix of Tamagotchi, Talking Tom, Pokemon-style companion play, and the dream of tiny embodied agents that can observe, decide, remember, and form habits over time.\n\nThe shipped demo focuses on **Fire Boy**, a small rigged character in **Toy Room v3**. You can drag him, pet/touch him, drop toys into the scene, ask him to pick up a ball, run around, or find and eat a berry. The point of the demo is visible causality: model call, action contract, physics result, retargeted animation, and training evidence are all inspectable.\n\nThis repo is also a research artifact for the question: **how do we turn a compact vision-language model such as MiniCPM-V 4.6 into a vision-language-action controller for a virtual pet?**\n\n## Demo Links\n\n| Artifact | Link |\n| --- | --- |\n| Playable HF Space | [Toy Room v3](https://build-small-hackathon-toy-room-v3.hf.space/toy-v3) |\n| Page directory |"
  },
  {
    "repo_id": "build-small-hackathon/vyber-cyber",
    "title": "Vyber Cyber",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/vyber-cyber",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "track",
      "sponsor",
      "vyber",
      "cyber",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Vyber\nemoji: 🛡️\ncolorFrom: gray\ncolorTo: gray\nsdk: gradio\nsdk_version: 4.36.1\npython_version: 3.11\napp_file: app.py\npinned: false\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n<p align=\"center\">\n  <img src=\"assets/vyber-hero.svg\" alt=\"Vyber Duel hero banner\" width=\"100%\">\n</p>\n\n<p align=\"center\">\n  <a href=\"https://huggingface.co/spaces/build-small-hackathon/vyber-cyber\"><strong>Launch Space</strong></a>\n  ·\n  <a href=\"https://huggingface.co/vxkyyy/vyber-security-7b-gguf\"><strong>Model</strong></a>\n  ·\n  <a href=\"https://github.com/Vickyrrrrrr/vyber-cyber\"><strong>Source</strong></a>\n  ·\n  <a href=\"https://x.com/UnfilteredLogix/status/2066581507897282659?s=20\"><strong>Social Post / Video Demo</strong></a>\n  ·\n  <a href=\"https://huggingface.co/build-small-hackathon\"><strong>Build Small Hackathon</strong></a>\n</p>\n\n<p align=\"center\">\n  <code>Gradio Space</code>\n  <code>Fine-tuned 7B</code>\n  <code>GGUF</code>\n  <code>llama.cpp</code>\n  <code>Modal GPU</code>\n  <code>Red-to-Blue Agent Loop</code>\n</p>\n\n# Vyber\n\n**Vyber** is a self-healing cyber-range for the [Hugging Face Build Small Hackathon](https://huggingface.co/build-small-hackathon). 
It runs a two-agent security loop where a Red Agent finds vulnerable server patterns, a Blue Agent patches the files inside an isolated sandbox, and Red replays the same exploit path to verify whether the fix actually worked.\n\nVyber is built around a fine-tuned 7B cybersecurity model published on Hugging Face:\n\n- **Model:** [`vxkyyy/vyber-security-7b-gguf`](https://huggingface.co/vxkyyy/vyber-security-7b-gguf)\n- **App:** [`build-small-hackathon/vyber-cyber`](https://huggingface.co/spaces/build-small-hackathon/vyber-cyber)\n- **Code:** [`Vickyrrrrrr/vyber-cyber`](https://github.com/Vickyrrrrrr/vyber-cyber)\n\nThe goal is simple: make defensive security feel like a live repair operation, not a static scanner report.\n\n## Why Vyber Exists\n\nSmall teams often know they should fix security issues, but the gap between \"finding a vulnerability\" and \"shipping a safe patch\" is painful:\n\n- scanners produce long reports without repair context\n- developers need exact file-level fixes, not generic advice\n- security teams need proof that the original exploit no longer works\n- students need safe, repeatable environments to learn real defensive patterns\n\nVyber turns that workflow into an agentic loop:\n\n1. **Red finds evidence** by reading vulnerable files in a sandbox.\n2. **Red writes an exploit report** with affected files, CWE class, evidence, attack path, and impact.\n3. **Blue patches the code/config** using a tool harness with read, edit, and shell access scoped to `/tmp/sandbox`.\n4. **Red re-attacks** using the original exploit recipe.\n5. **Vyber passes only when the exploit is blocked.**\n\nThat last step is the product: not \"the AI says it fixed it\", but \"the same attack no longer works.\"\n\n## What Works Today\n\nVyber currently has two execution modes.\n\n### Public Demo Replay\n\nThe demo replay is a scripted no-GPU trace for public visitors. 
It exists so the Hugging Face Space can stay usable even if many people open it at once.\n\n- no GPU credits used\n- instant startup\n- shows the intended Red -> Blue -> Red verification experience\n- safe for public traffic\n\n### Live GPU Duel\n\nThe live mode runs the actual Modal-backed cyber-range loop.\n\n- creates vulnerable lab files under `/tmp/sandbox`\n- loads the fine-tuned GGUF model through `llama.cpp`\n- asks the Red Agent to inspect files and produce an exploit report\n- gives the Blue Agent a Vyber tool harness for file reads, edits, and shell commands\n- re-runs Red verification checks after the patch attempt\n\nThe lab vulnerabilities are intentionally seeded by the cyber-range. The fixes are applied to real sandbox files during the live run.\n\n## Current Safety Boundary\n\nVyber does **not** scan random public servers. It is a defensive cyber-range for controlled targets.\n\nCurrent targets:\n\n- generated vulnerable config files\n- OWASP-inspired lab patterns\n- isolated Modal `/tmp/sandbox` workspace\n\nFuture targets:\n\n- user-owned GitHub repositories\n- user-owned Docker Compose apps\n- intentionally vulnerable OWASP lab containers\n- authorized SSH targets in review-first mode\n\nUse Vyber only on systems you own or have explicit permission to test.\n\n## Cyber-Range Lab Packs\n\nEach lab pack plants three vulnerable files and three independent exploit paths.\n\n| Lab | Theme | Vulnerabilities | Example files |\n| --- | --- | --- | --- |\n| Scenario 1 | Secret leak | hardcoded API keys, database passwords, world-readable deploy script | `app_config.json`, `server.env`, `deploy.sh` |\n| Scenario 2 | Exposed database | public bind address, auth disabled, weak TLS, open firewall | `db_settings.yaml`, `nginx.conf`, `firewall_rules.json` |\n| Scenario 3 | MITM pipeline | HTTP billing str"
  },
  {
    "repo_id": "build-small-hackathon/YourGymBuddy",
    "title": "Yourgymbuddy",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/YourGymBuddy",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "yourgymbuddy",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Your Gym Buddy\nemoji: 🏋️\ncolorFrom: purple\ncolorTo: blue\nsdk: gradio\napp_file: app/app.py\npinned: false\nhf_oauth: true\nshort_description: A local-first AI gym coach for the Build Small Hackathon\nsdk_version: 6.18.0\ntags:\n  - track:backyard\n  - sponsor:nvidia\n  - sponsor:openbmb\n  - sponsor:modal\n  - sponsor:cohere\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:welltuned\n  - achievement:tinytitan\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n\nmodels:\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16\n  - openbmb/MiniCPM5-1B\n  - google/gemma-4-12B-it\n  - PedroRuizCode/gym-buddy-minicpm5-1b\n  - PedroRuizCode/gym-buddy-minicpm5-1b-GGUF\n  - nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF\n  - unsloth/gemma-4-12b-it-GGUF\n---\n\n# Your Gym Buddy\n\nA local-first AI gym coach for the [Build Small Hackathon](https://huggingface.co/build-small-hackathon).\n\nImport your workout history from apps like **Hevy** (CSV export), get instant training\nanalytics, and chat with a coach powered by small, **on-device** language models. Prefer\ntalking over typing? Record your question and it's transcribed locally.\n\nEverything that runs at app-time runs **locally** (no cloud APIs) - the chat models go\nthrough `llama.cpp` and speech-to-text runs on-device.\n\n## 🌟 Hackathon Tracks & Highlights\n\nThis project was built specifically for the **Build Small Hackathon**, aligning deeply with the following themes and tracks:\n\n- 🏡 **Track: Backyard AI**: Built to solve a real, specific problem for gym-goers! 
It takes raw, hard-to-read workout logs (like Hevy CSV exports) and turns them into immediate, actionable coaching and visual analytics.\n\n- 🏕️ **Off the Grid**: **100% local, no cloud APIs!** The chat models and speech-to-text models run securely on-device without needing a constant internet connection to third parties.\n- 🦙 **Llama Champion**: Our core AI coaching backend is powered efficiently by the incredible `llama.cpp` (via `llama-cpp-python`).\n- 🎨 **Off-Brand**: We shattered the default Gradio UI mold! We built a fully custom, responsive, dark/light HTML/CSS/JS frontend SPA using `gradio.Server` under the hood.\n- 🐜 **Tiny Titan**: We feature incredibly capable, ultra-tiny models! Users can select between `NVIDIA-Nemotron-3-Nano` (4B parameters) or our customized `MiniCPM5` (1B parameters).\n- 🎛️ **Well-Tuned**: We took the tiny `MiniCPM5` model and **heavily fine-tuned it** to act specifically as an expert gym coach, resulting in the aligned `YourGymBuddy-MiniCPM5` model! The model was trained on a custom synthetic dataset generated using the scripts in our `synthetic_data/` folder—which anyone can use and modify to generate their own custom dataset! You can find our final generated dataset here: [PedroRuizCode/your-gym-buddy-coach-sft](https://huggingface.co/datasets/PedroRuizCode/your-gym-buddy-coach-sft).\n- ☁️ **Best Use of Modal**: Our entire `MiniCPM5` fine-tuning pipeline is built using a custom Modal app (`finetune/`). We leverage Modal's incredible serverless GPU infrastructure to quickly spin up heavy training workloads, fine-tune the model with our gym coaching dataset, and export the customized GGUFs.\n- 📝 **Field Notes**: We've documented our development journey, technical challenges, and insights. 
[Read our Field Notes Report here](#) *(Link coming soon)*.\n\nSadly, I joined too late to get the Codex credits, so I wasn't able to use Codex in the final build.\n\n---\n\n### 📢 Media & Links\n- 🎥 **Demo Video:** [Watch the demo on YouTube](https://www.youtube.com/watch?v=1eAD1MZAReQ). The video has Spanish subtitles. I'd really appreciate it if you watched it; if needed, use YouTube's subtitle translation tool to translate them into English.\n- 💬 **Social Posts:**\n  - [X Post](https://x.com/iampedroruiz/status/2066665629059534940)\n  - [TikTok Post](https://www.tiktok.com/@iampedroruiz/video/7651768192842419474)\n\n### 📰 Field Notes\n- [Field Notes](field_notes.md)\n- [Hackathon Blog Post](https://huggingface.co/blog/build-small-hackathon/yourgymbuddy)\n\n## Highlights\n\n- Import generic gym-app CSV exports (Hevy-style, but format-flexible).\n- Training analytics: volume trends, estimated 1RM + PRs, muscle-group balance, plateaus.\n- Switchable chat models, all under the 32B cap:\n  - `nvidia/NVIDIA-Nemotron-3-Nano-4B-GGUF` (4B, balanced)\n  - `unsloth/gemma-4-12b-it-GGUF` (12B, strongest)\n  - `PedroRuizCode/gym-buddy-minicpm5-1b-GGUF` (1B, tiny + fast)\n- Optional voice input via `CohereLabs/cohere-transcribe-03-2026`.\n- Fully custom dark/light UI (dark by default) built on `gradio.Server`.\n\n## Project layout\n\n```\napp/                 Main Gradio Server app (Hugging Face Space root)\n  app.py             gradio.Server: API endpoints + custom UI route\n  utils/             Parsing, analytics, model loading, chat, transcription\n  frontend/          Custom HTML/CSS/JS UI (dark default + light toggle)\n  models/            Downloaded GGUFs (gitignored) + download script\n  data/              Sample CSV + user uploads\nfinetune/   "
  },
  {
    "repo_id": "build-small-hackathon/dota2tuned",
    "title": "Dota2Tuned",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/dota2tuned",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "achievement",
      "track",
      "dota2tuned",
      "summary",
      "available"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: DOTA2Tuned\nsdk: gradio\napp_file: app.py\npython_version: \"3.12\"\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:sharing\nmodels:\n- build-small-hackathon/dota2tuned-qwen3-4b-2507-lora\n- build-small-hackathon/dota2tuned-minicpm4-1-8b-lora\n- build-small-hackathon/dota2tuned-qwen3-30b-a3b-2507-lora\ndatasets:\n- build-small-hackathon/dota2tuned-data\n---\n\n# DOTA2Tuned\n\nDOTA2Tuned is a Hugging Face Build Small Hackathon project for Dota 2 drafting, meta analysis, build suggestions, and match prediction.\n\nThe implementation is designed around a simple rule: stats and predictors choose the recommendations, while a small fine-tuned model explains them with patch-aware evidence.\n\n- Hugging Face Space: https://build-small-hackathon-dota2tuned.hf.space\n- Modal alternate UI: https://dracufeuer--dota2tuned-ui.modal.run\n- Tiny fine-tuned adapter: https://huggingface.co/build-small-hackathon/dota2tuned-qwen3-4b-2507-lora\n- Balanced fine-tuned adapter: https://huggingface.co/build-small-hackathon/dota2tuned-minicpm4-1-8b-lora\n- Quality fine-tuned adapter: https://huggingface.co/build-small-hackathon/dota2tuned-qwen3-30b-a3b-2507-lora\n- Dataset artifacts: https://huggingface.co/datasets/build-small-hackathon/dota2tuned-data\n- Demo video: TODO add final demo video URL before validation.\n- Social post: TODO add final social post URL before validation.\n\n## Hackathon Validation\n\nThe Build Small validator checks this Space README. 
Entry requirements are: sub-32B models, Gradio Space in the Build Small org, demo video, social-media post linked from this README, and GPU-limit compliance.\n\nValidator: https://build-small-hackathon-field-guide.hf.space/submit\n\nSelected tracks, prizes, and badges:\n\n- Tracks: Backyard AI, Thousand Token Wood.\n- Sponsor prizes: OpenBMB Best MiniCPM Build, OpenAI Best Use of Codex, Modal Best Use of Modal.\n- Bonus badges: Well-Tuned, Off-Brand, Sharing is Caring.\n- Not claimed: Nemotron Hardware Prize, Off the Grid, Llama Champion.\n- GPU note: this Space does not use Zero GPU allocation; training and tuned-model serving use Modal.\n\nSocial post draft:\n\n> Built DOTA2Tuned for the Hugging Face Build Small Hackathon: a Gradio Dota 2 draft coach that combines STRATZ/OpenDota match evidence, deterministic draft stats, and fine-tuned sub-32B adapters for grounded explanations.\n>\n> It suggests heroes, counters, synergies, builds, match predictions, and caveats weak data instead of inventing unsupported meta claims.\n>\n> Space: https://build-small-hackathon-dota2tuned.hf.space\n> Tiny: https://huggingface.co/build-small-hackathon/dota2tuned-qwen3-4b-2507-lora\n> Balanced: https://huggingface.co/build-small-hackathon/dota2tuned-minicpm4-1-8b-lora\n> Quality: https://huggingface.co/build-small-hackathon/dota2tuned-qwen3-30b-a3b-2507-lora\n> Repo: https://github.com/1ncompleteness/DOTA2Tuned\n\n## Quick Start\n\n```bash\nuv sync\ncp .env.example .env\nuv run dota2tuned health\nuv run dota2tuned smoke --live\nuv run dota2tuned ingest --pro-matches 100 --public-matches 100 --enrich-limit 20\nuv run dota2tuned normalize\nuv run dota2tuned features\nuv run dota2tuned build-rag\nuv run dota2tuned serve\n```\n\nUpdate `.env` with your Hugging Face, STRATZ, OpenDota, and Steam tokens before running large ingestion or Hub operations.\nFine-tuning now runs on Modal by default for this project. 
Set `MODAL_ENABLED=1`, `MODAL_TOKEN_ID`, and `MODAL_TOKEN_SECRET`, then use `uv sync --extra modal` before deploying Modal functions. `HF_TOKEN` must still include `repo.write` so the training run can push adapters to the configured Hub model repos. Use `MODEL_PROFILE` or `--profile` to select Tiny, Balanced, or Quality.\n\nThe submitted Space includes compact serving artifacts under `data/parquet`, `data/rag`, and `data/models`. Raw API responses remain local-only and ignored by git.\n\n## Main Commands\n\n- `dota2tuned ingest` fetches raw reference data, pro matches, public matches, patch notes, and optional match enrichment.\n- `dota2tuned smoke --live` validates configured tokens with tiny live API checks.\n- `dota2tuned normalize` converts raw JSONL into Parquet tables and refreshes DuckDB views.\n- `dota2tuned features` refreshes DuckDB views and reports feature table row counts.\n- `dota2tuned train-predictor` trains the draft win predictor from normalized matches.\n- `dota2tuned build-rag` creates patch/stat cards and a local retrieval index.\n- `dota2tuned make-sft` creates JSONL examples for SFT.\n- `dota2tuned modal-deploy` deploys the Modal Gradio app and GPU training function.\n- `dota2tuned modal-smoke` validates the deployed Modal app can load artifacts.\n- `dota2tuned modal-train --profile qwen3_4b_2507` submits the Tiny Modal GPU QLoRA run. Other profiles include `minicpm4_1_8b` and `qwen3_30b_a3b_2507`; Quality routes to the H200-backed `train_sft_quality` function.\n- `uv run python scripts/watch_modal_training.py --call tiny=fc-... -"
  },
  {
    "repo_id": "build-small-hackathon/agentic-humanitarian-data-analyst",
    "title": "Agentic Humanitarian Data Analyst",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/agentic-humanitarian-data-analyst",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "agentic",
      "humanitarian",
      "data",
      "analyst",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Agentic Humanitarian Data Analyst\nemoji: 🏥\ncolorFrom: blue\ncolorTo: green\nsdk: gradio\nsdk_version: \"6.18.0\"\napp_file: app.py\npinned: false\ntags:\n  - track:backyard\nlicense: mit\n---\n\n# Agentic Humanitarian Data Analyst\n\n**Applying a semantic layer and spec-driven development to agentic humanitarian data analysis.**\n\nIt turns an analyst's question and a raw survey into a **reviewable data-analysis plan** — which sectors, which indicators, and exactly what this dataset can and can't measure — *before* any agent runs a number. The plan is the spec. A human approves it; the analysis runs against it.\n\n> [Build Small hackathon](https://huggingface.co/build-small-hackathon) · Track: **Backyard AI** · every model **< 32B params**.\n\n---\n\n## The idea\n\nTwo patterns from software, applied to AI analysis:\n\n- **A semantic layer** — a governed catalog of indicator definitions (what an indicator *means*, how it's computed, how it fails) that sits between a question and the data, so nobody recomputes \"food consumption\" from memory.\n- **Spec-driven development** — write the plan first, have a human review it, *then* execute. Here the plan is a coverage spec: per indicator, **Measurable / Proxy / Not measurable** from this exact survey, each verdict traced to a published standard, not to the model's memory.\n\nThe pipeline is the **skill** ([`humanitarian-data-analyst`](https://github.com/yannsay/humanitarian-data-analyst)) — a reusable, domain-general pattern. This **app** is the friendly front end to it.\n\n## Why it needs an LLM at all\n\nThe steps are **deterministic** — selecting indicators from the catalog, joining them to the survey, rendering the plan are all code, repeatable byte-for-byte. The **inputs are not**. Two translations are irreducibly fuzzy, and that's the only place the model runs:\n\n1. a free-text **question → which sector / analytical framework** (the route), and\n2. 
per indicator, a **catalog definition → which Kobo question(s) actually measure it** (the map).\n\nThat's the bet: keep the LLM to the messy human-to-machine translation, and make everything downstream deterministic code over a governed catalog.\n\n---\n\n## The humanitarian problem this solves\n\nHumanitarian analysis is a specialist domain. Ask for \"food security\" and you've invoked a specific, named set of indicators — the Food Consumption Score (FCS), the reduced Coping Strategies Index (rCSI), the Household Hunger Scale (HHS) — each with an exact definition, a required set of survey questions, and a documented list of ways people get them wrong. The expertise isn't vague; it's a catalog. The job of an analyst, working a survey collected in Kobo, is to apply that catalog correctly to *this* dataset — and every survey is built differently, so the same indicator maps to different questions every time. There's no fixed lookup; the mapping has to be redone for each new form.\n\nThat's where it breaks. A standard indicator gets computed from questions that don't actually support it — producing a result that looks plausible but isn't an indicator the sector recognises, or doesn't exist at all. Not hypothetical: the test case is a real rapid needs assessment that shipped **four documented indicator errors** — an rCSI computed from the wrong columns, a misread JMP water ladder (the WHO/UNICEF Joint Monitoring Programme's drinking-water service classification), a misapplied Sphere threshold (the sector's minimum humanitarian standards), and an FCS reported even though the survey contained no dietary-recall question to build it from.\n\nA capable general LLM can recognise every one of these — the methodology is well documented and in its training. What it lacks is the **attention** to apply each definition, every time, to the right columns, under the pressure of producing an answer. 
Specialist precision is exactly what a generalist skips.\n\nAnd this is where a small model can *beat* a big one. The failure is attention, not knowledge — so the fix isn't more capacity, it's structure: hand the model one indicator's definition and known errors, point it at the candidate survey questions, and ask for a single verdict. That narrow, supplied task is precisely what a small model does reliably and a big general model fumbles by trying to hold everything at once. Where the generalist's breadth becomes a liability, the small model's focus becomes the feature.\n\n---\n\n## How it works\n\nThe model is fenced to the two translation points above; **code does everything else**.\n\n```\nAnalyst question + Kobo XLSForm\n   │\n ROUTE   — question → sector / analytical framework\n   │       ├─ type a question  → LLM translates it  ◀ fuzzy input, needs the model\n   │       └─ pick chips        → no LLM, instant\n   │\n SELECT  — sector → indicators from the catalog        → deterministic script\n   │\n MAP     — each indicator → the survey's questions      ← the semantic layer\n   │       per-indicator loop: LLM proposes candidate variables  ◀ fuzzy input, needs the model\n   │       → verdict:"
  },
  {
    "repo_id": "build-small-hackathon/closet-twin",
    "title": "Closet Twin",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/closet-twin",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "closet",
      "twin",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Closet Twin\nemoji: 🐨\ncolorFrom: purple\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Turn your wardrobe into a personal AI stylist.\n\ntracks:\n  - Backyard AI\n\nbadges:\n\n   - Best MiniCPM Build\n\n   - Black Forest Labs\n\n\n   - Off Brand\n\n   - Tiny Titan\n\n   - Best Demo\n\n   - Bonus Quest Champion\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Closet Twin\n\n**Your Personal AI Stylist, Built From Your Real Wardrobe**\n\nCloset Twin is a wardrobe assistant that helps users decide what to wear using clothing they already own. Instead of recommending new purchases, Closet Twin creates personalized outfit suggestions from garments stored in the user's wardrobe, making fashion decisions easier, faster, and more sustainable.\n\n---\n\n## 1. Inspiration\nSomehow, getting dressed has become harder than ever.\nThree of us. Three sisters. Three wardrobes full of clothes. And every single morning, the same complaint.\nEvery morning, without fail, all three sisters would stand in front of a wardrobe full of clothes and announce that they had \"nothing to wear\". Not occasionally. Every single morning. One of us would hear it from across the hall. Another would get a panicked voice note at 7am. The third had a sister who would show up to breakfast already stressed about it. Different houses, same crisis.\n\nWe found it funny at first. Then we started actually thinking about it.\n\nThe problem wasn't the clothes. Between the three of them, there were more outfits than most people own in a lifetime. The real issue was that a full wardrobe, without structure, becomes invisible. 
Things get buried, good combinations get forgotten, and when you're running late on a Tuesday morning, choosing an outfit from a hundred options is somehow harder than choosing from ten.\n\nSo we built Closet Twin: a personal AI stylist. Photos go in, the system analyzes each garment, builds a living digital wardrobe, and starts generating outfits from what's already there.\n\nThree teammates. Three sisters. One shared problem and one tool built to solve it.\n\n---\n\n## 2. What Closet Twin Does\n\nCloset Twin acts as a personal AI stylist that:\n\n1. Analyzes uploaded clothing items.\n2. Extracts garment attributes such as:\n   * Category\n   * Type\n   * Color\n   * Style\n   * Fit\n   * Pattern\n   * Season\n3. Builds a digital wardrobe.\n4. Generates complete outfits using only owned clothing.\n5. Learns from user feedback.\n6. Recreates inspiration looks (Pinterest, Instagram, ...) using available wardrobe pieces.\n7. Identifies underused garments.\n8. Generates outfit cards and styling notes.\n\n## 3. Tech Stack\n\n- **Frontend**: Gradio 6 (responsive, interactive UI)\n- **Backend**: FastAPI + Uvicorn (high-performance server)\n- **Vision Model**: MiniCPM-V-4.6 (garment analysis & inspiration image understanding)\n- **Image Generation**: FLUX.2-klein-base-4B (clean product photography from uploaded garments)\n- **LLM Fallback**: Qwen/Qwen2.5-7B-Instruct (outfit generation when local models are unavailable)\n- **Storage**: JSON files + Git LFS (for efficient image management)\n- **Deployment**: Hugging Face Spaces\n- **Image Processing**: PIL/Pillow\n\n## 4. 
Features\n\n### 🏠 Home Page\nLanding page with CTA buttons to quickly access your closet or get styled.\n\n### 👗 My Closet\n- Upload and organize clothing items\n- Automatic attribute extraction using MiniCPM-V-4.6 vision model\n- Smart categorization (dress, top, bottom, shoes, outerwear, accessory)\n- Each item tracks: type, color, style, fit, pattern, season, formality level\n- Category filtering and item preview\n- Select anchor items to influence outfit generation\n\n### ✨ AI Stylist\n- Generate complete outfits based on:\n  - **Occasion** (casual, work, date night, gym, party, etc.)\n  - **Weather** (sunny, rainy, snowy)\n  - **Temperature** (range selection)\n  - **Mood** (playful, professional, bold, minimal, etc.)\n  - **Style preference** (bohemian, classic, trendy, comfort-first, etc.)\n  - **Anchor item** (optional - style around a specific garment)\n- Interactive carousel to browse outfit combinations\n- Per-piece adjustments: swap individual items or remove them\n- Real-time outfit regeneration without full recalculation\n\n### 📸 Look Recreation\n- Upload inspiration images (Pinterest, Instagram screenshots, fashion photos)\n- AI analyzes the aesthetic, colors, vibe, and style\n- Automatically recreates looks using pieces from your wardrobe\n- Identifies gaps: what you'd need to buy to complete the look\n- Piece-by-piece matching with confidence scores\n\n### 📊 Analytics\n- Personal style insights:\n  - Most-used colors and color combinations\n  - Preferred clothing categories and fits\n  - Common occasions and seasons in your history\n  - Underutilized garments\n  - Style trends over time\n- Visual dashboard with charts\n\n### ❤️ Favorites\n- Save outfit combinations for quic"
  },
  {
    "repo_id": "build-small-hackathon/espelho-da-alma",
    "title": "Espelho Da Alma",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/espelho-da-alma",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "espelho",
      "alma",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Espelho da Alma\nemoji: 🔮\ncolorFrom: purple\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.17.3\npython_version: \"3.12\"\napp_file: app.py\npinned: true\nshort_description: Mystical mirror for the soul — oracle, journal, inner map\ntags:\n- track:wood\n- achievement:offgrid\n- achievement:offbrand\nmodels:\n- Qwen2.5-3B-Instruct\n---\n\n\n> **Track:** Thousand Token Wood &nbsp;·&nbsp; **Badges:** Off Brand (custom UI), Tiny Titan (3B model)\n>\n> 🎬 **Demo video:** https://youtu.be/QhKR7UEdr18?si=TvLM4wfVRkYFVDjK &nbsp;·&nbsp; 📱 **Social post:** https://www.linkedin.com/feed/update/urn:li:ugcPost:7472047864538411008/\n> \n> **Team:** [@Tissiany](https://huggingface.co/Tissiany) (solo build)\n\n# 🔮 Espelho da Alma · Mirror of the Soul\n\n![Espelho da Alma Banner](https://cdn-uploads.huggingface.co/production/uploads/69bff6fc6ccc4385b0ad8295/d7uQymBQKYrmFMS_PexQ9.png)\n\nA **mystical portal for self-reflection** divided into four distinct spaces. The interface is in Portuguese (where the magic truly lands for its audience), while this documentation is written in English for the judges.\n\n> *\"The mirror does not lie, but we rarely show ourselves enough for it to reveal what lies beneath the surface.\"*\n\n---\n\n## 🌌 The Four Spaces\n\n- **🪞 Espelho (Mirror)** — The home screen: features a glowing, slowly rotating orb that welcomes you into the experience.\n- **🃏 Oráculo (Oracle)** — Draw one of six handcrafted archetype cards (such as *The Inner Moon*, *The Broken Mirror*, or *Star of the Abyss*). 
Each card flips instantly to reveal a mystical symbol, element, and message, just like tarot.\n- **📓 Diário (Journal)** — **This is where the AI lives.** You can write freely in response to a \"shadow\" prompt, and the *Voice of the Soul* answers with a short, intimate reflection, streamed live token by token.\n- **🌙 Essência (Essence)** — A personalized \"energy map\" featuring five traits whose values shift dynamically each day, along with the **real current moon phase** (computed purely through astronomical formulas in Python).\n\n---\n\n## ✨ What Makes It Special\n\n- **Fully Self-Hosted Inference:** The model runs entirely inside this Space using **Hugging Face Transformers** on a free **ZeroGPU**. No OpenAI, external LLMs, or cloud APIs are called; the entire experience is completely self-contained.\n- **Bespoke UI/UX:** Built from scratch from a custom Figma design using Gradio and advanced CSS/HTML. It features an animated starfield, a spinning 3D-like CSS orb, smooth card-flipping mechanics, a gold-on-black mystical theme, and staggered section transitions—fully responsive across mobile and desktop.\n\n---\n\n## 🧠 The Craft Behind the Diário\n\nThe magic isn't the model's size—it's the prompt engineering. Three strict rules turn a tiny 3B model into a mirror that feels like it truly *sees* you:\n\n1. **Literal Echo:** The AI reuses your exact words, transformed into new contexts, never swapping them out for generic synonyms.\n2. **Concrete Imagery:** Vague psychological words are strictly banned (*\"deep\"*, *\"sensitive\"*, *\"intense\"*). The model can only communicate through physical scenes, objects, and raw textures.\n3. **Declare, Don't Analyze:** Written completely in the second person, present tense, with absolutely no hedging or safe clinical transitions.\n\n---\n\n*Built with care for the Hugging Face **Build Small** hackathon.* 🌙\n"
  },
  {
    "repo_id": "build-small-hackathon/tokenwood-the-forest-that-forgets",
    "title": "Tokenwood The Forest That Forgets",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/tokenwood-the-forest-that-forgets",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "tokenwood",
      "forest",
      "forgets",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Tokenwood\nemoji: 🌲\ncolorFrom: green\ncolorTo: gray\nsdk: gradio\nsdk_version: 5.50.0\napp_file: app.py\npinned: false\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n---\n\n# Tokenwood: The Forest That Forgets\n\nA small-model forest adventure built for the Build Small Hackathon. The player\nenters Tokenwood with a visible 1,000-token memory budget. Every action,\nclue, and preserved rule spends tokens. Preserve the right rules and the\nforest forgets the exit open.\n\n- **Live Space:** https://huggingface.co/spaces/build-small-hackathon/tokenwood-the-forest-that-forgets\n- **Demo video:** https://www.loom.com/share/3ec5e66b2c19467d9b3ce060085c4fe1\n- **Social post:** https://x.com/RajKuma39078041/status/2066598429254631905\n\nThis Space runs the fine-tuned `MiniCPM3-4B` narrator as a quantized GGUF\nmodel loaded directly from the Hugging Face Hub.\n\n## Models & datasets used\n\n- Fine-tuned adapter: [`raj23211/tokenwood-minicpm3-4b-narrator-v5`](https://huggingface.co/raj23211/tokenwood-minicpm3-4b-narrator-v5)\n- Merged fine-tuned model: [`raj23211/tokenwood-minicpm3-4b-narrator-v5-merged`](https://huggingface.co/raj23211/tokenwood-minicpm3-4b-narrator-v5-merged)\n- Quantized runtime model: [`raj23211/tokenwood-minicpm3-4b-narrator-v5-GGUF`](https://huggingface.co/raj23211/tokenwood-minicpm3-4b-narrator-v5-GGUF)\n- Dataset: [`build-small-hackathon/thousand-token-wood-traces`](https://huggingface.co/datasets/build-small-hackathon/thousand-token-wood-traces)\n- Dataset: [`build-small-hackathon/1000-Rooms-DS`](https://huggingface.co/datasets/build-small-hackathon/1000-Rooms-DS)\n\n## Fine-tune proof\n\nThe `achievement:welltuned` tag is backed by the published Tokenwood narrator\nfine-tune above. 
The Space runs the GGUF export of that fine-tuned model, while\nthe adapter and merged checkpoint remain linked for review.\n\n## Modal proof\n\nModal was used for the training/export workflow behind the published narrator\nartifacts. The project includes Modal scripts for SFT, merged-model export,\nGGUF export, and model evaluation, and the `sponsor:modal` tag is included\nbecause Modal powered development of the fine-tuned runtime model.\n\n- Modal SFT run: `training/modal_train_tokenwood.py`\n- Modal export run: `training/modal_export_gguf.py`\n- Modal evaluation runs: `training/modal_eval_hf.py`, `training/modal_eval_gguf.py`\n- Modal app run URL: https://modal.com/apps/raj315920/main/ap-qFBdc8gegbCtUkt93RQWZS\n\n## Runtime configuration\n\nThe Space reads these environment variables:\n\n- `TOKENWOOD_HF_GGUF_REPO` — GGUF repo on the Hub\n- `TOKENWOOD_HF_GGUF_FILE` — GGUF filename inside that repo\n- `TOKENWOOD_HF_TOKENIZER_REPO` — HF repo that owns the chat template\n- `TOKENWOOD_CTX` — llama.cpp context size (default 2048)\n- `TOKENWOOD_GPU_LAYERS` — number of layers to offload to GPU (default 0)\n\nFor best performance, upgrade the Space to a GPU hardware tier and set\n`TOKENWOOD_GPU_LAYERS=-1`.\n\n## Submission checklist\n\n- [x] Space shipped under the `build-small-hackathon` org\n- [x] Demo video recorded and linked above\n- [x] Social post published and linked above\n- [x] README tagged with chosen tracks/badges via the [Build Small submission tool](https://build-small-hackathon-field-guide.hf.space/submit)\n\n## OpenAI Codex proof\n\nThis Space includes a Codex-attributed git commit for the OpenAI sponsor track.\nThe supporting local repository also includes Codex-authored commits:\n\n- `97648471c0b07a5f51ba0a5e6f513048de30c047`\n- `e55e74c036dae981141d679e6fad24e1d21f8828`\n"
  },
  {
    "repo_id": "build-small-hackathon/neural-hyrule",
    "title": "Neural Hyrule",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/neural-hyrule",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "track",
      "neural",
      "hyrule",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Neural Hyrule\nemoji: 🗡️\ncolorFrom: green\ncolorTo: green\nsdk: docker\npython_version: '3.11'\npinned: false\nlicense: mit\nshort_description: A playable ONNX world model trained on Ocarina of Time\ntags:\n  - backyard-ai\n  - thousand-token-wood\n  - best-use-of-codex\n  - off-brand\n  - tiny-titan\n  - best-demo\n  - bonus-quest-champion\n  - judges-wildcard\n  - track:backyard\n  - track:wood\n  - sponsor:openai\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Neural Hyrule\n\n**Walk through the model's memory of Kokiri Forest.**\n\nBy [Jordy Kieto](https://www.linkedin.com/in/jordy-kieto/)\n\n[Watch the demo on YouTube](https://youtu.be/FMxfCFFjPOQ) · [View the project post on X](https://x.com/MoorishBard/status/2066324705813168196?s=20) · [View the project post on LinkedIn](https://www.linkedin.com/posts/jordy-kieto_ai-machinelearning-gameai-share-7472097073576960001-fdsD/?utm_source=share&utm_medium=member_desktop&rcm=ACoAACJOCIQBXbqS4o3FfEmes5tTzw8z9C_Kxcw)\n\n[Read the condensed Codex agent trace](AGENT_TRACE.md)\n\nNeural Hyrule is a playable latent world model trained on footage and controls from *The Legend of Zelda: Ocarina of Time*. Instead of streaming a game or running an emulator, it predicts the next moment of the world from its recent latent history and the player's input. The result is a compressed, dreamlike walk through Kokiri Forest that runs locally in the browser.\n\nThe larger idea is to explore a new way to port game experiences: package learned behavior and visuals into small, accessible models that can be opened almost anywhere. 
The same approach could support rapid game prototyping, learned test environments, and interactive previews without shipping a complete engine or original game assets.\n\n## Tracks\n\n### Backyard AI\n\nNeural Hyrule is a local, practical demonstration of portable game simulation. All inference runs on hardware the player owns, directly in the browser, making the experience easy to share, study, and build on without a hosted GPU or remote inference service.\n\n### Thousand Token Wood\n\nA familiar forest becomes an AI-native place to wander. The model does not replay recorded frames: it continuously imagines what should happen next as the player steers, producing a strange and playful version of Kokiri Forest from compressed learned memory.\n\n## Sponsor Prize\n\n### Best Use of Codex\n\nCodex was used to implement and iterate on the model architecture and training flow, then help carry the trained system into a browser-ready ONNX application. It supported the path from architecture code and training mechanics through model export, runtime integration, and the custom interactive demo.\n\n## Bonus Badges\n\n### Off Brand · $1,500\n\nThe experience goes beyond the default Gradio interface with a custom game-like UI, responsive canvas renderer, live performance HUD, keyboard controls, touch joystick, sample selection, and gamepad support. Gradio and FastAPI serve the app while the player-facing experience is purpose-built HTML, CSS, and JavaScript.\n\n### Tiny Titan · $1,500\n\nThe complete exported model set is 64.09 MiB with 16,761,117 parameters, comfortably below the 4B-parameter limit. It turns small weights into a continuously playable visual world rather than a single prediction or static generation.\n\n### Best Demo · $1,000\n\nThe project is designed to communicate itself immediately: open the page, enter Kokiri Forest, and steer. 
The interactive app, visual identity, and simple premise make the underlying world-model research tangible without requiring a technical setup walkthrough.\n\n### Bonus Quest Champion · $2,000\n\nNeural Hyrule combines local inference, a genuinely small model, a fully custom interface, an interactive demo, browser deployment, and a playful use of learned simulation in one submission.\n\n### Judges' Wildcard · $1,000\n\nNeural Hyrule sits between game preservation, compression, generative simulation, and development tooling. It is an unusual proof of concept for carrying the feel of an interactive world in model weights.\n\n## How It Works\n\n| Model | Parameters | ONNX size |\n| --- | ---: | ---: |\n| VAE encoder | 507,272 | 1.94 MiB |\n| World transition | 12,985,653 | 49.66 MiB |\n| VAE decoder | 3,268,192 | 12.49 MiB |\n| **Total** | **16,761,117** | **64.09 MiB** |\n\n1. A VAE encoder converts a selected starting frame into a compact spatial latent.\n2. The world-transition model reads up to 16 recent latent states and 26-dimensional controller inputs to predict the next latent state.\n3. A VAE decoder reconstructs that state as the next 580 × 429 frame.\n4. ONNX Runtime Web repeats this loop with a 30 FPS target, running on WASM entirely in the player's browser.\n\nThe Python layer is intentionally thin: FastAPI and Gradio serve the page, model files, sample frames, and required browser runtime headers. No gameplay frames or controls are sent to a backend during inference.\n\n## Run Locally\n\n```sh\npy"
  },
  {
    "repo_id": "build-small-hackathon/fugee",
    "title": "Fugee",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/fugee",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "sponsor",
      "achievement",
      "fugee",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Fugee\nemoji: 🏠\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.15.2\napp_file: app/app.py\npinned: false\nlicense: mit\nshort_description: Agentic AI guidance for displaced people, on a small LLM\ntags:\n  - track:backyard\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n---\n\n<!-- The block above is Hugging Face Space metadata (required for the Space to\n     build). The hackathon submission tool appends track/badge tags to it. -->\n\n<div align=\"center\">\n\n# 🏠 Fugee\n\n**Safe guidance for people on the move.**\n\nAn agentic AI assistant for displaced people, asylum seekers, and refugees —\npowered by a small LLM, the **LFM2.5 8B-parameter** model.\n\n🎥 **[Watch the demo](https://www.youtube.com/watch?v=PYGzJZj7LfM)** · 📣 **[Launch post](https://x.com/heldernoid/status/2066252940940067178)** · 💻 **[GitHub](https://github.com/heldernoid/fugee)**\n\n👤 **Team (solo):** [@helmo](https://huggingface.co/helmo)\n\n</div>\n\n---\n\n## What it is\n\nFugee conducts a calm, structured, multilingual interview, reasons about the\nperson's situation against international refugee law (the 1951 Refugee\nConvention and the 1969 AU Convention), recommends realistic destination\ncountries, and generates a personalised documentation package they can download\nand edit.\n\nIt is a **single-process Gradio web app** backed by a **pure-Python agent loop**\n(`agent/loop.py`, ported from pi-agent-core's patterns) and the **`lfm2.5:8b`**\nmodel served by Ollama. No Node.js, no microservices, no external database.\n\n> **This Space** runs the Gradio UI on free CPU and calls the LLM (`lfm2.5:8b`)\n> and embeddings (`nomic-embed-text`) on a GPU **Ollama** endpoint hosted on\n> [Modal](https://modal.com) — so the same code and the same small model run\n> unchanged, just on rented GPU. 
See [`deploy/DEPLOY.md`](deploy/DEPLOY.md).\n\nThe design point: *a genuinely useful agentic product running on a small model.*\nThe interview is fully **deterministic** (fixed questions and controls,\nhand-translated into 10 languages) and the LLM is used only where it adds real\nintelligence — the legal **assessment**, the document **drafting**, and the\nspoken-back **review summary**.\n\n### The five phases\n\n1. **Intake** — language selection + a calm welcome.\n2. **Interview** — a fixed, deterministic question flow (current/origin country,\n   what happened, persecution grounds, danger, documents, languages, goals).\n3. **Assessment** — the agent reasons openly about the case: classifies it\n   (refugee / broader protection / statelessness / economic), names the\n   Convention ground, gauges risk, and ranks destinations. Grounded in curated\n   country data and the UNHCR Handbook & Guidelines (RAG) — **not** the open web.\n4. **Recommendations** — 2–3 country cards with real UNHCR/processing data and a\n   step-by-step roadmap. Economic (non-protection) cases get honest **work-route**\n   guidance instead of a doomed asylum claim.\n5. **Documents** — an LLM-drafted, editable **Word (.docx) + PDF** package,\n   branded and laid out with bundled fonts (fully offline).\n\n---\n\n## Requirements\n\n- **Python ≥ 3.10**\n- **[Ollama](https://ollama.com)** running somewhere you can reach (local or LAN),\n  with:\n  - **`lfm2.5:8b`** — the tool-calling instruct model the app uses, and\n  - **`nomic-embed-text`** (used to build the UNHCR-guidelines search index).\n- A few hundred MB of disk for the Python deps and the (regenerable) RAG index.\n\n> No Node.js / npm anywhere — Fugee is pure Python.\n\n---\n\n## Quick start\n\n```bash\n# 1. Clone and enter the repo\ngit clone https://github.com/heldernoid/fugee\ncd fugee\n\n# 2. 
Create a virtualenv and install deps  (uv recommended; plain venv also fine)\nuv venv && source .venv/bin/activate          # or: python -m venv .venv && source .venv/bin/activate\nuv pip install -r requirements.txt            # or: pip install -r requirements.txt\n\n# 3. Configure the model + host\ncp .env.example .env\n#   then edit .env:  set OLLAMA_HOST and MODEL_ID to what your Ollama actually has\n\n# 4. Pull the models on your Ollama host (skip any you already have)\nollama pull lfm2.5:8b          # or your chosen ≤32B instruct model\nollama pull nomic-embed-text   # embeddings for the guidelines RAG index\n\n# 5. Build the UNHCR-guidelines search index (one-time; regenerable, gitignored)\npython data/scripts/build_guidelines_index.py\n\n# 6. Run the app\npython app/app.py\n```\n\nOpen **http://localhost:7860** in a browser. (The server binds `0.0.0.0:7860`, so\nit's reachable from other machines on your network too.)\n\n### Configuration (`.env`)\n\nRead at startup by `app/config.py` (no `python-dotenv` dependency):\n\n| Variable        | Meaning                                                        | Example |\n|-----------------|----------------------------------------------------------------|---------|\n| `OLLAMA_HOST`   | Base URL of the Ollama server (local, LAN, or Modal endpoint)  | `http://127.0.0.1:11434` |\n| `MODEL_ID`      | The single ≤32B tool-calling instruct model for the whole app  | `lfm2.5:8b` |\n| `MODEL_PROVID"
  },
  {
    "repo_id": "build-small-hackathon/DoodleBook",
    "title": "Doodlebook",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/DoodleBook",
    "zone": "Creative Studio",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "doodlebook",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Casual",
    "quest": "🎨 Creative Studio: Generate one creative output.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎨",
    "readme_text": "---\ntitle: DoodleBook\nemoji: 📚\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\nsdk_version: \"6.18.0\"\napp_file: app.py\npinned: false\nlicense: apache-2.0\ntags:\n  - hackathon\n  - build-small\n  - adventure-in-thousand-token-wood\n  - gradio\n  - flux\n  - minicpm\n  - voxcpm\n  - storybook\n  - coloring-book\n  - voice-cloning\n  - children\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\nmodels:\n  - black-forest-labs/FLUX.2-klein-4B\n  - openbmb/MiniCPM5-1B\n  - openbmb/VoxCPM2\n---\n\n# DoodleBook 📚🖍️\n\n**A child draws a character → DoodleBook turns it into a narrated, illustrated crayon storybook *and* a matching printable coloring book — with their own voice if they want.**\n\nBuilt for the **Build Small Hackathon 2026 · Adventure in Thousand Token Wood**. Every model is under 32B; the reasoning stack (story + voice) is just ~3B.\n\n> Open the Space and a real sample book loads instantly. 
Then draw your own character, upload it, and watch your hero come to life across six pages.\n\n---\n\n## ✅ Pre-flight Checklist\n\n| Requirement | Status | Notes |\n|---|---|---|\n| **Stay under 32B** | ✅ | MiniCPM5-1B (1B) + VoxCPM2 (2B) + FLUX.2-klein-4B (4B) = **7B total** |\n| **Ship a Gradio app** | ✅ | Gradio 6 Space in the Build Small org |\n| **Record a demo** | ✅ | [MP4 demo](docs/demo-doodlebook.mp4) + [Supademo walkthrough](https://app.supademo.com/demo/cmqfkwlro4f4wqmgj218kxnqp?utm_source=link) |\n| **Post on social media** | ✅ | [X/Twitter post](https://x.com/sushruthsgowda/status/2066639063168225452?s=46) |\n| **GPU limit (≤10 ZeroGPU)** | ✅ | 1 ZeroGPU Space |\n\n---\n\n## 🔗 Links\n\n| | |\n|---|---|\n| 🚀 **Live Space** | [huggingface.co/spaces/build-small-hackathon/DoodleBook](https://huggingface.co/spaces/build-small-hackathon/DoodleBook) |\n| ▶️ **Demo video** | [MP4 demo](docs/demo-doodlebook.mp4) · [Supademo walkthrough](https://app.supademo.com/demo/cmqfkwlro4f4wqmgj218kxnqp?utm_source=link) |\n| 📣 **Social post** | [X/Twitter announcement](https://x.com/sushruthsgowda/status/2066639063168225452?s=46) |\n| 📝 **Project article** (complete technical paper) | [docs/article.md](docs/article.md) |\n| 🔬 **Field Notes** (technical deep-dive) | [docs/blog.md](docs/blog.md) |\n| 💻 **Source code** | [github.com/Sushruths04/Doodle-book](https://github.com/Sushruths04/Doodle-book) |\n| 🤖 **MiniCPM5-1B** | [huggingface.co/openbmb/MiniCPM5-1B](https://huggingface.co/openbmb/MiniCPM5-1B) |\n| 🔊 **VoxCPM2** | [huggingface.co/openbmb/VoxCPM2](https://huggingface.co/openbmb/VoxCPM2) |\n| 🎨 **FLUX.2-klein-4B** | [huggingface.co/black-forest-labs/FLUX.2-klein-4B](https://huggingface.co/black-forest-labs/FLUX.2-klein-4B) |\n| 🗣️ **Kannada TTS fine-tune** | [huggingface.co/sush0401/IndicF5-Kannada-Bedtime-v2](https://huggingface.co/sush0401/IndicF5-Kannada-Bedtime-v2) |\n\n---\n\n## ✨ What it does\n\n1. **Draw & upload** a character (upload or webcam snap).\n2. 
**Story** — MiniCPM5-1B writes a complete six-page story with a consistent hero and a real emotional arc.\n3. **Illustrations** — FLUX.2-klein renders each page so that **your character stays consistent** across all pages (built from *your* drawing via img2img).\n4. **Narration** — VoxCPM2 reads the whole book aloud in the child's choice of voice — kid, storyteller, grandpa, or **their own cloned voice** from an uploaded reference recording.\n5. **Coloring book** — a matching black-and-white line-art version to print and color.\n6. **Downloads** — one-tap **Story PDF** and **Coloring PDF** (works on mobile too).\n\n### What makes a story good\n\nDoodleBook uses a structured prompting system to guarantee quality:\n\n- **Few-shot exemplar**: A full 6-page story is shown as a gold example so MiniCPM learns the exact format and richness required.\n- **Story arc rules**: Pages 1–2 introduce the hero and challenge; middle pages build it; final pages resolve and teach a clear lesson.\n- **2–3 rich sentences per page**: Every page uses sensory detail — colours, sounds, textures, feelings — not just plot.\n- **10 diverse themes** covering kindness, imagination, friendship, courage, identity, and adventure.\n- **Continuity rules**: Each page flows from the one before — the story reads as one coherent arc, not isolated scenes.\n- **Six-page pacing**: Every story follows a compact picture-book arc designed for the live generation budget.\n\n---\n\n## 🧠 Models & the \"Tiny Titan\" argument\n\n| Role | Model | Params | Sponsor | What it does |\n|---|---|---|---|---|\n| 📖 Story writer | `openbmb/MiniCPM5-1B` | **1B** | **OpenBMB** | Writes the full story as structured JSON — title, character description, per-page text and scene descriptions |\n| 🔊 Voice narrator | `openbmb/VoxCPM2` | **2B** | **OpenBMB** | Reads the story aloud with voice design prefixes; optionally clones the user's own voice via reference audio |\n| 🎨 Illustrator | `black-forest-labs/FLUX.2-klein-4B` | **4B** | 
**Black Forest Labs** | Renders each page "
  },
  {
    "repo_id": "build-small-hackathon/focus-buddy",
    "title": "Focus Buddy",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/focus-buddy",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "focus",
      "buddy",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Focus Buddy\nemoji: 🏃\ncolorFrom: blue\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\npinned: false\nshort_description: A gamified productivity buddy to help you get things done!\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n## Demo Video\n\nYouTube: https://www.youtube.com/watch?v=99koeTHcSJ0\n\n## Social Media Posts\n\nX: https://x.com/pocanman/status/2066341277260484734\n\nLinkedIn: https://www.linkedin.com/posts/marc-anthony-polican_ai-huggingface-hackathon-ugcPost-7472113945403785216-2Yn3\n\n## Team Usernames\n\npocanman\n\n\n\nCheck out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference"
  },
  {
    "repo_id": "build-small-hackathon/storybook_test",
    "title": "Storybook_Test",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/storybook_test",
    "zone": "Creative Studio",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "storybook_test",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Casual",
    "quest": "🎨 Creative Studio: Find one surprising or delightful result.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎨",
    "readme_text": "---\ntitle: Sozai\nemoji: 🎨\ncolorFrom: yellow\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\npinned: false\npreload_from_hub:\n  - openbmb/MiniCPM-V-4-gguf ggml-model-Q4_K_M.gguf,mmproj-model-f16.gguf\n  - Falconsai/nsfw_image_detection\n  - nvidia/nemotron-3.5-asr-streaming-0.6b\n  - black-forest-labs/FLUX.2-klein-4B\n  - omarirfa/sozai_lora\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\nTeammates: Shereen Lee (HF username: reversely), Omar Irfan Khan (HF username: omarirfa)\nSocial media post: https://x.com/Oi_Khan/status/2066656493995208755?s=20\n\n## Introduction \nRef: Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference\n \n**Sozai** (素材 — \"material\" / \"ingredients\") is a real-time collaborative photo-album app where people **build memories together**. Spin up a room, share a short code, and a small group uploads photos, captions them, develops them into watercolour scenes, and flies through them on a 3D globe — all synced live across everyone in the room. It also works solo, with no room required.\n \nThe name is the thesis: your photos, voice, and captions are the raw *sozai*; the app's job is to turn them — together, in real time — into something that feels like a shared memory.\n \n---\n \n## Features\n \nThese are the features that are actually wired up and used in the app:\n \n### Collaboration\n- **Hybrid rooms.** Every room has a secure internal `room_id` (uuid4) plus a short, human-friendly **share code whose length the user picks (4–12 characters)** right on the splash screen. 
The code alphabet is deliberately unambiguous uppercase — no `0/O/1/I/L` — so codes are easy to read aloud.\n- **Three ways in.** *Make a room*, *join by code*, or *continue without a room* (solo mode at `/app`).\n- **Live presence & cursors.** Participants, names, and colour-coded live cursors are streamed to everyone in the room.\n- **Shared chat, selection, and captions.** Selection state, chat, and caption edits propagate to all members so the album is edited together.\n- **Single-flight AI locking.** Heavy AI actions (caption, develop, flythrough) take a per-room lock so two people don't fire the GPU at once. Locks auto-expire if a holder disappears, so a room never gets stuck — the AI lock is short-lived, while the 3D \"journey\" lock lives longer (it can run for a while) and refreshes itself while the driver streams frames.\n- **Owner approval + capacity.** New joiners can require host approval, and rooms enforce a small capacity (a reconnecting session keeps its seat).\n- **Self-cleaning storage.** A background thread marks rooms idle after ~1h and deletes idle/old rooms (~24h hard cap) so the database reflects live usage and stays lean.\n### Memory-building\n- **Photo uploads** with an NSFW safety gate on the way in.\n- **Auto-caption, auto-title, and auto-tags** — each grounded in what the photo actually shows, not just the filename.\n- **Voice captions** via streaming speech-to-text, with automatic language detection across 40 locales.\n- **\"Develop\" into watercolour** — a *darkroom* img2img transform that re-paints a photo as a watercolour scene, like pulling a print.\n- **3D globe flythrough** — a Cesium-powered map \"journey\" that takes the room on a guided flythrough of the album's places, with a synced \"driver\" so everyone watches together.\n- **Map timeline** — a chronological view that lays the album out in date order and can fly the timeline place by place.\n- **Record the journey** — capture the flythrough to a downloadable **WebM** 
clip.\n- **Export to PDF** — turn the finished album into a real scrapbook **PDF** (jsPDF), downloaded directly in the browser, with a print fallback.\n### Maps & offline resilience\n- **Self-hosted map stack.** MapLibre GL JS/CSS is **vendored to the app's own origin** (fetched once from a CDN, cached to disk, then inlined into the page) so the map works even on locked-down or offline networks as long as the *server* has outbound internet — with in-page CDN fallbacks if not.\n- **Pluggable basemaps & 3D.** Cesium ion (3D globe) and Stadia Maps are wired via server-side tokens exposed through `/api/config`, so keys rotate in one place and never get hard-coded into the page.\n### Easter egg\n- **Pet sprites.** An optional oneko-style animated pet (the 2023 icon library — 76 animals, 32 sprite frames each) can follow the cursor, served from `/sprites/...`.\n### Observability\n- **Self-hosted Arize Phoenix.** Tracing runs in-process and the Phoenix UI is reverse-proxied same-origin under `/phoenix` so it embeds cleanly in an iframe (frame-blocking headers stripped on the way back).\n- **In-app trace view.** A lightweight `/api/trace` endpoint summarizes every model call: counts, error rate, latency `avg / p50 / p95 / p99`, token totals, tokens/sec, and a per-oper"
  },
  {
    "repo_id": "build-small-hackathon/piper-assistant",
    "title": "Piper Assistant",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/piper-assistant",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "backyard",
      "piper",
      "assistant",
      "summary"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Paperain Studio — Sticker Restock Manager\nemoji: 📝\ncolorFrom: yellow\ncolorTo: pink\nsdk: gradio\nsdk_version: \"6.14.0\"\napp_file: app.py\npinned: false\nshort_description: Piper turns messy store POs into restock workflows.\ntags:\n- track:backyard\n- sponsor:openai\n- sponsor:nvidia\n- sponsor:modal\n- achievement:offgrid\n- achievement:offbrand\n- achievement:llama\n- achievement:sharing\n- achievement:fieldnotes\n- track:backyard\n- achievement:offbrand\n- achievement:bestdemo\n- gradio\n- build-small-hackathon\n- backyard ai\n- backyard-ai\n- off brand\n- off-brand\n- best demo\n- best-demo\n- judges wildcard\n- judges-wildcard\nmodels:\n- Qwen/Qwen2.5-7B-Instruct\n---\n\n# Paperain Studio — Sticker Restock Manager\n\n**Piper** is an AI desk assistant for [Paperain Studio](https://paperainstudio.com) — a real sticker business in Yogyakarta, Indonesia selling through **25 partner stores** and **150 designs**.\n\nEvery month, stores send restock orders as messy Excel copy-pastes, PDFs, or WhatsApp screenshots (Indonesian, English, or both). My family used to spend **3+ hours** manually typing data into spreadsheets. Piper fixes that.\n\n**Try it:** [Live Space](https://huggingface.co/spaces/build-small-hackathon/piper-assistant) · [Demo video](#demo-video) *(coming soon)* · [Blog post](https://www.paperainstudio.com/blog/how-we-built-piper-ai-build-small-hackathon) · [Social post](#social-post) *(coming soon)*\n\n## TL;DR for Judges\n\n- **Track — Backyard AI:** Built for our own family sticker business. 25 real partner stores across Java send monthly restock orders in 25 different formats — Excel, WhatsApp, mixed Indonesian/English. Piper turns that chaos into structured workflows my family can actually use.\n- **Idea:** Paste any store PO → review parsed products → aggregate demand vs home stock → calculate A3 print sheets → generate bilingual delivery docs. 
Hours of spreadsheet work, compressed into a form-like Gradio app.\n- **Tech:** Python 3.11 · Gradio 6.x · pandas · **Qwen2.5-7B-Instruct** via Hugging Face Inference API (7B params, well under the 32B cap) · 3-stage parsing pipeline (normalize → JSON extract → catalog anchor) · rule-based fallback when the API is cold.\n- **Off Brand:** Custom light cream UI matching paperainstudio.com — not stock Gradio defaults.\n- **Best Demo:** Demo video and social post linked below *(placeholders until published)*.\n- **Judges' Wildcard:** Real small-business ops tool that doesn't fit a neat category — part parser, part print calculator, part bilingual doc generator.\n\n## Submission Links\n\n| Item | Link |\n|------|------|\n| Live Space | https://huggingface.co/spaces/build-small-hackathon/piper-assistant |\n| Demo video | https://youtu.be/ivkLCgZEw20 |\n| Blog post (Field Notes) | https://www.paperainstudio.com/blog/how-we-built-piper-ai-build-small-hackathon |\n| Social post | https://www.instagram.com/reel/DZlcRlSB52J/?igsh=N244aTVvOHZpMGt6 |\n\n### Demo Video\n\nhttps://youtu.be/ivkLCgZEw20\n\n### Social Post\n\nwe’re building Piper, a small AI assistant specially built to automate some of our tasks in our small business. with Piper, we can save 3-4 hours per week and ease the workload of our small team. 
\nhttps://www.instagram.com/reel/DZlcRlSB52J/?igsh=N244aTVvOHZpMGt6\n\n## The Problem\n\n- 25 stores, each with their own PO format and best sellers\n- Mixed-language, informal orders (\"stiker kucing hologram 20 pcs\")\n- Manual spreadsheet work that's hard for non-tech-savvy family members\n\n## The Solution\n\nPowered by **Qwen2.5-7B** (7 billion parameters — well under the 32B hackathon limit):\n\n| Tool | What it does |\n|------|-------------|\n| **PO Intake** | Paste any PO format → structured product/qty table |\n| **Stock & Demand** | Aggregate orders vs home inventory, show shortages |\n| **Print Calculator** | A3 sheet math (8 A5 stickers per sheet) |\n| **Delivery Docs** | Bilingual packing lists (EN / ID) |\n| **Best Sellers** | Demand-based recommendations for partner stores |\n\n## Why a Small Model?\n\nParsing `\"stiker kucing 20 pcs\"` into `{product, quantity}` is **structured extraction** — not creative writing. A 7B model handles this perfectly:\n\n- Runs on a laptop (no GPT-4 API needed)\n- Zero cost per call\n- Store data stays private\n- Fast enough for monthly restock workflows\n\n**7 billion parameters. 25 real stores. 3 hours saved every month.**\n\n## How to Run\n\n```bash\npip install -r requirements.txt\npython app.py\n```\n\nSet `HF_TOKEN` for Hugging Face Inference API, or run locally with Ollama (`FORCE_HF=0`).\n\nBuilt for the [Build Small Hackathon 2026](https://huggingface.co/build-small-hackathon) · **Backyard AI** track.\n\n## Hackathon Tags\n\n| Prize / Badge | Status | Why we hope to qualify |\n| --- | --- | --- |\n| Backyard AI | **Entered** | Real problem for a real family business — 25 stores, actual monthly PO chaos. |\n| Off Brand | **Targeted** | Custom Paperain-branded Gradio UI with warm cream palette, not default components. |\n| Best Demo | **Targeted** | Demo video and social post placeholders above — full pac"
  },
  {
    "repo_id": "build-small-hackathon/medilens-local",
    "title": "Medilens Local",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/medilens-local",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "medilens",
      "local",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: MediLens\nemoji: 💊\ncolorFrom: blue\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Offline, multilingual medicine-label helper\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n---\n\n# MediLens: Medicine Label Helper\n\nMediLens helps someone read a medicine label and understand, in plain language\nand in their own language, what the medicine is commonly used for.\n\nThis Space runs the **full MediLens app** exactly as designed. Because a free\nHugging Face Space has **no GPU**, the two local AI models and the robot cannot\nrun here:\n\n- **MiniCPM-V 4.6** (OpenBMB) - vision OCR that reads labels from a photo.\n- **Tiny Aya Global (3B)** (Cohere / Cohere Labs) - rewrites and translates the\n  explanation at a 14-15 year-old reading level.\n- **Reachy Mini (Hugging Face)** - a hands-free, multilingual voice assistant\n  (offline speech via faster-whisper / Whisper, Kokoro, and Piper).\n\nWhat works fully on this hosted demo: **medicine lookup** against a local\n200-medicine database and **offline multilingual explanations** (English,\nFrench, German, Italian, Romanian, Spanish). The AI-model and Reachy controls\nare shown so the interface matches the desktop app; a note in the app explains\nthey need a local GPU.\n\nTo experience everything (vision OCR, AI translation, and the Reachy Mini robot),\nrun MediLens on your own computer from the GitHub repository, or watch the demo\nvideo.\n\n## Links\n\n- **Demo video:** https://youtube.com/shorts/nUSK8DPznm4\n- **GitHub repository:** https://github.com/nganea/MediLens-Local\n- **LinkedIn post:** https://www.linkedin.com/posts/natasa-ganea_medilens-offline-ai-medicine-helper-with-share-7472309911658610689-yFtq/\n\n## Safety\n\nInformational only. 
MediLens does not give dosage instructions, does not tell\nanyone to take a medicine, and does not confirm a medicine is safe for a\nspecific person. Always check with a pharmacist or doctor.\n\n## Credits\n\nModels: **MiniCPM-V 4.6** by OpenBMB and **Tiny Aya Global** by Cohere / Cohere\nLabs. Robot: **Reachy Mini** by Hugging Face. Offline speech: faster-whisper\n(OpenAI Whisper), Kokoro, and Piper. Local text OCR uses Tesseract; name\nmatching uses rapidfuzz.\n\nThis app was developed with the help of **OpenAI Codex** and **Anthropic Claude**.\n"
  },
  {
    "repo_id": "build-small-hackathon/professor-pip",
    "title": "Professor Pip",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/professor-pip",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "professor",
      "pip",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 6,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Professor Pip — Kids Learning Avatar\nemoji: 🎓\ncolorFrom: yellow\ncolorTo: blue\nsdk: gradio\nsdk_version: \"5.38.0\"\npython_version: \"3.10\"\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: A playful 3D avatar that teaches kids 5-min courses by voice\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Professor Pip — Playful Kids Learning Avatar\n\nA friendly 3D avatar teacher for children aged 5–10. Pip delivers short, playful\n5-minute courses — premade science/math/story lessons plus a \"make your own\nlesson\" box — and talks back by voice. The 3D avatar renders **in the visitor's\nbrowser** (TalkingHead + Three.js, ~60 fps, zero GPU quota); the Gradio Space\norchestrates four short stateless calls and enforces all child-safety, while the\nheavy models run on one scale-to-zero **Modal** GPU container.\n\n| Endpoint | Model | Returns |\n|---|---|---|\n| `/asr` | faster-whisper small (244M, int8) | transcript string |\n| `/brain` | **fine-tuned MiniCPM5-1B** — GGUF via llama.cpp on Modal | `{\"text\",\"mood\",\"gesture\",\"raw\"}` |\n| `/speak` | Kokoro-82M | `{\"audio_b64\",\"words\",\"wtimes\",\"wdurations\"}` |\n| `/make_course` | brain + safety gate + template fallback | course JSON or `{\"rejected\",\"message\"}` |\n\n## 🏕️ Build Small Hackathon — Backyard AI\n\n**Idea.** Professor Pip is a voice-first **study tutor for kids 5–10** — a friendly 3D\navatar that teaches ten playful 5-minute courses, builds a brand-new lesson on any\ntopic you ask for, and answers spoken \"why?\" questions in character. It's a practical\n*\"personal study tutor\" / \"storybook for a child\"* you can sit a kid in front of —\nrunning on a tiny model you own, not someone's API. 
The storybook charm (a talking\navatar, stickers, confetti, a keepsake certificate) is what keeps a five-year-old\ncoming back.\n\n**Demo video:** [YouTube](https://youtu.be/Hh_v5jkdHfw) · **Social post:** [X / Twitter](https://x.com/ratandeep/status/2066636433411215472)\n\n**Tags above, and why:**\n- `track:backyard` — a practical, voice-first study tutor for a child, running on a tiny model you own.\n- `sponsor:openbmb` — the brain is a **fine-tuned MiniCPM5-1B** ([LoRA](https://huggingface.co/build-small-hackathon/professor-pip-minicpm5-1b-lora) · [GGUF](https://huggingface.co/build-small-hackathon/professor-pip-minicpm5-1b-gguf)).\n- `sponsor:modal` — **trained and served on Modal** (LoRA SFT on an A10; GGUF served via llama.cpp).\n- `achievement:welltuned` — that fine-tuned model is published on the Hub.\n- `achievement:offbrand` — a fully custom WebGL/TalkingHead frontend, not the default Gradio look.\n- `achievement:llama` — the brain runs through the **llama.cpp** runtime.\n- `achievement:sharing` — a synthetic [course-trace dataset](https://huggingface.co/datasets/build-small-hackathon/professor-pip-traces) is published for others to learn from.\n- `achievement:fieldnotes` — a build write-up / [blog post](https://huggingface.co/blog/build-small-hackathon/field-notes-building-professor-pip).\n\n**How it's built.** The brain is a MiniCPM5-1B teacher LoRA fine-tuned on ~2k\nin-voice examples to lock Pip's spoken register and the `{text,mood,gesture}`\nJSON contract; merged, quantized to GGUF (Q4_K_M/Q8_0), and served via llama.cpp\non Modal. Lesson flow is a deterministic browser state machine (no agent loop);\npremade segments are spoken verbatim, and `/brain` handles raise-hand questions +\nmake-your-own. Child-safety is deterministic and **server-side** (non-bypassable).\n\n## How a lesson works\n\nThe lesson is a **state machine in the browser** (`frontend.html`); the four\nendpoints stay stateless. 
Premade course segments are spoken **verbatim** via\n`/speak` (they're authored in Pip's voice — fast and reliable). `/brain` is used\nonly for (a) a child's spoken \"raise hand\" question during a lesson and (b)\nmake-your-own course generation. Lipsync contract: `/speak` returns word timings\nin ms; the frontend calls `head.speakAudio({audio, words, wtimes, wdurations})`.\n`mood`/`gesture` drive `head.setMood()` / `head.playGesture()`.\n\n## Files\n\n```\napp.py          # endpoints, persona, lazy model loaders, /make_course, Gradio wiring\npip_core.py     # pure logic: safety gate + course schema (stdlib only, unit-tested)\nfrontend.html   # kids UI + browser lesson engine (injected into a srcdoc iframe)\ncourses/*.json  # the course library (skip files starting with \"_\")\npersona/        # Professor Pip persona bible\ntest_app.py     # unit tests for pip_core (no model downloads)\nrequirements.txt, packages.txt\navatar.glb      # optional committed avatar (auto-detected; falls back if absent)\n```\n\n## Courses\n\nEach course is one JSON file in `courses/`:\n\n```json\n{\n  \"id\": \"sky-blue\",\n  \"title\": \"Why Is the Sky Blue?\",\n  \"emoji\": \"🌈\",\n  \"age_band\": \"5-8\",\n  \"subject\": \"science\",\n  \"segments\": [\n    {\"say\": \"...\", \"mood\": \"happy\", \"gesture\": \"index\", \"quiz\": null},\n    {\"say\": \""
  },
  {
    "repo_id": "build-small-hackathon/brad-did-something",
    "title": "Brad Did Something",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/brad-did-something",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "brad",
      "did",
      "something",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Brad Did Something\nemoji: 🚀\ncolorFrom: pink\ncolorTo: yellow\nsdk: docker\napp_port: 7860\npinned: false\nlicense: mit\nshort_description: Argue your way to $1M before the quarter ends\ntags:\n  - track:wood\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# 🚀 Brad Did Something\n\n![Brad Did Something](docs/media/cover.gif)\n\n> *Corporate bureaucracy is already a chaotic simulation — I just replaced middle management with an LLM to see if the company could survive the quarter.*\n\nA 2D top-down **office comedy game** where every line your coworkers say, every\nunhinged decision, and every dollar of revenue is **generated live by an LLM**.\nYou're the new Head of Sales at Veloura Technologies; your five underlings are\nenthusiastic, well-meaning, and completely off the rails. Survive **15 workplace\ncrises** and one fiscal quarter, and hit **$1,000,000** — by literally *arguing\nyour way there*.\n\n**🎮 [Play it now](https://huggingface.co/spaces/build-small-hackathon/brad-did-something) · ▶️ [Watch the trailer](https://youtu.be/BJSE5WDZvPs)**\n\n![Gameplay](docs/media/gameplay.gif)\n\n## Not just a chatbot in a trenchcoat\n\n- **🧠 The AI is load-bearing.** NPC dialogue isn't decoration — through strict\n  JSON-schema validation the model's output *directly drives the game economy*.\n  Talk Brad into rescuing a deal and revenue jumps; offend him and the pipeline\n  collapses. 
The server hides the morale & relationship scores, so the model\n  moves them without ever seeing (or gaming) them.\n- **🎨 Comics on the fly.** Each crisis, the text model writes a scene and a\n  second GPU renders a wordless **FLUX.2 [klein] 4B** comic panel that drops over\n  the office — the caption is drawn crisp by the UI (FLUX can't spell, so words\n  stay out of the image).\n- **🕹️ A genuinely custom frontend.** A hand-built HTML5-canvas + DOM pixel-art\n  game served through **`gr.Server`** — zero default Gradio widgets, desktop **and**\n  touch.\n- **⚙️ Small models, tight plumbing.** Qwen3.5-9B (llama.cpp) for text, FLUX.2-\n  klein-4B for art, both on Modal GPUs, JSON-grammar-locked with a one-shot retry\n  and a fast fallback so a slow call never kills the joke's timing.\n\n📖 **[Read the full build story →](https://huggingface.co/blog/build-small-hackathon/brad-did-something)** — the sub-4B model bake-off, fighting the model's \"helpful AI\" reflex, and the comic pipeline.\n\n---\n\n## Run it yourself\n\n```bash\npip install -r requirements.txt\npython app.py     # → http://localhost:7860   (mock mode: fully playable, no keys)\n```\n\nWith no `MODAL_URL` set it uses a template-based offline generator. For the real\nAI, deploy the two Modal GPU apps and point the game at them:\n\n```bash\nmodal deploy modal_app/inference.py   # Qwen3.5-9B text  → MODAL_URL / MODAL_TOKEN\nmodal deploy modal_app/image.py       # FLUX.2-klein art → FLUX_URL / FLUX_TOKEN\n```\n\nOn the HF Space these four go in **Settings → Secrets**; locally, `.\\run_modal.ps1`\nwires them up for you.\n\n## Deploy (HF Docker Space)\n\nBuilt on **`gr.Server`** so the custom FastAPI frontend *is* the app — HF's\n`sdk: gradio` runner only launches a bare `demo`, so the Space runs `sdk: docker`\nwith a `Dockerfile` that does `uvicorn app:app` on port 7860. Dry-run locally:\n`docker build -t bds . 
&& docker run -p 7860:7860 bds`.\n\n## Controls\n\n**WASD/arrows** move · **SPACE** talk / answer / advance comic · **G** gift ·\n**1/2** choose · **ENTER** send a typed reply · **ESC** close · **M** mute.\n**📱 Mobile:** an on-screen joystick + ACT/GIFT buttons appear automatically, and\ntapping the floor walks you there.\n\n## Tests\n\n```bash\npytest tests/                  # unit: validator, economy, events, comic, idle\npython tests/smoke_http.py     # full-quarter API playthrough (mock)\npython tests/smoke_browser.py  # headless-browser UI smoke\n```\n\n---\n\n*Code MIT. Text model: Qwen3.5-9B (Apache-2.0). Image model: FLUX.2 [klein] 4B\n(non-commercial — fine for this demo). Built for the Hugging Face **Build Small\nHackathon**, \"An Adventure in Thousand Token Wood.\"*\n"
  },
  {
    "repo_id": "build-small-hackathon/grandpas-bedtime-stories",
    "title": "Grandpas Bedtime Stories",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/grandpas-bedtime-stories",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "achievement",
      "grandpas",
      "bedtime",
      "stories",
      "summary"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 3,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Grandpa's Bedtime Stories\nemoji: 👀\ncolorFrom: gray\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.13'\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: 'A WebXR bedtime storybook'\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n---\n\n## Grandpa's Bedtime Stories \n### Ever wanted an imaginary grandpa with a British accent to tell you a story from a still picture that turns into an AR fantasy? No? Here it is, anyway! \n\n### Choose a photo from the photo book, or create your own and ask questions about anything you can find and grandpa will give you his insights. What a person is wearing, what is going on, what country he thinks it is - the limit is your imagination!\n\nDemo: https://youtu.be/kd8J4pKokFg \n\nSocial post: https://www.reddit.com/r/OculusQuest/s/kjCoBmAVha\n\n## Requirements\n- A **microphone** is required — you speak with Grandpa and he talks back.\n- **Headphones or a quiet space** are recommended so Grandpa's voice doesn't re-trigger the microphone.\n- An **HTTPS connection** (or localhost) is required for microphone and WebXR access.\n\n## Adding Your Own Pictures\nHit the **+** slot at the end of the picture book to upload a photo from your device. Grandpa will auto-caption it — you can edit the caption before saving. The photo is then converted into a full 3D Gaussian splat (this takes a minute or two, longer on the first run while the engine warms up on Modal) and added to your personal picture book. 
Photos up to 20 MB / 40 MP are supported; captions up to 400 characters.\n\n## Running Locally\n```bash\npip install -r requirements.txt\n# Deploy the Modal GPU endpoints first (only needed once, or after changes):\nmodal deploy modal_sharp.py\nmodal deploy modal_minicpmo.py\nmodal deploy modal_app.py\n# Copy .env.example to .env and fill in your Modal API keys and endpoint URLs.\n# Then start the Gradio server:\npython app.py\n# Open http://localhost:7860 in your browser.\n```\n\n## Controls\n\n### Desktop & Mobile Browser\n| Input | Action |\n|---|---|\n| Drag | Orbit — look around the scene |\n| Scroll wheel | Zoom in & out |\n| Two-finger pinch | Zoom in & out (trackpad / touchscreen) |\n| Tap / click | Quick-select an area |\n| Double-click + drag | Draw a custom selection box |\n\n### AR — Headset (Meta Quest / Pico)\n| Input | Action |\n|---|---|\n| Trigger | Aim controller & pull to draw a selection box |\n| Grip | Hold & move to slide the picture |\n| Grip × 2 | Two hands to pinch-zoom |\n| Thumbstick | Push forward / back to zoom |\n| HUD (lower left) | Back · Clear selection · Reset view |\n\nHand tracking also works: pinch replaces the trigger, and a closed fist replaces the grip.\n\n### AR — Phone (Android)\n| Input | Action |\n|---|---|\n| Single finger drag | Draw a selection box |\n| Tap | Quick-select an area |\n| Two-finger touch | Pinch to zoom · swipe to pan |\n\n### Talking to Grandpa\nHit **Start listening**, then speak your question out loud — Grandpa will answer in voice. You can tap or draw a box around part of the picture first to focus his attention on a particular area.\n\n## Note about AR capabilities / Supported Devices\nAR works on WebXR-capable devices: Meta Quest headsets in Meta Quest Browser, Pico headsets in Pico Browser, and ARCore-supported Android phones opened in Chrome with Google Play Services for AR installed. 
iPhone and iPad Safari do not support WebXR AR, though experimental WebXR browsers such as [Mozilla's WebXR Viewer](https://apps.apple.com/us/app/webxr-viewer/id1295998056) may work. If AR is unavailable, the same picture can still be explored in the desktop/mobile browser viewer.\n\n\n## Model Focus\n### Apple Sharp\nThe 3D reconstruction model is [Apple's Sharp model](https://huggingface.co/apple/Sharp), a roughly 1B-parameter feed-forward model that converts a photo into a `.ply` Gaussian splat, from which a `.rad` LoD scene is built so the picture becomes a quickly explorable 3D space on desktop and in AR. \n\n### OpenBMB MiniCPM-o 4.5\nThe core storytelling model is OpenBMB's [MiniCPM-o 4.5](https://huggingface.co/openbmb/MiniCPM-o-4_5), a small but extremely powerful 9B-parameter omni model. We use it for the live voice loop: the image plus the user's speech go in, and Grandpa's answer comes back as text and speech seamlessly without needing any other models or endpoints (we are off-the-grid!). We also use MiniCPM-o for visual grounding, asking it to identify the visible people and objects we can focus on. MiniCPM-o's bounding-box support is also essential to the app, and we were extremely lucky to find everything we envisioned in a single model.\n\n### Modal \n\nBig thanks to Modal, which is essential to the app, both for running everything and for several iterations of testing models to see what worked well. I have included the `unused` folder as proof of the ~~blood, sweat and tears~~ effort. Hugging Face serves the Gradio/FastAPI viewer, but the heavy model work runs on Modal GPU endpoints: "
  },
  {
    "repo_id": "build-small-hackathon/carbon-vepor",
    "title": "Carbon Vepor",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/carbon-vepor",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "track",
      "sponsor",
      "carbon",
      "vepor",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Carbon VEPor (Carbon powered Variant Effect Prediction (VEP) Engine)\nemoji: 🏢\ncolorFrom: green\ncolorTo: blue\nsdk: docker\napp_port: 7860\npinned: false\nshort_description: Carbon powered Variant Effect Prediction (VEP) Engine\ntags:\n- track:backyard\n- track:wood\n- sponsor:openbmb\n- sponsor:nvidia\n- achievement:offgrid\n- achievement:welltuned\n- achievement:llama\n- achievement:sharing\n- achievement:fieldnotes\n---\n\n# 🧬 Carbon VEPor : Carbon powered Variant Effect Prediction (VEP) Engine\n\nAn enterprise-grade analytical orchestration pipeline for multimodal genomic document processing, sequence variant alignments, and downstream pathogenicity determination.\n<br>\n**Blog**: https://huggingface.co/blog/build-small-hackathon/carbon-vepor\n<br>\n**Social Media Post (X)**: https://x.com/VivekS100/status/2066637662849737049\n<br>\n**Video (Youtube)**: https://youtu.be/uCzNCPttKio\n<br>\n**Author**: Vivek Silimkhan ([viveksil](https://huggingface.co/viveksil))\n\n\n## 🚀 Key Features\n\n* **Multimodal Extraction Layer:** Seamlessly parses structural tables, variant coordinate markers, and clinical assay boundaries directly from raw PDF clinical records using [openbmb/MiniCPM-V-4.6-gguf](https://huggingface.co/openbmb/MiniCPM-V-4.6-gguf).\n* **Genomic Foundation Processing:** Translates clinical mutation notations into precise nucleotide strings and executes deep Log-Likelihood Ratio (LLR) extractions against target baseline genome matrices using [HuggingFaceBio/Carbon-3B-GGUF](https://huggingface.co/HuggingFaceBio/Carbon-3B-GGUF).\n* **Pathogenicity Boundary Classification:** Maps complex model embeddings through an optimized classifier head to generate definite pathogenicity confidence scores and categorical thresholds.\n* **Sandbox Agent Telemetry Audit:** Includes a trace panel displaying background system operations, dataset fetching logic loops, and autonomous exception recovery sessions using 
[ml-intern](https://github.com/huggingface/ml-intern) and [nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16](https://huggingface.co/nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16).\n\n## 🛠️ System Architecture\n\nThe application runs inside a secure, containerized environment isolating network layers across local ports:\n* **Port 7860:** Public frontend interface powered by Gradio 6.0+.\n* **Port 8081:** Multimodal document parser and synthesis engine (MiniCPM-V-4.6).\n* **Port 8082:** Native logit provider for sequence alignments (Carbon).\n\n## 📦 Directory Structures\n\nEnsure your project space is populated with the following runtime architecture elements:\n```text\n├── data/\n│   ├── sample_report.pdf    # Sample input clinical report used by the UI loader\n│   ├── classifier_head.pt   # Classifier head\n│   ├── extracted_llr.pt     # Extracted features\n│   └── agent_session.jsonl  # Exported execution trace metrics from ml-intern\n├── src/\n│   ├── app.py               # Main UI rendering script and state router\n│   ├── orchestrator.py      # Core processing and multi-stage orchestration logic\n│   ├── extract.py           # Genomic feature extractor component\n│   ├── carbon_backend.py    # Runs classifier and pathogenic predictions\n│   └── train.py             # Downstream neural boundaries classifier head optimizer\n├── Dockerfile               # Optimized, cache-friendly CPU wheel deployment engine\n└── requirements.txt         # Standalone workspace Python package configurations\n```\n\n## Models and Dataset\n**Models**: https://huggingface.co/buckets/build-small-hackathon/carbon-vepor-models\n<br>\n**Dataset**: https://huggingface.co/datasets/build-small-hackathon/clinvar-cls (same as viveksil/clinvar-cls)\n<br>\n**Agent trace**: https://huggingface.co/datasets/build-small-hackathon/carbon-vepor-traces\n\n## 🐋 Local Development & Build\nThis engine is built to take full advantage of cache layers, utilizing pre-compiled Linux wheels for the underlying binding dependencies 
to guarantee instant, compile-free environment setup.\n\nTo build and run the container locally:\n```bash\n# Build the container\ndocker build -t carbon-vep-engine .\n\n# Execute the application\ndocker run -p 7860:7860 -p 8081:8081 -p 8082:8082 carbon-vep-engine\n```\n\n## Note:\n1. Chrome/Chromium may block the PDF viewer due to X-Frame-Options. The demo has been tested on Mozilla Firefox.\n2. This Space runs on a basic CPU instance, so response times are long (~200-300 seconds). "
  },
  {
    "repo_id": "build-small-hackathon/Minecraftify",
    "title": "Minecraftify",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Minecraftify",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "minecraftify",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Minecraftify\nemoji: ⚡\ncolorFrom: blue\ncolorTo: red\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.13'\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Minecraftify converts all images into mc-style LIVE!\ntags:\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n# Minecraftify!\n\n**Minecraftify your images live.**\n\nMinecraftify is a Hugging Face Gradio Space that turns uploaded photos into a faithful vanilla Minecraft interpretation of the same scene. It is powered by a fine-tuned **FLUX.2-Klein-4B** img2img LoRA trained on a custom dataset generated with **Qwen-Edit-25-12**.\n\n## Live Demo\n\n* **Space:** Minecraftify\n* **Demo video:** [YouTube walkthrough](https://youtu.be/W-2yEjlTOK4)\n* **Blog post:** [Project blog](https://huggingface.co/blog/build-small-hackathon/minecraftify)\n\n## Current Status\n\nPlayable demo Space with:\n\n* still-image Minecraftification\n* live webcam mode\n* persistent model caching on Space storage\n* LoRA-based FLUX.2-Klein inference\n* Gradio UI with image upload, webcam input, and advanced settings\n\n## What It Does\n\nMinecraftify transforms an input image into a Minecraft-style version of the same scene while trying to preserve:\n\n* composition\n* camera angle\n* layout\n* objects already present in the scene\n* color relationships and overall structure\n\nThe model is tuned to:\n\n* replace realistic surfaces with Minecraft blocks and voxel geometry\n* keep the scene recognizable\n* avoid unnecessary scene changes\n* convert people, animals, and objects into Minecraft-style equivalents where needed\n\n## Project Artifacts\n\n* **Base model:** `black-forest-labs/FLUX.2-klein-4B`\n* **LoRA adapter:** `AnimeOverlord/flux2-klein-4b-mc-v2`\n* **Dataset:** 376 image pairs created with Qwen-Edit-25-12\n* **Training script:** 
`train_dreambooth_lora_flux2_klein_img2img.py`\n\n## Fine-Tuning Setup\n\nThe LoRA was trained with FLUX.2-Klein img2img using a paired dataset with:\n\n* `source_image` as the conditioning image\n* `edited_image` as the target image\n* `prompt_used` as the caption column\n\nTraining highlights:\n\n* `train_batch_size=1`\n* `gradient_accumulation_steps=4`\n* `mixed_precision=bf16`\n* `learning_rate=2e-6`\n* `lr_scheduler=constant_with_warmup`\n* `lr_warmup_steps=200`\n* `max_train_steps=1200`\n* `rank=64`\n* `cache_latents`\n* `use_8bit_adam`\n* `aspect_ratio_buckets` enabled\n\n## Hackathon Fit\n\nMinecraftify is designed as a compact, fun, image-to-image Space with a strong visual identity and an immediate demo loop.\n\nIt fits the small-model spirit because the core generation path is built around a **4B FLUX Klein model** with a LoRA adapter rather than a large general-purpose model.\n\n## How to Demo\n\n1. Upload an image or start the webcam.\n2. Choose still image or live mode.\n3. Adjust inference steps, guidance scale, and seed.\n4. Click **Minecraftify!**\n5. 
Download or inspect the result.\n\n## Recommended Demo Settings\n\n* **Inference steps:** 3\n* **Guidance scale:** 3.0\n* **Seed:** any fixed value for reproducibility\n* **Input:** well-lit images with clear objects and simple scenes\n\n## Features\n\n* image upload\n* webcam capture\n* live frame processing\n* prompt-based scene preservation\n* persistent model caching in Hugging Face Space storage\n* adjustable inference settings\n* output comparison view\n\n## Model and Runtime\n\nThe app loads the FLUX.2-Klein base model and then applies the Minecraft LoRA adapter.\n\nRuntime behavior:\n\n* models are cached on persistent Space storage\n* weights are reused across runs when present\n* the pipeline is kept in memory for the active session\n* image generation uses img2img inference for scene preservation\n\n## Space Storage\n\nThis Space is configured to use persistent storage so model files do not need to be downloaded every time the Space restarts.\n\n* model cache path: `/data/models`\n* Hugging Face cache path: `/data/.huggingface`\n\n## Architecture\n\n**Input image** → **FLUX.2-Klein img2img** → **Minecraft LoRA** → **Rendered output**\n\nFor live mode, webcam frames are captured continuously and only the latest frame is processed when the model becomes available.\n\n## Local Development\n\nThis project was trained locally with PyTorch and Accelerate, and the training workflow also supports pushing the fine-tuned model to the Hugging Face Hub.\n\n### Running locally with PyTorch\n\n#### 1) Install the training dependencies\n\nFor the most up-to-date Diffusers example scripts, it is recommended to install Diffusers from source:\n\n```bash\ngit clone https://github.com/huggingface/diffusers\ncd diffusers\npip install -e .\n````\n\nThen install the FLUX DreamBooth example requirements:\n\n```bash\ncd examples/dreambooth\npip install -r requirements_flux.txt\n```\n\n#### 2) Configure Accelerate\n\nSet up Accelerate for your environment:\n\n```bash\naccelerate 
config\n```\n\nIf you want the default configuration without answering prompts:\n\n```bash\naccelerate config default\n```\n\nIf you are runni"
  },
  {
    "repo_id": "build-small-hackathon/kids-ai-app",
    "title": "Kids Ai App",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/kids-ai-app",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "kids",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Kids Ai App\nemoji: 🦀\ncolorFrom: pink\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.13'\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: 'An app for kids that reads, writes stories, generates images '\ntags:\n  - track:backyard\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# 🎓 Kids AI Academy\n> **An intelligent, multimodal educational platform for children powered by NVIDIA AI models (⚡ ~6.85 Billion Total Parameters) — deployed serverlessly on Modal.**\n\n> 📺 **[Watch the full video demo and post on X (Twitter)!](https://x.com/AKIS23820044161/status/2066583488749597121?s=20)**\n\n> Special Bonus Targets: Best Demo · Off-Brand Award \n---\n\n## 📖 Overview\n\nKids AI Academy is a Gradio-based web application that uses multiple state-of-the-art NVIDIA AI models to create an immersive, magical learning experience for children. The platform generates original stories and poems, reads them aloud with a natural AI voice, produces vibrant illustrations, and can even scan and narrate physical book pages.\n\n---\n\n## 🚀 Features\n\n| Feature | Description |\n|---|---|\n| 🔮 **Magical Storybook** | Generate custom children's stories with adjustable length, custom theme & hero |\n| 🦁 **Wonder Cards (Poems)** | Pick animal characters and a theme — get a rhyming poem + illustrations |\n| 📖 **Book Scanner** | Upload a photo/PDF of a book page — get back structured markdown & audio read-aloud |\n| 🎨 **AI Illustrations** | Generate vibrant storybook-style illustrations per paragraph/stanza |\n| 🔊 **Audio Narration** | Every story and poem is read aloud using a natural NVIDIA AI voice |\n\n---\n\n## 🤖 AI Models Used\n\n### 1. 
`nvidia/NVIDIA-Nemotron-3-Nano-4B` — Story & Poem Generation\n- **Task:** Text generation (LLM chat)\n- **Total Parameters:** 4 Billion (4B)\n- **Format:** GGUF Q4_K_M quantised (runs efficiently on T4 GPU)\n- **Used for:** Generating children's stories (Short / Medium / Long) and rhyming animal poems\n- **Inference:** `llama-cpp-python` on Modal **T4 GPU**, n_ctx=8192\n\n### 2. `nvidia/magpie_tts_multilingual_357m` — Text-to-Speech\n- **Task:** Neural Text-to-Speech (TTS)\n- **Total Parameters:** 357 Million (357M)\n- **Used for:** Reading generated stories and poems aloud in a child-friendly voice\n- **Voice:** Sofia (speaker_id=1), female English, 22050 Hz WAV\n- **Fallback:** `gTTS` if Magpie fails to initialise\n- **Inference:** NeMo Toolkit on Modal **T4 GPU** (co-located with Nemotron LLM)\n\n### 3. `Efficient-Large-Model/Sana_1600M_1024px_diffusers` — Image Generation\n- **Task:** Text-to-Image diffusion (1024×1024 px)\n- **Total Parameters:** 1.6 Billion (1600M)\n- **Used for:** Vibrant children's book illustrations\n- **Story mode:** 1 image per paragraph\n- **Poem mode:** 1 image per 4 lines (+ remaining lines)\n- **Inference:** HuggingFace Diffusers on Modal **T4 GPU**\n\n### 4. 
`nvidia/NVIDIA-Nemotron-Parse-v1.1-TC` — Book Page OCR\n- **Task:** Vision-Language Model for structured document understanding\n- **Total Parameters:** 0.9 Billion (0.9B)\n- **Used for:** Scanning book page photos or PDFs → clean Markdown text\n- **PDF support:** Auto-rasterised at 200 DPI via PyMuPDF\n- **Inference:** HuggingFace Transformers (bfloat16) on Modal **A10G GPU**\n\n---\n\n## 🏗️ Architecture\n\n```\nGRADIO FRONTEND (app.py)\n  ├── pages/storybook.py   → Magical Storybook UI\n  ├── pages/flashcards.py  → Wonder Cards (Poems) UI\n  ├── pages/scanner.py     → Book Scanner UI\n  └── utils/backend.py     → HTTP client → Modal REST endpoints\n\nMODAL SERVERLESS BACKEND (modal_backend.py)\n  ├── Assistant [T4]       → /story  (Nemotron LLM + Magpie TTS)\n  │                        → /synthesize (TTS only)\n  ├── Art       [T4]       → /image  (Sana 1.6B diffusion)\n  └── Scanner   [A10G]     → /parse  (Nemotron-Parse VLM)\n```\n\n---\n\n## 🖥️ GPU Resources (Modal)\n\n| Container | GPU | Model(s) | Purpose |\n|---|---|---|---|\n| `Assistant` | T4 (16GB) | Nemotron-3-Nano-4B + Magpie TTS | Story/poem generation + narration |\n| `Art` | T4 (16GB) | Sana 1.6B | Image illustration generation |\n| `Scanner` | A10G (24GB) | Nemotron-Parse-v1.1-TC | Book page OCR / document parsing |\n\n---\n\n## ⚙️ Setup & Deployment\n\n```bash\n# 1. Install dependencies\npython -m venv venv && source venv/bin/activate\npip install -r requirements.txt\n\n# 2. Configure .env with Modal endpoint URLs\n# MODAL_STORY_URL=...\n# MODAL_IMAGE_URL=...\n# MODAL_PARSE_URL=...\n\n# 3. Deploy Modal backend\nmodal deploy modal_backend.py\n\n# 4. Run the Gradio frontend\npython app.py\n```\n"
  },
  {
    "repo_id": "build-small-hackathon/pensieve",
    "title": "Pensieve",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/pensieve",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "pensieve",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Pensieve\nemoji: 🎙️\ncolorFrom: gray\ncolorTo: red\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Speak a thought, get a markdown note, then chat with them\ntags:\n  - gradio\n  - build-small-hackathon\n  - track:backyard\n  - sponsor:modal\n  - sponsor:cohere\n  - achievement:offbrand\n  - best-demo\nmodels:\n  - Qwen/Qwen3-8B\n  - Qwen/Qwen3-Embedding-0.6B\n  - CohereLabs/cohere-transcribe-03-2026\n---\n\n# Pensieve\n\nSpeak a thought and Pensieve turns it into a clean markdown note in the background. Browse your\ngrowing collection of notes and ask questions across everything you have captured.\n\nCapture is asynchronous: stop recording and the transcribe, summarise and index pipeline runs as a\nbackground job, so you can record the next thought right away.\n\nI built this for my dad, who is always going on walks and recording voice notes of his thoughts.\nPensieve allows him to build a catalogue of his thoughts and recall them easier.\n\n## Demo\n\n- Video: https://huggingface.co/spaces/build-small-hackathon/pensieve/resolve/main/pensieve-demo.mp4\n- Social post: https://www.linkedin.com/feed/update/urn:li:activity:7472430389475610624/\n\n## How it works\n\nThe front end is a dark, minimalist Gradio app with a bottom tab bar that installs to the home\nscreen on an iphone as a progessive web app (PWA). All AI inference runs on Modal, and every model is under 32B parameters.\n\n- Record: capture audio, then a background job runs transcribe, summarise and index.\n- Jobs: a live view of each pipeline job and its stage.\n- Knowledge: a Chat and Notes view. Chat answers with RAG over your notes and cites sources. 
Notes\n  lets you search and read your captured notes.\n\n## Models (all < 32B)\n\n| Role       | Model                                | Runs on       |\n|------------|--------------------------------------|---------------|\n| ASR        | CohereLabs/cohere-transcribe-03-2026 | Modal, L4 GPU |\n| Embeddings | Qwen/Qwen3-Embedding-0.6B            | Modal, CPU    |\n| LLM        | Qwen/Qwen3-8B                        | Modal, L4 GPU |\n\n\n## Future work\n\nThe first order of business would be to move the data into user-owned storage (like Google Drive).\nRight now it's on a dataset repo, but it's not private as I have access, although users can't see each other's data.\n\nI could then speed up inference by using per-token-cost APIs instead of having to cold-start GPUs.\nI am currently using memory snapshots and that does seem to speed things up a lot.  \n\nImprove RAG and prompting: I am currently using hybrid RAG with reciprocal rank fusion (RRF), but a re-ranker couldn't hurt.\n"
  },
  {
    "repo_id": "build-small-hackathon/Guarden",
    "title": "Guarden",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Guarden",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "guarden",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Guarden \nemoji: 🌱🛡️\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npinned: false\nshort_description: Virtual garden and weather-based plant care. 🌺\ntags:\n  - track:backyard\n  - achievement:tinytitan\n  - achievement:fieldnotes\n  - achievement:bestdemo\n  - achievement:offbrand\n---\n\n\n\n# Guarden 🌱🛡️\n\nAn AI-powered plant care companion that identifies plants from photos, helps users organize and monitor their virtual garden, and provides personalized care recommendations based on each plant's specific needs and local weather conditions.\n\n## Context\n\nThis project was originally made for my father, a passionate botany enthusiast who cares for a wide variety of plants. As his collection grew, keeping track of each plant’s specific needs—especially watering schedules—became increasingly challenging.\n\nGuarden was created to simplify plant care by bringing all of a user's plants together in a single, intuitive application. The app helps users identify their plants, organize them in a virtual garden, access species-specific information, and receive personalized watering recommendations based on local weather conditions.\n\nOur goal is to make plant care more accessible, accurate, and enjoyable for both experienced gardeners and beginners alike.\n\n### Team :\n[AntoineR974](https://huggingface.co/AntoineR974) and [Crocolil](https://huggingface.co/Crocolil)\n\n## Description\n\nGuarden helps users build and manage a virtual garden. Simply upload a picture of a plant, and the application identifies the species using a computer vision model trained on plant datasets. 
Once identified, the plant is added to the user's virtual garden, where they can view plant information, care requirements, and daily watering recommendations based on the weather forecast.\n\nThe application combines:\n\n* Plant image recognition\n* Species-specific care information\n* 7-day weather forecasting\n* Smart watering recommendations\n\n### Features\n\n* 📸 Plant identification from images\n* 🌿 Virtual garden management\n* 🌤 7-day weather forecast\n* 💧 Daily watering recommendations\n* 🤖🌿 AI plant care assistant\n* 📊 Plant care information dashboard\n\n## Demo Video\n\n🎥 Demo video: \n\n```text\nGuarden 🌱🛡️ An AI-powered plant care tool via @YouTube\n```\nhttps://youtu.be/dPty2SkEbtw?si=kMFB6Bg5Id_pernL\n\n## Social Media Post\n\nGuarden is an AI-powered gardening assistant that identifies plants from pictures and generates personalized watering recommendations based on local weather forecasts.\nLink: https://x.com/Alphaeo_/status/2066628380032704939\n\n## Technical Report\n\nFor a detailed explanation of the architecture, datasets, and implementation choices, see the [Technical Documentation](TECHNICAL_DOCUMENT.md).\n(Note: the two models we use total 3.6B parameters combined.)\n\n## Categories\nOur project applies for the following categories and badges:\n\n* Backyard AI\n* Tiny Titan\n* Field Notes\n* Best Demo\n* Off-Brand\n\n\n\n"
  },
  {
    "repo_id": "build-small-hackathon/meltmind-ai",
    "title": "Meltmind Ai",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/meltmind-ai",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "meltmind",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: MeltMind AI\nemoji: 🍫\ncolorFrom: yellow\ncolorTo: gray\nsdk: gradio\nsdk_version: 5.49.1\napp_file: app.py\npython_version: \"3.10\"\npinned: false\nmodels:\n  - openbmb/MiniCPM4-8B\ntags:\n  - gradio\n  - build-small\n  - backyard-ai\n  - best-use-of-codex\n  - best-minicpm-build\n  - off-brand\n  - off-the-grid\n  - best-agent\n  - llama.cpp\n  - minicpm\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n---\n\n# MeltMind AI\n\nMeltMind AI is a small-model dessert concierge built for MeltRoom. Customers\ndescribe their craving, group, occasion, and budget; MeltMind recommends a\nclear, explainable order from the real MeltRoom menu.\n\n## Demo and social\n\n- Demo video: https://youtu.be/8iYzD37YSj0\n- Launch post on X: https://x.com/i/status/2066683679087472850\n\n## Architecture\n\nMeltMind uses a hybrid local-first architecture:\n\n- **MiniCPM4-8B** independently reasons over the customer request, conversation\n  history, complete live menu, and a searchable corpus built from every\n  MeltMind JSON and Markdown knowledge file.\n- MiniCPM writes the answer, chooses relevant recommendation cards, and\n  explains every Perfect Melt plan.\n- Python tools retrieve verified menu, ingredient, FAQ, and policy facts and\n  expose the full knowledge corpus to MiniCPM.\n- The deterministic planner owns product selection, quantities, serving guidance,\n  allergen exclusions, and budget arithmetic.\n- Model output is validated before display. Unsupported currency figures and\n  unsafe claims fall back to the verified deterministic response.\n\nThis qualifies for the **OpenBMB Best MiniCPM Build** sponsor category because\nthe submitted app uses `openbmb/MiniCPM4-8B` as its language model. 
It does not\nqualify for Tiny Titan because the model is larger than 4B parameters.\n\n## Run MiniCPM locally\n\nThe app connects to the OpenAI-compatible llama.cpp server at\n`http://127.0.0.1:8080`. Install a current llama.cpp build that provides\n`llama-server`, then run:\n\n```bash\nchmod +x scripts/start_minicpm.sh\n./scripts/start_minicpm.sh\n```\n\nThe included project runtime uses `.runtime/llama.cpp/llama-b9637/llama-server`\nwhen available. The first run downloads the Q4_K_M GGUF, approximately 5 GB,\ninto `.runtime/models`. The GGUF is a quantized build of\n`openbmb/MiniCPM4-8B`; the official OpenBMB base model remains the model\nattributed in the app.\n\nIn another terminal, start the Gradio application:\n\n```bash\nsource .venv/bin/activate\npython app.py\n```\n\nOr start the local MiniCPM server and Gradio app together:\n\n```bash\n./scripts/start_meltmind.sh\n```\n\nThe deterministic grounded engine remains available when llama.cpp is offline,\nbut responses identify that state as a MiniCPM fallback.\n\n## Deploy on Hugging Face Spaces\n\nCreate a new **Gradio** Space and upload this repository without `.runtime`,\n`.venv`, `video`, or `artifacts`. In Spaces, `app.py` immediately serves the\ninterface while `space_runtime.py` downloads the quantized MiniCPM model and\nstarts the local llama.cpp-compatible backend in a background thread.\n\nThe first boot downloads approximately 5 GB. Until MiniCPM is ready, the\ndeterministic grounded engine remains available. Check live backend readiness at:\n\n```text\n/api/meltmind/model\n```\n\nDetailed manual deployment instructions are in `DEPLOYMENT.md`.\n\n## Setup from scratch\n\n```bash\npython -m venv .venv\nsource .venv/bin/activate\npip install -r requirements.txt\n```\n\n## Quality checks\n\n```bash\npython scripts/audit_knowledge.py\npython scripts/evaluate_meltmind.py\n```\n\nThe evaluation blocks release for critical price, policy, allergen, serving,\navailability, and budget-calculation failures.\n"
  },
  {
    "repo_id": "build-small-hackathon/ModuleMind",
    "title": "Modulemind",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/ModuleMind",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "modulemind",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Modular Mind 🧠\nemoji: 💭\ncolorFrom: green\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.15.2\napp_file: app.py\npinned: false\nshort_description: We modulate the mind and communicate!\nlicense: mit\ntags:\n  - track:wood\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n---\nSocial Media Post and Demo Video: https://www.linkedin.com/posts/dean-byrne-02a28b191_modular-mind-boss-fight-for-hugging-face-ugcPost-7472410483615084544-yUeC/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAC0RumIBxlIKTkKv5tF-hb2OU7TdZ19kxcQ\nModel Weights are in the Space's directory.\n# ⚔️ Modular Mind: Boss Fight\n\nA mini **Dark-Souls-style** duel where the boss is controlled by a **Modular Mind** —\na handful of tiny neural *specialists* that communicate through a **shared latent**\n(a `RecursiveLink` bridge) and a *coordinator* that reads the latent to choose the\nboss's next move. **The brain was trained by self-play reinforcement learning** — its\ntactics emerged from playing thousands of duels, nothing is scripted.\n\nYou play the **Fire Knight**. 
Defeat the **Demon Slime**.\n\n## How the Modular Mind works\n\nThis is the [ModularMind-on-V2](https://github.com/your-username/ModularMind) concept at *specialist scale*: instead of one\nmonolithic policy, six small networks each handle one concern and talk to each other\nthrough a latent channel.\n\n```\n game state ─▶ ┌──────────────┐   each specialist emits a latent\n               │ Aggressor    │──┐  (LatentProjection) and, if it OWNS\n               │ Stalker      │──┤   an action, a \"drive\" for that action\n               │ Survivor     │──┤\n               │ Baiter       │──┤        ┌───────────────┐      ┌─────────────┐\n               │ Punisher (M) │──┼─ sum ─▶│ RecursiveLink │─────▶│ Coordinator │─▶ action\n               │ Enrage   (M) │──┘        │ ReGLU+residual│ shared│  read-out   │\n               └──────────────┘           └───────────────┘ latent└─────────────┘\n```\n\n- **Four action-owning specialists** push their move's score directly:\n  **Aggressor → CLEAVE**, **Stalker → APPROACH**, **Survivor → RETREAT**, **Baiter → IDLE**.\n- **Two modulators (M)** — **Punisher** (\"the player is open!\") and **Enrage**\n  (\"we're low on HP — go berserk\") — **own no action**. Their *only* way to affect the\n  fight is the latent they write into the shared `RecursiveLink`, which the coordinator\n  turns into modulation. 
So training has to *learn to use the latent channel* — the whole\n  point of the architecture.\n\nThe right-hand panel shows all of this live: each specialist's activity, the shared\nlatent bridge, and the coordinator's modulation, for every decision the boss makes.\n\n## What emerged from training\n\nTrained on a reward that values *dealing damage* and *pressuring in range* over\nplaying it safe (landing a cleave ≫ whiffing, and stalling / staying out of range is\npenalised), the boss learned an **aggressive pressure** style:\n\n- **closes the distance** when you're far or at mid-range,\n- **cleaves on contact** — once you're in range and it's off cooldown it commits to a\n  lunging swing essentially every time (verified: in-range attack-rate ≈ **0.8–1.0**),\n- **retreats only when it can't swing** (mid-cooldown) to reset spacing,\n- **blocks your punish** — a **Defender** specialist raises a guard (negating ~90% of\n  your melee) when you swing at it and it can't cleave back,\n- **punishes your recovery** and gets **even more aggressive at low HP** — the Enrage\n  modulator raises CLEAVE through the shared latent.\n\nIt reaches a **~55–65% win rate** against a near-optimal scripted dodger (avg reward\n+8, up from −12 before the reward was tuned for aggression). Against a human it's a\nfair, readable fight: **dodge the red telegraph, then punish the recovery.**\n\n> The earlier version of the brain learned a degenerate \"space forever, never commit\"\n> policy that *technically* won but barely attacked — so the trainer now selects the\n> checkpoint on **win-rate + in-range attack-rate**, and a non-attacking policy can no\n> longer be saved. (`behavior()` in `train.py` measures this directly.)\n\n## It learns from your fights (online finetuning)\n\nThe model is tiny, so a gradient step is microseconds — the boss finetunes from real\nplay **on the free CPU**. 
Each HARD-tier fight is logged (state, action, HP per boss\ndecision) and POSTed to `/learn`; the server rebuilds per-decision rewards (damage\ndealt − taken, + kill / − death), computes REINFORCE returns, and takes **one Adam\nstep** ([`mm_grad.py`](mm_grad.py), numpy backprop verified against PyTorch to ~1e-8).\nA frozen copy of the sim-trained weights anchors the update so it can't drift into\nnonsense; the adapted weights feed straight back into the live boss.\n\n- **On by default, in-memory.** Set `MM_ONLINE=0` to disable.\n- **Persistent across restarts:** add Space secrets `HF_TOKEN` (write) and\n  `MM_DATASET_REPO` (e.g. `you/boss-fight-online`) and the adapted weights are pushed\n  to / pulled from that Dataset. Only HARD-tier fights train (keeps the"
  },
  {
    "repo_id": "build-small-hackathon/micro-rpg-engine",
    "title": "Micro Rpg Engine",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/micro-rpg-engine",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "rpg",
      "micro",
      "engine",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Micro RPG Engine\nemoji: 🍄\ncolorFrom: purple\ncolorTo: indigo\nsdk: gradio\nsdk_version: 5.50.0\napp_file: app.py\npinned: true\nlicense: apache-2.0\nshort_description: A whole RPG world generated live by a small 1B-4B model.\ntags:\n  - small-models-hackathon\n  - track:wood\n  - thousand-token-wood\n  - achievement:offbrand\n  - off-brand\n  - rpg\n  - text-adventure\n  - qwen\n  - minicpm\n---\n\n<!-- SUBMISSÃO:\n  Demo video: https://youtu.be/-XfaAcRHH28\n  Social post: https://www.linkedin.com/posts/luiz-felipe-barbedo-94188215a_buildsmall-smallmodels-llm-share-7472417718395301889-ZSKJ/\n  Track: Thousand Token Wood (entretenimento/whimsical)\n  ⚠️ Confirme o slug exato da tag de track no template da org build-small-hackathon.\n-->\n\n> **🎥 Demo video:** https://youtu.be/-XfaAcRHH28 &nbsp;•&nbsp; **📣 Social post:** https://www.linkedin.com/posts/luiz-felipe-barbedo-94188215a_buildsmall-smallmodels-llm-share-7472417718395301889-ZSKJ/\n\n# 🍄 Micro RPG Engine\n\nA text RPG where a **small language model (1B–4B)** generates *everything* in real\ntime — the world, NPCs, dialogue, combat, the shop, random events. There is no\npre-written content. **No AI, no game.** Every playthrough is unique.\n\n> Hugging Face Small Models Hackathon — **Track 2**\n\n## The technical bet\n\nThe hard part with small models isn't writing pretty prose — it's **narrative\nconsistency**: not forgetting your HP, your inventory, that you already killed the\ngoblin. A generic \"RPG-themed chatbot\" loses the plot in three turns.\n\nOur approach makes the **Python engine the source of truth**, not the model:\n\n```\n                 ┌─────────────────────────────────────────┐\n   player input  │  GameEngine (turn loop)                  │\n  ───────────────▶                                          │\n                 │  1. build context from GameState  ───────┼──▶  System prompt\n                 │  2. 
call the 1B-4B model                  │     + authoritative\n                 │  3. parse output  ◀──────────────────────┼─────  state snapshot\n                 │     ├─ <narrative> → shown to player      │\n                 │     └─ <state> tags → VALIDATED & applied │\n                 │  4. GameState mutates (HP, gold, items)   │\n                 └─────────────────────────────────────────┘\n```\n\nThe model never *remembers* the numbers — it receives them, fresh, every turn, and\nmay only *propose* deltas (`HP: -10`, `ITEM_ADD: Rusty Sword`) through a strict tag\nprotocol. The parser clamps and validates every change against the real state. The\nmodel handles imagination; Python handles bookkeeping. That's what keeps a 1.5B\nmodel coherent across a long dungeon crawl.\n\n## Run locally\n\n```bash\npip install -r requirements.txt\npython app.py\n```\n\nBy default it loads the model with `transformers`. To run with no local GPU, set a\nHugging Face token and it falls back to the serverless Inference API:\n\n```bash\n# Windows PowerShell\n$env:HF_TOKEN = \"hf_...\"\n$env:MICRORPG_BACKEND = \"inference_api\"\npython app.py\n```\n\n## Configuration (env vars)\n\n| Variable             | Default                       | Meaning                                  |\n|----------------------|-------------------------------|------------------------------------------|\n| `MICRORPG_MODEL`     | `Qwen/Qwen3-4B-Instruct-2507` | Model repo id                            |\n| `MICRORPG_BACKEND`   | `transformers`                | `transformers` \\| `inference_api` \\| `mock` |\n| `HF_TOKEN`           | —                             | Token for the Inference API backend      |\n| `MICRORPG_MAX_TOKENS`| `512`                         | Max new tokens per turn                  |\n\nSet `MICRORPG_BACKEND=mock` to run the full engine with a deterministic fake model\n(no weights, no network) — handy for testing the parser and UI.\n\n## Fine-tuning (the \"Well-Tuned\" quest)\n\nThe hard 
skill for a small model here is emitting the strict three-block tag format\nwith valid mechanics, every turn. We teach it with a **parser-validated synthetic\ndataset**: `build_dataset.py` generates RPG turns in the exact protocol, then runs\n**every single one through the real engine parser** and keeps only those that parse\nand apply cleanly. 100% of the training data is guaranteed well-formed.\n\n```bash\npip install -r requirements-train.txt          # GPU / Colab\npython -m finetune.build_dataset --n 1200       # offline, no model needed\npython -m finetune.train \\\n    --model Qwen/Qwen3-4B-Instruct-2507 \\\n    --out finetune/out/qwen3-4b-microrpg        # LoRA, ~few MB adapter\n```\n\nPlay with your fine-tuned model by pointing the engine at the adapter:\n\n```bash\n# Windows PowerShell\n$env:MICRORPG_ADAPTER = \"finetune/out/qwen3-4b-microrpg\"\npython app.py\n```\n\nThe dataset is model-agnostic — swap `--model` for MiniCPM, or a Llama for the\n**Llama Champion** quest. Add `--load-4bit` for QLoRA on a small GPU.\n\n## Project layout\n\n```\napp.py              Gradio UI + glue\nstyle.css           Custom theme (parchment / arcane)\nengine/\n  game_state.py     GameState: HP, gold, inventory"
  },
  {
    "repo_id": "build-small-hackathon/slipstream",
    "title": "Slipstream",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/slipstream",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "slipstream",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Slipstream\nemoji: 📈\ncolorFrom: blue\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.16.0\napp_file: app.py\npinned: false\nlicense: cc-by-4.0\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Slipstream\n\nThis space provides an interactive presentation and live demo of the Slipstream project-controls forecasting benchmark: a new benchmark for forecasting a project's final cost (EAC) and finish period from mid-flight Earned Value data, an agentic layer that reconciles the forecasting tools, and the distillation of that agent into small models for edge, air-gapped, and on-device forecasting.\n\nThe application is built with a buildless Preact frontend served by a Gradio Server (`gr.Server`), which also exposes the benchmark results through typed API endpoints. The headline edge agent is `openbmb/MiniCPM5-1B` distilled into a project-controls forecasting agent ([`slipstream-minicpm5-1b-evm`](https://huggingface.co/build-small-hackathon/slipstream-minicpm5-1b-evm)). Off the shelf, the 1B base model returns a usable forecast less than 2% of the time; after distillation, it reaches ~99% validity with a median cost error of roughly 2.7%, matching the traditional Earned Schedule baseline. It serves as the default model in the live demo. We also distilled `nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16`, a hybrid Mamba-2 + attention model, into [`slipstream-nemotron3-nano-4b-evm`](https://huggingface.co/build-small-hackathon/slipstream-nemotron3-nano-4b-evm) - our strongest student model, achieving about 2.37% cost error and a 0.61-period finish error. 
It was trained on its dedicated CUDA stack on Modal and is selectable in the live demo.\n\nDEMO: https://youtu.be/v-k7lxXskTA\n\n## Try it live\n\nThe final slide runs a real held-out project through the agentic layer live on Modal, next to Earned\nSchedule, TimesFM and TabPFN, and compares every forecast against the true outcome. Each run\ncold-starts a GPU, so expect roughly 5-7 minutes; the methods stream in as they finish.\n\n## Social post\n\nHackathon social post: https://x.com/NZXW63TF/status/2066647669540360315\n\nDisclaimer: I do not really use social media. This X / Twitter account exists only so I can scrape\ntweets from lists.\n\n## Links\n\n- Write-up / article: https://huggingface.co/blog/build-small-hackathon/slipstream\n- Distillation dataset: https://huggingface.co/datasets/build-small-hackathon/slipstream-evm-sft\n- MiniCPM5-1B agent: https://huggingface.co/build-small-hackathon/slipstream-minicpm5-1b-evm\n- Nemotron-3-Nano 4B agent: https://huggingface.co/build-small-hackathon/slipstream-nemotron3-nano-4b-evm\n- Gemma-E2B agent: https://huggingface.co/build-small-hackathon/slipstream-gemma4-e2b-evm\n- Organisation: https://huggingface.co/build-small-hackathon\n\nThe dataset is CC-BY-4.0; each model inherits its base model's licence. Everything was trained,\nevaluated and benchmarked on Modal including the live demo in this space.\n\n## Run locally\n\n```bash\npip install -r requirements.txt\npython app.py     # http://localhost:7860\n```\n\nThe live-demo backend is a separate deployed Modal app (`pipeline/agent/demo_modal.py`); set\n`SLIPSTREAM_DEMO_URL` to point the Space at your own deployment if you redeploy it.\n"
  },
  {
    "repo_id": "build-small-hackathon/agent-parkour",
    "title": "Agent Parkour",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/agent-parkour",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "agent",
      "parkour",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Agent Parkour\nemoji: 🏃\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.18.0\npython_version: 3.12\napp_file: app.py\nfullWidth: true\npinned: true\nlicense: other\nlicense_name: \"MIT code + third-party asset licenses\"\nshort_description: 3D parkour maps and races with a tiny RL agent.\ntags:\n  - track:wood\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# Agent Parkour\n\nAgent Parkour is a browser-playable 3D parkour game where generated\nfloating-platform courses become tests for a trained reinforcement learning\nagent. A player can build a course with procedural controls or with the built-in\nGemma 4 31B map mode. The trained agent is then dispatched in simulation, its\nbest clear becomes a replay, and the player races that run on the same map.\n\n## TL;DR For Judges\n\n| Field | Details |\n| --- | --- |\n| Track | Thousand Token Wood |\n| What it does | Generate a 3D parkour map either with an LLM or procedurally, dispatch a trained reinforcement learning agent to run it, then race against the agent on that same course. |\n| Map creation | Procedural generation by default, or optional LLM course drafting with the built-in Gemma 4 31B map mode. |\n| Small model | The game agent has 866,057 learned parameters. It is an end-to-end neural controller that reads egocentric movement, goal, and platform observations, then outputs steering, turning, jump, and sprint actions. |\n| Demo video | https://www.youtube.com/watch?v=cAzgDIMmTf8 |\n| Social post | https://x.com/marmcisgreat/status/2066669787087036785 |\n| Space | https://huggingface.co/spaces/build-small-hackathon/agent-parkour |\n\n## Directory Structure\n\n| Path | Purpose |\n| --- | --- |\n| `app.py` | Root Space entrypoint. Loads and launches the Gradio server. |\n| `app/app.py` | Public app server, map-generation controls, LLM map path, agent rollout loop, saved maps, and frontend serving. 
|\n| `app/frontend/` | React/Three.js game frontend: map builder, replay viewer, and playable race mode. |\n| `env.py` | Pure Torch parkour simulation: movement physics, observations, rewards, landings, falling, and completion. |\n| `runner.py` | Reinforcement learning agent architectures. The active model is the token-attention controller. |\n| `mapgen.py` | Procedural route motifs, distractor placement, trap placement, reachability checks, and geometry repair. |\n| `train.py`, `infer.py`, research scripts | Training and evaluation code used to build and inspect the agent. This code is included for transparency; the public app uses the trained model. |\n| `settings.py`, `config.py`, `runtime.py` | Runtime defaults, model path, physics constants, and device selection. |\n| `replay.py` | Replay serialization utilities. |\n\n## Reinforcement Learning Agent\n\nThe agent is the core technical experiment: can a neural controller learn to\nsolve 3D parkour puzzles through long sequences of movement decisions? It is\ntrained end-to-end inside a custom 3D parkour simulation using reinforcement\nlearning. The submitted behavior uses a controller that reads local state and\ndirectly outputs movement actions.\n\nA good run requires route solving as well as movement control. Some platforms\nact as bait, some routes bend away from the obvious goal direction, some jumps\nrequire committing before the goal is visually convenient, and some attractive\nbranches lead to failure. The agent has to keep making useful decisions across\nmany timesteps. Executing a single isolated jump well is only a small part of\nthe problem.\n\nThe observation design gives the agent an egocentric view of the course. The\nmodel receives local velocity, grounded state, the final goal as a local beacon,\nyaw, time remaining, and nearby platform tokens. 
Each platform token describes a\nplatform relative to the agent's current position, so the route has to be read\nduring the run from the agent's point of view.\n\nThe current agent uses a 188-number observation: 12 base values plus 16\nnearest-platform tokens with 11 values each, so `12 + (16 * 11) = 188`. The\ntoken tensor reserves a few internal platform-state bits, while the public map\nmodes in this submission are driven by geometry, goal visibility, and the\nagent's movement state.\n\n```text\nbase =\n  local forward / strafe / vertical velocity, divided by [6, 6, 8]\n  grounded flag\n  final-goal local x/y divided by 24\n  final-goal local z divided by 4\n  final-goal xy distance divided by 24\n  sin(yaw)\n  cos(yaw)\n  time remaining\n  progress slot, disabled for the current agent and therefore 0\n\nplatform token summary, repeated for the 16 nearest visible platforms =\n  hit flag\n  local platform x divided by 9.5 sensor range\n  local platform y divided by 9.5 sensor range\n  local platform z divided by 3\n  platform width divided by 1.7\n  platform depth divided by 1.5\n  platform distance divided by 9.5 sensor range\n  is final goal\n  reserved internal platform-state bits\n```\n\nThe active model uses 16 nearest-platform tokens, distance sorting, a 120 degree\nfield of view, and disabled progress observati"
  },
  {
    "repo_id": "build-small-hackathon/lingo-bridge",
    "title": "Lingo Bridge",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/lingo-bridge",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "lingo",
      "bridge",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Lingo Bridge\nemoji: 🌉\ncolorFrom: purple\ncolorTo: indigo\nsdk: docker\napp_port: 7860\npinned: true\nlicense: apache-2.0\nshort_description: Watch & hear a sentence gradually become another language.\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:tinytitan\n  - achievement:fieldnotes\n  - minicpm\n  - small-models\n  - translation\n  - tts\n---\n\n# 🌉 Lingo Bridge\n\n> **Watch and hear** a sentence gradually become another language — phrase by phrase, layer by layer.\n\nMost translators show you a destination. **Lingo Bridge shows you the journey.** One sentence becomes a **seven-stage transformation** from the source language to the target — meaning crosses first, then actions, then time words, then grammar glue, and finally the word order rearranges into something natural — rendered as an interactive **3D card stack** and **spoken aloud at every stage**. A language *toy*, not a translator.\n\n## 🎬 Demo\n\n▶ **[Demo video](docs/demo.mp4)**  ·  📣 **Social posts:** [X/Twitter](https://x.com/auracanvas/status/2066666490871558485) · [Bluesky](https://bsky.app/profile/auracanvas.bsky.social/post/3moehclek7k26)\n\n![Lingo Bridge](docs/poster.png)\n\n## 💡 The idea & tech (write-up)\n\nA single structured call to a **small text model (Qwen3-4B-Instruct, via llama.cpp)** decomposes the sentence into aligned phrase *units* `{source, target, type, order_target}`. The seven progressive layers, the purple→cyan colours, and the phrase-to-phrase links are then built **deterministically in Python** — so the JSON stays simple and **every link is valid by construction**. Phrases flip to the target language *by type* (so each layer is one coherent move, never random words), and word order migrates near the end, producing crossing ribbons. Each layer is spoken by **OpenBMB VoxCPM2** (a TTS model built on the **MiniCPM-4** backbone, 30 languages). 
The UI is a fully custom **Three.js** card stack mounted inside a Gradio Space; the GPU models run on **Modal** (scale-to-zero), with the demo examples pre-rendered (layers + audio) so they play instantly.\n\n## 🧠 Models — each well under the 32B cap\n\n| Role | Model | Size | Runtime |\n|------|-------|------|---------|\n| Text (decompose + align) | [`Qwen/Qwen3-4B-Instruct-2507`](https://huggingface.co/Qwen/Qwen3-4B-Instruct-2507) (Q4_K_M GGUF) | **4B** | llama.cpp |\n| Speech (per-layer TTS) | [`openbmb/VoxCPM2`](https://huggingface.co/openbmb/VoxCPM2) — built on **MiniCPM-4** | **2B** | voxcpm (GPU) |\n\n## 🌍 Languages (10)\n\nEnglish · Spanish · French · Italian · Portuguese · German · Russian · Japanese · Korean · Chinese — any pair, either direction.\n\n## 🏆 What we're entered for\n\n- **Track — Thousand Token Wood** (a delightful, AI-native language toy).\n- 🎨 **Off Brand** (`achievement:offbrand`) — a fully custom Three.js UI, far past the default Gradio look, mounted via `gr.mount_gradio_app`.\n- 🦙 **Llama Champion** (`achievement:llama`) — the text model (Qwen3-4B) runs through the **llama.cpp** runtime.\n- 🐜 **Tiny Titan** (`achievement:tinytitan`) — every model is ≤4B (Qwen3-4B + VoxCPM2 2B).\n- 📓 **Field Notes** (`achievement:fieldnotes`) — see *What I learned* below.\n- 🎬 **Best Demo** — app + demo video + social post.\n- 🏅 **Bonus Quest Champion** — multiple bonus criteria met.\n- **OpenBMB · Best MiniCPM Build** — speech by **VoxCPM2 (MiniCPM-4 backbone)**.\n- **Modal · Best Use of Modal** — Qwen3-4B + VoxCPM2 run on Modal (L4, scale-to-zero); see Architecture.\n\n## 📓 What I learned (field notes)\n\n- **Push structure into Python, not the prompt.** Asking the LLM for the full 7-layer graph produced broken links. Asking for *one* thing — aligned phrase units `{source, target, type, order_target}` — and building the layers deterministically in Python made **every link valid by construction**. 
The model does the part only a model can; code does the rest.\n- **Small genuinely won on latency.** Qwen3-4B nailed the decomposition across all 10 languages. I tried NVIDIA's Nemotron-9B-v2 for a sponsor prize, but its hybrid-Mamba decode took >120s for a single interactive translation — unusable for a *toy*. The 4B model was both good enough and fast enough.\n- **VoxCPM2 reads mixed-language text directly.** No language tag needed, which is exactly what the hybrid intermediate layers (half source, half target) require — a per-language TTS would have choked on them. Reusing one anchor clip kept the narrator voice consistent across layers.\n- **Thin Space + Modal GPU is the right split.** A free CPU Space serving the custom UI and proxying model calls to a scale-to-zero Modal L4 keeps the Space light and the GPU cheap, while pre-rendering the demo examples (layers **and** audio) makes the toy feel instant even on a cold backend.\n\n## 🏗️ Architecture\n\nA thin **Gradio Space (free CPU)** serves the custom UI and **proxies model calls to a Modal L4 GPU** that runs Qwen3-4B (llama.cpp) + VoxCPM2. The Space stays light and the GPU scales to zero. The 🎲 *Surp"
  },
  {
    "repo_id": "build-small-hackathon/signal-garden",
    "title": "Signal Garden",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/signal-garden",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "signal",
      "garden",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 2,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Signal Garden\nemoji: 🌱\ncolorFrom: green\ncolorTo: indigo\nsdk: docker\napp_port: 7860\ntags:\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# Signal Garden: A Game Engine That Keeps Mutating\n\nNico Pellereti | Sree Harsha Nelaturu | Alonso Urbano | Max Zimmer\n\n![Signal Garden Signal Heist round screenshot](assets/blog/screenshots/signal-heist.png)\n\nFor Build Small, we built **Signal Garden**: a tiny sci-fi arcade engine where an LLM keeps inventing new play while the player stays in the loop.\n\n[Watch the Signal Garden demo video](demo-video-final.mp4).\n[Repository reference](https://github.com/n-pelleriti/hackathon-clean).\n\nWe started with Codex building the engine: a shared grid renderer, input model, scoring, health, objectives, effects, and CC0 audio/assets. The key design goal was that a new mode should be small. Games like snake, tower defense, a reactor run, or a boss fight can be implemented as compact `GameMode` classes, often under 100 lines.\n\n![Signal Garden board key screenshot](assets/blog/screenshots/game-engine.png)\n\nOnce the engine worked, we used Codex to generate a suite of mini-games. Then we tuned them with two feedback sources: bots that test simple strategies, and our own playtesting. That let us adjust difficulty, health, enemy counts, win targets, and pacing without guessing.\n\n## The Latency Problem\n\nThen we started the open-ended loop: an LLM continuously mutates games by adding features, changing behavior, and proposing new variants.\n\nThe catch is latency. Full semantic code edits are slow. Some base games were evolved by Qwen 3.5 through larger rewrites, but live play cannot wait for a rewrite every round. 
If a player clears a game before the model finishes, the engine still needs to serve something new.\n\nSo Signal Garden uses a hierarchy of changes:\n\n```mermaid\nflowchart TD\n    A[Player clears or requests a new round] --> B[Instant heuristic parameter changes]\n    B --> C[Validated LLM JSON challenge changes]\n    C --> D[Background semantic code edits]\n    D --> E[New mechanics available for later rounds]\n```\n\n1. **Heuristic parameters:** instant changes to health, enemy count, target score, wave pressure, and starting state.\n2. **Structured LLM JSON:** validated proposals, recorded as JSONL, for bounded changes like spawning enemies, adding helpers, clearing pressure, or tuning supported goals.\n3. **Semantic code edits:** slower background method-level changes, validated as Python diffs before they affect live play.\n\nThis means the player always gets a fresh game immediately, while deeper semantic mutations keep generating in the background.\n\n![Signal Garden parametric changes screenshot](assets/blog/screenshots/parametric-changes.png)\n\n![Signal Garden code edit proposal screenshot](assets/blog/screenshots/code-edit.png)\n\n## Fine-Tuned For Game Generation\n\nFinally, we fine-tuned the model with SFT for this specific task: continuously generating Signal Garden games and safe mutations. 
We collected play traces, LLM proposals, validator results, and human feedback, then corrected or rejected outputs based on clarity, fairness, visibility, mode fit, and fun.\n\nThe model learned the house style:\n\n- Make readable changes, not noisy chaos.\n- Stay inside supported actions.\n- Keep live mutations small.\n- Avoid changing win conditions mid-round.\n- Match the mode's actual mechanics.\n\n![Signal Garden Gravity Well screenshot](assets/blog/screenshots/gravity_well.png)\n\n![Signal Garden Snake Labyrinth screenshot](assets/blog/screenshots/snake.png)\n\n![Signal Garden Reactor Breach screenshot](assets/blog/screenshots/reactor.png)\n\nSignal Garden is small, but the loop is open-ended: quick parameter changes prevent repetition, structured JSON keeps rounds moving, and background semantic edits keep expanding what the arcade can become.\n\n## Run\n\n```bash\nuv run python app.py\n```\n\nFor the full LLM arcade loop, run a llama.cpp OpenAI-compatible server at `http://127.0.0.1:8080/v1` before starting the app. The default model alias is `signal-garden-qwen35-code-mutation`.\n\n## Runtime Requirements\n\nThe Gradio app requirements are installed by `uv sync` from `pyproject.toml`: Python 3.10+, Gradio, and NumPy. The LLM path does not require the OpenAI Python package; it uses stdlib HTTP against an OpenAI-compatible `/v1/chat/completions` endpoint.\n\nFor local LLM play, install `llama-server` from llama.cpp and serve the model alias at `http://127.0.0.1:8080/v1`. Check the Python app, `uv`, local `llama-server`, and live model endpoint with:\n\n```bash\nuv run python scripts/check_runtime_requirements.py --require-live-llm\n```\n"
  },
  {
    "repo_id": "build-small-hackathon/README",
    "title": "Readme",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/README",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "readme",
      "summary",
      "available",
      "yet.",
      "static",
      "region"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 8,
    "sdk": "static",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: README\nemoji: 🏃\ncolorFrom: green\ncolorTo: indigo\nsdk: static\npinned: false\nthumbnail: >-\n  https://cdn-uploads.huggingface.co/production/uploads/60d2dc1007da9c17c72708f8/-lbDR4njKiyD17zNsO3GG.png\n---\n<style>\n/* ---------- scoped to .bsh so it can't leak into the README around it ---------- */\n.bsh {\n--cream: #f6efe1;\n--paper: #fbf6e8;\n--walnut: #5a3a22;\n--bark:  #3a2516;\n--ink:   #2a2118;\n--copper:#c98a3c;\n--rust:  #8a4a2b;\n--moss:  #5a6b3a;\n--sage:  #7a8c4a;\n--sun:   #e6a85c;\n--mush:  #b8553a;\n--line:  #8a6a48;\n\n  \n/* --- responsive helpers (no media queries) --- */\n/* Fluid horizontal padding: 22px on narrow, up to 44px on wide */\n--gutter: clamp(22px, 4.5vw, 44px);\n/* 0 below 760px, 1 at and above. Used to hide the seal. */\n--wide: clamp(0px, (100vw - 760px) * 9999, 1px);\n\nmax-width: 1000px;\nmargin: 0 auto;\npadding: 0;\ncolor: var(--ink);\nfont-family: Georgia, \"Iowan Old Style\", \"Palatino Linotype\", Palatino, serif;\nfont-size: 16px;\nline-height: 1.55;\nbackground:\nradial-gradient(ellipse at 20% 10%, rgba(201,138,60,0.08), transparent 50%),\nradial-gradient(ellipse at 80% 90%, rgba(90,107,58,0.08), transparent 50%),\nrepeating-linear-gradient(0deg, rgba(138,106,72,0.025) 0 1px, transparent 1px 3px),\nvar(--cream);\nborder: 2px solid var(--walnut);\nborder-radius: 14px;\nbox-shadow:\ninset 0 0 0 1px var(--cream),\ninset 0 0 0 6px var(--walnut),\ninset 0 0 0 8px var(--cream),\ninset 0 0 0 9px var(--copper),\n0 6px 24px rgba(58,37,22,0.18);\noverflow: hidden;\n}\n.bsh * { box-sizing: border-box; }\n.bsh .pad { padding: 36px var(--gutter); }\n\n/* ---------- type ---------- */\n.bsh h1, .bsh h2, .bsh h3 { font-family: Georgia, \"Iowan Old Style\", serif; color: var(--bark); margin: 0; letter-spacing: -0.01em; }\n/* h1: fluid between 40px and 56px */\n.bsh h1 { font-size: clamp(40px, 7vw, 56px); line-height: 0.95; font-weight: 700; }\n.bsh h2 { font-size: 28px; line-height: 1.1; font-weight: 
700; }\n.bsh h3 { font-size: 18px; line-height: 1.2; font-weight: 700; }\n.bsh p  { margin: 0 0 12px 0; }\n.bsh .small { font-size: 13px; }\n.bsh .mono { font-family: \"SFMono-Regular\", Menlo, Consolas, monospace; }\n.bsh .caps { text-transform: uppercase; letter-spacing: 0.18em; font-size: 11px; font-weight: 700; }\n.bsh .muted { color: var(--walnut); opacity: 0.78; }\n.bsh a { color: var(--rust); text-decoration: underline; text-decoration-style: dotted; text-underline-offset: 3px; }\n.bsh a:hover { color: var(--bark); text-decoration-style: solid; }\n\n/* ---------- top strip ---------- */\n.bsh .strip {\ndisplay: flex; align-items: center; justify-content: space-between; gap: 24px;\npadding: 14px var(--gutter);\nbackground: var(--bark);\ncolor: var(--cream);\nborder-bottom: 1px dashed rgba(246,239,225,0.35);\nflex-wrap: wrap;\n}\n.bsh .strip .who { display: flex; gap: 12px; flex-wrap: wrap; align-items: center; font-size: 12px; }\n.bsh .strip .who span.caps { color: var(--sun); letter-spacing: 0.06em; }\n.bsh .strip .dates { font-family: Georgia, serif; font-style: italic; font-size: 14px; color: var(--cream); }\n.bsh .strip .dates b { color: var(--sun); font-style: normal; font-weight: 700; letter-spacing: 0.02em; }\n\n/* ---------- sponsors section ---------- */\n.bsh .sponsors {\npadding: 36px var(--gutter) 30px;\n/* No solid background here — leaving it transparent lets the parent\n   .bsh inset box-shadow frame (copper + walnut) remain visible through\n   this section. A subtle radial highlight is still painted on top. 
*/\nbackground:\nradial-gradient(ellipse at 50% 0%, rgba(201,138,60,0.10), transparent 60%);\nborder-bottom: 1px solid rgba(138,106,72,0.25);\nposition: relative;\n}\n.bsh .sponsors::before {\ncontent: \"✦ ✦ ✦\";\nposition: absolute;\ntop: 14px; left: 50%; transform: translateX(-50%);\ncolor: var(--copper);\nletter-spacing: 0.6em;\nfont-size: 10px;\nopacity: 0.6;\n}\n.bsh .sponsors .sponsor-head {\ntext-align: center;\nmargin-bottom: 22px;\n}\n.bsh .sponsors .sponsor-head .caps {\ncolor: var(--moss);\ndisplay: inline-flex; align-items: center; gap: 10px;\n}\n.bsh .sponsors .sponsor-head .caps::before,\n.bsh .sponsors .sponsor-head .caps::after {\ncontent: \"❋\"; color: var(--copper); font-size: 12px;\n}\n.bsh .sponsors .sponsor-head h2 {\nmargin-top: 8px;\nfont-size: clamp(24px, 4vw, 32px);\n}\n.bsh .sponsors .sponsor-head h2 em { font-style: italic; color: var(--rust); }\n.bsh .sponsors .sponsor-head p {\nmargin-top: 6px;\nfont-size: 14px;\ncolor: var(--walnut);\nfont-style: italic;\n}\n\n.bsh .tier-label {\ndisplay: flex; align-items: center; gap: 14px;\ncolor: var(--rust); font-weight: 700;\nmargin: 22px 0 14px;\nfont-size: 11px; letter-spacing: 0.22em; text-transform: uppercase;\n}\n.bsh .tier-label::before, .bsh .tier-label::after {\ncontent: \"\"; flex: 1; height: 0;\nborder-top: 1px dashed var(--walnut);\nopacity: 0.5;\n}\n\n/* host contribution card — Hugging Face's own stake in the pool */\n.bsh .host-card {\nbackground: var(--paper);\nborder: 2px solid var(--walnut);\nborder-radius: 12px;\npadding: 18px 22px;\ndisplay: flex; align-items: center; gap: 20px;\nflex-wrap: wrap;\nbox-shadow:\ninset 0 0"
  },
  {
    "repo_id": "build-small-hackathon/ai-study-buddy",
    "title": "Ai Study Buddy",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/ai-study-buddy",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "track",
      "learning",
      "buddy",
      "summary"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Try one learning prompt and check if the explanation is clear.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: Ai Study Buddy\nemoji: 📉\ncolorFrom: blue\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.13'\napp_file: app.py\npinned: true\nlicense: apache-2.0\nshort_description: 'AI Study Buddy — your smart learning companion 📚 '\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\nGitHub repository: https://github.com/iareebaiqbal/ai-study-buddy.git\n\nLinkedIn post: https://www.linkedin.com/posts/areeba-i-3079973a4_python-ai-huggingface-ugcPost-7471858104909848576-H8-B/?utm_source=share&utm_medium=member_android&rcm=ACoAAGMZ9fcBXYwH3zCGCG7ycupqVGkGAtA6EQM\n"
  },
  {
    "repo_id": "build-small-hackathon/surgical-tissue-segmentation",
    "title": "Surgical Tissue Segmentation",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/surgical-tissue-segmentation",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "track",
      "surgical",
      "tissue",
      "segmentation",
      "summary"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: SurgiSight\nemoji: 🔬\ncolorFrom: indigo\ncolorTo: purple\nsdk: gradio\nsdk_version: \"5.29.0\"\npython_version: \"3.10\"\napp_file: app.py\npinned: false\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n<div align=\"center\">\n\n<img src=\"https://img.shields.io/badge/YOLOv26n--seg-Ultralytics-6366f1?style=for-the-badge&logo=python&logoColor=white\"/>\n<img src=\"https://img.shields.io/badge/Llama_3.1_8B-Meta_AI-8b5cf6?style=for-the-badge&logo=meta&logoColor=white\"/>\n<img src=\"https://img.shields.io/badge/Modal_GPU-T4_Inference-22c55e?style=for-the-badge\"/>\n<img src=\"https://img.shields.io/badge/Gradio-HuggingFace_Spaces-f97316?style=for-the-badge&logo=huggingface&logoColor=white\"/>\n<img src=\"https://img.shields.io/badge/CholecSeg8k-MICCAI_2020-ef4444?style=for-the-badge\"/>\n\n# 🔬 SurgiSight\n\n### Surgical Anatomy AI for Laparoscopic Training\n\n**Real-time danger-zone detection + AI anatomy explanations for surgical trainees.**  \nBuilt for **Build Small Hackathon 2026** — solo project, fully deployed, end-to-end.\n\n---\n\n🚀 **[Watch Demo](https://www.youtube.com/watch?v=Z-jaj31B-ss)**  &nbsp;|&nbsp; 📄 **[Blog](https://huggingface.co/blog/sugan04/surgical-tissue-segmentation)** &nbsp;|&nbsp; 🤗 **[HuggingFace Space](https://huggingface.co/spaces/build-small-hackathon/surgical-tissue-segmentation)**  &nbsp;|&nbsp; **[Social media - LinkedIn](https://www.linkedin.com/posts/sugan-subramanian_ai-machinelearning-medicalai-ugcPost-7469109783028076544-TVeL/?utm_source=share&utm_medium=member_desktop&rcm=ACoAACixJ8kBbDBD81FWoNnyJCVWR4Lrg1EcVv0)** &nbsp;|&nbsp; 📄 **[Agent traces](https://huggingface.co/datasets/sugan04/surgisight-traces)**\nHF ID: sugan04\n---\n\n</div>\n\n---\n\n##  The Problem\n\nEvery year, bile duct injuries occur in roughly **1 in 300 laparoscopic cholecystectomies** (gallbladder 
removal surgeries). This is the most common serious complication in one of the most frequently performed surgeries in the world (~1.2 million per year in the US alone). Many of these injuries happen because trainees, operating under pressure in a visually complex, blood-filled field, cannot reliably identify critical structures in real time.\n\nCurrent surgical training relies on:\n- **Static textbook diagrams**: no relevance to live video\n- **Senior surgeon supervision**: not always available, and adds cognitive load\n- **Experience alone**: acquired over years, with real patients\n\nThere is no tool that watches the surgical video alongside a trainee and says: *\"That's the hepatic vein. Don't touch it.\"*\n\n**SurgiSight is that tool.**\n\n---\n\n## The Solution\n\nSurgiSight is an AI assistant for laparoscopic surgical training that:\n\n1. **Segments** any laparoscopic cholecystectomy frame using a fine-tuned YOLOv26n-seg instance segmentation model, identifying 13 surgical structures in real time.\n2. **Flags danger zones** automatically: Hepatic Vein, Cystic Duct, and Blood trigger a red alert.\n3. **Explains the anatomy** using Meta Llama 3.1 8B, giving the trainee a 3-sentence teaching note grounded in the detected context.\n4. **Enables interactive Q&A**: the trainee can ask follow-up questions in natural language (\"Why is the cystic duct dangerous here?\") and get expert-level answers.\n5. **Exports clinical-grade reports** in both PDF and Word format, suitable for case review or portfolio use.\n6. 
**Supports multilingual responses** (English and French), with text-to-speech for each AI reply.\n\nEverything runs in a single Gradio interface, deployed on Hugging Face Spaces, with GPU inference handled by Modal.\n\n---\n\n##  Architecture\n\n```\n┌──────────────────────────────────────────────────────────────────┐\n│                     HUGGING FACE SPACES                          │\n│                      (Gradio Frontend)                           │\n│                                                                  │\n│   ┌─────────────┐    ┌──────────────────┐    ┌───────────────┐  │\n│   │  Image      │    │  Results Card    │    │  AI Chat      │  │\n│   │  Upload     │ →  │  (Detections,    │    │  (Llama 3.1   │  │\n│   │  + Conf     │    │   Alert, Brief)  │    │   8B via HF)  │  │\n│   │  Slider     │    └──────────────────┘    └───────────────┘  │\n│   └──────┬──────┘                                               │\n│          │ PIL Image bytes                                        │\n└──────────┼───────────────────────────────────────────────────────┘\n           │\n           ▼ modal.Cls.from_name() remote call\n┌──────────────────────────────────────────────────────────────────┐\n│                        MODAL (GPU T4)                            │\n│                                                                  │\n│   ┌─────────────────────────────────────────────────────────┐   │\n│   │  SurgiSightDetector.run()                               │   │\n│   │  ├── Load YOLOv26n-seg weights (CholecSeg8k fine-tune)   │   │\n│   │  ├── Run instance segmentation"
  },
  {
    "repo_id": "build-small-hackathon/MiniCPM5-1B-Agent",
    "title": "Minicpm5 1B Agent",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/MiniCPM5-1B-Agent",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "track",
      "minicpm5",
      "agent",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 1,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: MiniCPM5-1B-Agent\nemoji: 🛠️\ncolorFrom: gray\ncolorTo: yellow\nsdk: docker\napp_port: 7860\npinned: true\n# Build Small Hackathon tags: official tracks/sponsors/badges + descriptive build facts (for reviewers).\ntags:\n  - best-minicpm-build   # sponsor: full fine-tune of MiniCPM5-1B (core entry)\n  - backyard-ai          # track: local, self-hosted AI on CPU\n  - best-use-of-codex    # sponsor: code + Codex-attributed commits on GitHub\n  - best-use-of-modal    # sponsor: GGUF evaluated on Modal (see \"How it was built\")\n  - off-brand            # badge: custom UI well past the default Gradio look\n  - best-agent           # badge: the write -> run -> read -> debug -> verify loop\n  - best-demo            # badge: GIF + video demo\n  - tiny-titan           # badge: a 1B doing the real agentic loop\n  - bonus-quest-champion # badge: most bonus criteria across the board\n  - judges-wildcard      # badge: auto-considered for every entry\n  - well-tuned           # full fine-tune of MiniCPM5-1B, published on the Hub\n  - llama-champion       # served on the llama.cpp runtime\n  - off-the-grid         # runs fully local on a CPU, no cloud model APIs\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n---\n\n# 🛠️ MiniCPM5-1B-Agent\n\n**A tiny agentic coding agent that runs the whole write → run → read → debug → verify loop on a free CPU.**\n\nSocial [media post link](https://discord.com/channels/879548962464493619/1514734596930142218); demo video:\n<table><tr>\n<td><img src=\"minicpm5-1b-agent-demo.gif\" alt=\"MiniCPM5-1B-Agent demo GIF\" width=\"480\"></td>\n<td><video src=\"https://huggingface.co/spaces/build-small-hackathon/MiniCPM5-1B-Agent/resolve/main/long_demo_build-small-hackathon_MiniCPM5-1B-Agent_16x.mp4\" controls 
width=\"480\"></video></td>\n</tr></table>\n\nA full fine-tune of [`openbmb/MiniCPM5-1B`](https://huggingface.co/openbmb/MiniCPM5-1B) (1B params), served as a\nQ8_0 GGUF on llama.cpp, no GPU. Give it a task; it reasons in `<think>`, then uses `bash` / `write` / `read` /\n`edit` in a sandbox to build, run, and fix code, and renders the result (charts, images, live HTML) inline in\nthe chat. Multi-turn: files and history persist across messages. It is also exposed as an **MCP tool**\n(`run_coding_task` at `/gradio_api/mcp/`).\n\n\n## What it is\n\nMost coding agents are 70B+ behind a cloud API. This is the opposite: a **1B** model doing the *real* agentic\nloop on a **2-vCPU CPU Space**, no GPU. It writes a file, runs it, reads the output, debugs, and shows you the\nartifact, the same loop a big agent runs, shrunk to something you could host in your own backyard.\n\n## How it was built\n\n- Modal platform was used to evaluate the model via inference.\n- **Data (`train_v4`, 45,762 rows):** the proven v2 backbone (retail-filtered teacher mixes + real-usage agent\n  traces) kept whole, plus ~3,538 curated additions, gated to a small served tool vocab and solution-aware\n  MinHash-deduped. Bundled on the model repo under `dataset/`.\n- **SFT:** full fine-tune (not LoRA, the long agentic mix needs the capacity) of the abliterated base, 1 epoch,\n  24k context, fits in ~15-18 GB VRAM (direct Liger fused cross-entropy + mem-efficient SDPA).\n- **DPO (on-policy):** run the SFT model over the training prompts and capture its OWN behaviour. *chosen* = a\n  valid `<function>` tool call; *rejected* = its real miss (rambling in `<think>` / answering with no call).\n  ~649 pairs. 
This rewards ACTING over stalling.\n- **Serving:** Q8_0 GGUF on llama.cpp; a two-phase decode bounds the `<think>` separately from the action so the\n  model acts instead of looping; produced files render inline (charts, images, sandboxed live-HTML iframes).\n\n## Try it\n\n- \"Write a Python script that makes a bar chart of 30, 45, 25 (A, B, C), save chart.png, then run it.\" → writes + runs it; the PNG renders inline.\n- \"Write an HTML page quote.html with a button that shows a random quote each click (hard-coded, no internet).\" → writes the file; renders live in a sandboxed iframe.\n\nIt is a tiny 1B on a free CPU: expect **~4 min per simple turn**, longer for multi-step tasks (the demo video\nshows it working end-to-end, so it can be judged even if a live run is slow).\n\n\n<details><summary><b>Output examples</b></summary>\n\nReal single-turn outputs from the Q8_0 GGUF, generated on Modal (CPU):\n\n**Prompt:** Write a Python function `fib(n)` that returns the nth Fibonacci number iteratively. Print fib(10).\n\n```python\ndef fib(n):\n    if n == 0:\n        return 0\n    elif n == 1:\n        return 1\n    a, b = 0, 1\n    for _ in range(2, n + 1):\n        a, b = b, a + b\n    return b\n\nprint(fib(10))\n```\n\n**Prompt:** Convert an integer (1-3999) to a Roman numeral; demonstrate on 1994.\n\n```python\ndef int_to_roman(num):\n    if not 1 <= num <= 3999:\n        raise ValueError(\"Number must be between 1 and 3999\")\n    roman_map = [(1000,'M'),(900,'CM'),(500"
  },
  {
    "repo_id": "build-small-hackathon/Case-Lantern",
    "title": "Case Lantern",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Case-Lantern",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "case",
      "lantern",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Case Lantern\ncolorFrom: pink\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.15.2\napp_file: app.py\npinned: false\nlicense: apache-2.0\nmodels:\n  - lastmass/Qwen3.5-Medical-GSPO\n\n---\n\n# 🏮 Case Lantern\n\nShort demo video:\n\nhttps://youtu.be/Bf3t5Cq6XuA\n\nCase Lantern is a fictional medical mystery game for the\n[Build Small Hackathon](https://huggingface.co/build-small-hackathon).\nPlayers investigate a short Chinese case, request clues, avoid red herrings, and\nsubmit a diagnosis within six turns.\n\nThe experience uses [`lastmass/Qwen3.5-Medical-GSPO`](https://huggingface.co/lastmass/Qwen3.5-Medical-GSPO),\na small Chinese medical reasoning model with roughly 4.66B parameters, as the\ngame master and scorer. Inference runs locally via **llama.cpp** (GGUF Q4_K_M).\n\n## Track & Merit Badges\n\n| Item | Detail |\n|------|--------|\n| Track | An Adventure in Thousand Token Wood |\n| AI role | Load-bearing game master, clue writer, and scoring judge |\n| Constraint | Small model under 32B parameters |\n| UI | Gradio Space with custom dark frontend |\n\n| Badge | Status |\n|-------|--------|\n| 🏕️ Off the Grid (LOCAL-FIRST) | ✅ Model runs locally in the Space |\n| 🎸 Well-Tuned (FINE-TUNED) | ✅ Uses fine-tuned model published on HF |\n| 🦙 Llama Champion | ✅ Runs via llama.cpp runtime |\n| 🎨 Off-Brand (CUSTOM UI) | ✅ Dark glassmorphism theme, custom CSS |\n\n## Safety framing\n\nThis is not a diagnosis or treatment tool. It only uses fictional cases for\nlearning-oriented gameplay. Users are explicitly asked not to provide personal\nhealth information.\n\n## Deployment notes\n\nThe app is designed for **free CPU Spaces** on Hugging Face. It does not require\na GPU. 
The GGUF model (~2.78 GB, Q4_K_M) is downloaded from the Hub at first\nlaunch and cached.\n\nIf you deploy on **ZeroGPU**, keep the prebuilt CPU `llama-cpp-python` wheel.\nThe `requirements.txt` file uses the CPU wheel index\n(`llama-cpp-python/whl/cpu`) plus `--only-binary=llama-cpp-python`, and pins to\nthe latest available prebuilt wheel in that index. This keeps the Space from\ntrying to compile llama.cpp from source. Do not use the CUDA wheel URL\n(`llama-cpp-python/whl/cu124`) unless the Space image also provides CUDA runtime\nlibraries such as `libcudart.so.12`; otherwise model loading can fail when the\nfirst button click triggers inference.\n\n- Set `DEMO_MODE=auto` (default) to allow a graceful scripted fallback if the\n  model cannot load.\n- Set `DEMO_MODE=true` to skip model loading entirely (instant UI-only demo).\n- Set `DEMO_MODE=off` if you want model-loading failures to surface immediately.\n\n## Local run\n\n```bash\npip install -r requirements.txt\nDEMO_MODE=true python app.py\n```\n\nOn Windows PowerShell:\n\n```powershell\n$env:DEMO_MODE=\"true\"\npython app.py\n```\n"
  },
  {
    "repo_id": "build-small-hackathon/roast-my-repo",
    "title": "Roast My Repo",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/roast-my-repo",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "roast",
      "repo",
      "summary",
      "available"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Roast My Repo\nemoji: 🔥\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\nsdk_version: 5.29.0\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: AI-powered brutal code review for your GitHub repos\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# 🔥 Roast My Repo\n\n> Paste a GitHub URL. Brace yourself.\n\nAI-powered code review that tells you what your friends won't. Built for the [HuggingFace Build Small Hackathon](https://huggingface.co/build-small-hackathon) — Chapter One: Backyard AI.\n\nPowered by **[MiniCPM4-8B](https://huggingface.co/openbmb/MiniCPM4-8B)** (OpenBMB) served via **[Modal](https://modal.com)**.\n\n---\n\n## What it does\n\nPaste any public GitHub repo URL and get:\n\n- 🔥 **The Roast** — brutal, funny, specific critique referencing actual filenames and code\n- 📊 **Scorecard** — rated across Code Quality, Documentation, Security, Structure, and Portfolio Value\n- 🚨 **Red Flags** — specific issues found in this repo, not generic advice\n- 📄 **Generated README** — a production-quality README you can copy and use immediately\n- 💼 **Hire Me Score** — would a recruiter close the tab or keep reading?\n\n---\n\n## Who it's for\n\nFinal-year CS students and junior developers who want honest feedback on their GitHub portfolio before applying for jobs. 
Built because most people's repos look worse than their actual skills — and nobody tells them.\n\n---\n\n## Tech Stack\n\n| Layer | Technology |\n|---|---|\n| UI | Gradio 5 (custom terminal CSS) |\n| Inference | [MiniCPM4-8B](https://huggingface.co/openbmb/MiniCPM4-8B) via vLLM on Modal |\n| Serving | Modal A10G GPU · OpenAI-compatible `/v1/chat/completions` |\n| Repo fetching | GitHub REST API (tree + contents) |\n\n\n---\n\n## Why MiniCPM4-8B?\n\nMiniCPM4-8B from OpenBMB packs serious reasoning quality into 8B parameters — trained on 8 trillion tokens. It fits comfortably on a single A10G (24GB VRAM) in fp16, keeps Modal costs low, and handles code review prompts with chain-of-thought quality that rivals much larger models. For a hackathon constraint of \"small model, real output\", it's the right call.\n\n---\n\n## How it works\n\n```\nGitHub URL\n    │\n    ▼\nFetch repo metadata + file tree + up to 12 key files (GitHub API)\n    │\n    ▼\nBuild context string → two sequential MiniCPM4-8B calls\n    │\n    ├── Call 1: Structured JSON  (roast · scorecard · red_flags · hire_score)\n    └── Call 2: Plain markdown   (generated README — avoids JSON escape hell)\n    │\n    ▼\nRender terminal UI (Gradio + custom CSS)\n```\n\n---\n\n## Local Setup\n\n### Prerequisites\n\n- Python 3.11+\n- A [Modal](https://modal.com) account (free tier works)\n- A [GitHub token](https://github.com/settings/tokens) (for higher rate limits)\n- Optional: [Groq API key](https://console.groq.com) for local dev without Modal\n\n### Install\n\n```bash\ngit clone https://huggingface.co/spaces/Yokiatch/roast-my-repo\ncd roast-my-repo\npip install -r requirements.txt\n```\n\n### Configure\n\nCreate a `.env` file:\n\n```env\nMODAL_ENDPOINT=https://your-workspace--roast-my-repo-serve.modal.run\nGITHUB_TOKEN=your-github-token\n\n```\n\n### Deploy the Modal inference server\n\n```bash\nmodal deploy modal_app.py\n```\n\nCopy the printed URL into `MODAL_ENDPOINT` in your `.env`.\n\n### Run 
locally\n\n```bash\npython app.py\n```\n\n---\n\n## HuggingFace Space Setup\n\nAdd these under **Settings → Variables and secrets**:\n\n| Secret | Value |\n|---|---|\n| `MODAL_ENDPOINT` | Your deployed Modal URL |\n| `GITHUB_TOKEN` | GitHub personal access token |\n\nThe Space runs `app.py` directly — no other config needed.\n\n---\n\n## Project Structure\n\n```\nroast-my-repo/\n├── app.py              # Gradio UI + roast_repo handler\n├── analyzer.py         # Two-call MiniCPM4 analysis logic\n├── github_fetcher.py   # GitHub API: tree fetch + file contents\n├── modal_app.py        # vLLM server on Modal (MiniCPM4-8B)\n├── requirements.txt\n└── .env.example        # Template — never commit real secrets\n```\n\n---\n\n## Security Notes\n\n- `.env` files are **detected** (flagged as a red flag) but **never fetched** — contents are not read\n- Private repos return a clean \"not found\" error\n- `GITHUB_TOKEN` is read from Space secrets, never hardcoded\n\n---\n\n## Credits\n\n- **[OpenBMB](https://github.com/OpenBMB)** — [MiniCPM4-8B](https://huggingface.co/openbmb/MiniCPM4-8B) model\n- **[Modal](https://modal.com)** — GPU inference infrastructure\n\n---\n\n## Dataset\nOne analysis trace logged publicly for transparency: [moosetape/roast-my-repo-traces](https://huggingface.co/datasets/moosetape/roast-my-repo-traces)\n\n---\n\n## Links\n\n- 🔗 **GitHub** — [github.com/Yokiatch/roast-my-repo](https://github.com/Yokiatch/roast-my-repo)\n- 📝 **Blog Post** — [Roast My Repo — HF Blog](https://huggingface.co/blog/build-small-hackathon/roast-my-repo)\n- 💼 **LinkedIn** — [Post](https://www.linkedin.com/feed/update/urn:li:activity:7471526568008347648/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAD84PFkBGQabpy"
  },
  {
    "repo_id": "build-small-hackathon/patient-virtuel-dentiste",
    "title": "Patient Virtuel Dentiste",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/patient-virtuel-dentiste",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "modal",
      "patient",
      "virtuel",
      "dentiste",
      "summary"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Find one topic this Space teaches well.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: Patient Virtuel · Français en médecine dentaire\nemoji: 🦷\ncolorFrom: red\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.16.0\napp_file: src/server_app.py\npinned: false\nlicense: apache-2.0\ntags:\n  - hackathon\n  - build-small-hackathon\n  - backyard-ai\n  - voice-ai\n  - french-learning\n  - dental\n  - gradio\n  - modal\n  - llama-cpp\n  - gemma\n  - track:backyard\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Patient Virtuel · Français en médecine dentaire\n\nHackathon submission for [build-small-hackathon](https://huggingface.co/build-small-hackathon).\n\n**Track**: Backyard AI — a real tool for a real learner: a dental hygienist training professional French.\n\nUses `gradio.Server` with a custom HTML/CSS/JS frontend (Off-Brand Award entry).\n\n**Llama Champion badge**: Model runs through the llama.cpp runtime via `llama-cpp-python` with flash attention and L4 GPU optimizations.\n\n**Off the Grid badge**: Zero cloud APIs. 
STT → faster-whisper (local), LLM → Gemma 4 (local), TTS → piper-tts (local neural, `fr_FR-siwis-medium` voice).\n\nSee [`space_README.md`](space_README.md) for the full description.\n\n📖 [Field Notes](FIELD_NOTES.md) — architecture decisions, lessons learned, and the story behind the app.\n📡 [Agent Trace](https://huggingface.co/datasets/build-small-hackathon/patient-virtuel-trace) — curated build log on HF Hub (Sharing is Caring badge).\n\n[Demo video](https://youtu.be/oP3KrlZFUPk) · [LinkedIn post](https://www.linkedin.com/posts/carlosduplar_my-wife-is-a-dentist-re-entering-the-workforce-activity-7471897489332289536-p5Ne)\n\n## Architecture\n\n```\nsrc/\n  server_app.py       → gr.Server backend (API endpoints)\n  core.py             → Shared session logic\n  llm_engine.py       → Gemma 4 26B-A4B via Modal (llama.cpp runtime)\n  stt_engine.py       → faster-whisper via Modal\n  tts_engine.py       → piper-tts (local neural, fr_FR-siwis-medium)\n  prompts.py          → System prompts\n  parse_feedback.py   → Feedback parsing\nfrontend/\n  custom_index.html   → Custom vanilla HTML/CSS/JS frontend\n  style.css           → Styles\nmodal/\n  modal_app.py        → Modal deployments (llama.cpp + CUDA)\n```\n\n## Running locally\n\n```bash\npip install -r requirements.txt\npython src/server_app.py\n```\n\n## Deploying to HF Spaces\n\nThe Space uses `space_README.md` as its README and `src/server_app.py` as the entry point.\n"
  },
  {
    "repo_id": "build-small-hackathon/Kintsugi-Garden",
    "title": "Kintsugi Garden",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Kintsugi-Garden",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "kintsugi",
      "garden",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: The Kintsugi Garden\nemoji: 🪷\ncolorFrom: yellow\ncolorTo: gray\nsdk: gradio\nsdk_version: \"6.16.0\"\npython_version: \"3.12\"\napp_file: app.py\nthumbnail: logo.png\npinned: false\nlicense: mit\nshort_description: A symbolic mirror for dreams and inner transitions.\ntags:\n  - track:wood\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n<p align=\"center\">\n  <img src=\"logo.svg\" alt=\"The Kintsugi Garden — K and G mended by a gold seam\" width=\"220\">\n</p>\n\n# The Kintsugi Garden\n\n> *A symbolic mirror for dreams, journals, and inner transitions.*\n\n**This is not therapy, diagnosis, prediction, or advice. It is a symbolic\nreflection tool.**\n\n---\n\n## Links\n\n- **Try it live:** <https://huggingface.co/spaces/build-small-hackathon/Kintsugi-Garden>\n- **3-minute demo video:** <https://huggingface.co/spaces/build-small-hackathon/Kintsugi-Garden/resolve/main/video/render/kintsugi-garden-demo.mp4>\n  (also embedded in the app, under *About the Garden → Watch the 3-minute demo*)\n- **Shared agent traces:** <https://huggingface.co/datasets/build-small-hackathon/Kintsugi-Garden-traces>\n- **Write-up (LinkedIn):** <https://www.linkedin.com/pulse/small-model-challenge-ai-sherpa-035cc/>\n- **Social post (X):** <https://x.com/aisherpa_org/status/2065672879212257770>\n- **Track:** Thousand Token Wood (the whimsical / creative track)\n- **Bonus badges submitted for:** Off Brand, Best Demo\n- **Sponsor prize:** Modal — the model was fine-tuned on Modal (training compute)\n  - Fine-tuned on Modal for training compute — see [`scripts/modal_qlora_train.py`](scripts/modal_qlora_train.py).\n\n---\n\n## The idea\n\nThe Kintsugi Garden is a small-model symbolic reflection app. 
You give it a\ndream, a journal entry, an emotional trigger, a relationship pattern, a\nrecurring symbol, or a life transition, and it offers back a *symbolic\nreading*: a Mirror, Key Symbols, Archetypal Themes, a possible Shadow Pattern,\nan Individuation Signal, and a Gentle Question — plus a session **Soul Map** and\na deterministic mandala.\n\nLike the Japanese art of *kintsugi* — mending broken pottery with gold — the\napp treats the cracks and wounds in our inner stories as places where meaning\ncan gather, never as something to diagnose or fix.\n\n---\n\n## How it was built — small model, strong scaffolding\n\nThe Build Small Hackathon is about doing more with less. The Kintsugi Garden\nsurrounds one small language model with deterministic Python scaffolding so the\nmodel only ever has to do what it is genuinely good at — composition and tone:\n\n- a curated **symbolic lexicon** — 42 symbols, each with meanings, archetypes,\n  shadow motifs, and individuation signals (236 aliases for matching);\n- **symbol extraction** with aliases and plural handling;\n- a **four-layer safety stack** (see *Safety* below);\n- **prompt compression** — only the current entry and its extracted symbols\n  reach the model; past entries are never sent;\n- **structured, parsed output** rendered as a calm reading;\n- a **session Soul Map** that gathers recurring symbols and archetypes;\n- a **deterministic mandala** drawn with PIL — no second model.\n\nIf the language model cannot be loaded, the scaffolding still composes a\nmeaningful, fully deterministic reading on its own — the app never hard-crashes.\n\n---\n\n## The model\n\nThe live model is **[`ai-sherpa/Qwen3-8B-Kintsugi-GGUF`](https://huggingface.co/ai-sherpa/Qwen3-8B-Kintsugi-GGUF)**\n— a **QLoRA fine-tune of Qwen3-8B** trained on a small seed dataset for the\nsymbolic-reflection voice, quantized to `Q4_K_M` GGUF (~4.7 GB).\n\n**Built on Modal.** The fine-tune was trained on a single H100 on **Modal**\n(QLoRA r=16 / α=32, 
~$3 of compute) via `scripts/modal_qlora_train.py`, which\nruns the whole pipeline — base download, 4-bit quantization, LoRA, SFT, merge,\nand push-to-Hub — as one `modal run`. The live Space then loads the resulting\nGGUF. *(Modal is used for training/development; runtime inference is in-process\nllama.cpp, so the Modal claim is on the development arm.)*\n\n- **Backend:** in-process **`llama-cpp-python`** (the default). On the Space it\n  runs under `@spaces.GPU` on **ZeroGPU (A10G)** with `n_gpu_layers=-1`, so the\n  whole model offloads to the shared GPU when a worker attaches. First boot\n  downloads the GGUF to the container cache (2–5 min); later calls attach in\n  ~1–3 s.\n- **Size:** 8B parameters — comfortably under the hackathon's 32B cap.\n- **No second model, no external APIs.** The mandala is PIL; symbol content\n  comes from the lexicon. Generation is invoked with thinking disabled so the\n  output is clean Markdown prose rather than reasoning traces.\n\nTwo rollback backends remain behind an env var for parity and resilience:\n`KINTSUGI_BACKEND=transformers` (base `Qwen/Qwen3-8B` via `transformers`) and\n`KINTSUGI_BACKEND=ollama` (a local `qwen3:8b` tag). The deterministic\nscaffolding, Soul Map, mandala, and safety check are identical across all thre"
  },
  {
    "repo_id": "build-small-hackathon/mini-fam",
    "title": "Mini Fam",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/mini-fam",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "mini",
      "fam",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: MiniFam\nemoji: 🏡\ncolorFrom: yellow\ncolorTo: pink\nsdk: gradio\nsdk_version: 6.16.0\npython_version: \"3.12\"\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: 🏡 A local AI assistant for family logistics\ntags:\n  - track:backyard\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# MiniFam 🏡\n\n_The family logistics assistant that stays home: notes, recipes, meal plans and a shared calendar, run by a small open model on your own machine. Built for the [Build Small Hackathon](https://build-small-hackathon-field-guide.hf.space/)._\n\n## 🏅 Badges I'm going for\n\nMiniFam is my entry for the Build Small Hackathon. Here is the track and the badges I am submitting for:\n\n- 🏡 **Backyard AI** (the practical track): a small, local tool that solves a real problem for someone you know. For me, that someone is my own family — and they actually use it for notes, recipes and meal planning.\n- 🔌 **Off the Grid**: no cloud AI. Every model runs on your own machine (Ollama / llama.cpp) or on the Space's own GPU (ZeroGPU), and your family's data lives in plain local files. 
Nothing is sent to a model provider.\n- 🎨 **Off Brand**: the UI is a fully custom React app served from `gradio.Server`, miles past the default Gradio components.\n- 🦙 **Llama (llama.cpp)**: MiniFam speaks plain OpenAI-compatible HTTP, so it runs on llama.cpp's `llama-server` directly (the `scripts/serve-minicpm.sh` path) and locally through Ollama, which is built on llama.cpp.\n- 📓 **Field Notes**: a write-up of the build and what I learned — this README, plus a longer [blog post](https://dev.to/eloigil/minifam-the-family-assistant-that-stays-home-483p).\n\n▶ **[Live demo](https://huggingface.co/spaces/build-small-hackathon/mini-fam)** · 🎬 Demo video: [YouTube](https://youtu.be/hYdaaVkZwU0) · 📝 Blog post: [dev.to](https://dev.to/eloigil/minifam-the-family-assistant-that-stays-home-483p)\n\n![MiniFam](https://dev-to-uploads.s3.amazonaws.com/uploads/articles/11yuvvpka9b615vtalb5.png)\n\n**MiniFam is a warm little assistant for the chaos of family logistics.** You talk to it like a person — _\"plan our dinners for the week, Maya is vegetarian\"_, _\"save grandma's banana bread recipe\"_, _\"write down that Leo's shoes are size 8 now\"_ — and it keeps the shared notes, a recipe cookbook, a weekly meal plan that turns itself into a shopping list, and a family calendar. The model runs on your own machine (or on the Space's own GPU), and every piece of your family's data lives in plain files you can open yourself.\n\n### The story behind it\n\nI built this while on parental leave, with a toddler who never stops moving and a baby who is still figuring out the world. People imagine parental leave as rest. It is not — it is beautiful, it is loud, and most days it is a happy little pandemonium. MiniFam became my project in the cracks of the day: twenty minutes during a nap, an hour after bedtime, something quiet and _mine_ while the rest of the house was upside down.\n\nThe reason is simple. 
Family planning with one kid is a lot; with more than one it is honestly insane — two sets of food preferences, recipes you never remember, notes on every surface, appointments out of nowhere. I wanted a small assistant that takes a little of that weight off, that lives on our own computer, that I would actually trust with my kids' names and habits. Not a giant cloud product quietly collecting our family's life. And if it helps other families too, even better.\n\n## How it works\n\nMiniFam is a **`gradio.Server`** app that serves a hand-built **React** frontend and exposes a small JSON API. Behind the chat is one orchestrator and a registry of tools — no agent swarm — and the model behind it is a single small open model that runs locally or on the Space's own GPU.\n\n```\na family member types in the chat\n        │\n        ▼\norchestrator (agent.py)  ── one model step + a tool loop, until it answers in plain text\n        │\n        ├──► complete(messages, tools)  ──►  the model:\n        │        • local dev / paid-GPU Space:  Ollama  →  Qwen3-30B-A3B\n        │        • ZeroGPU Space:  transformers in-process  →  Qwen3-30B-A3B (4-bit)\n        │\n        ├──► tool calls  ──►  @tool registry:\n        │        notes · recipes · meal plan + shopping list · family roster · calendar\n        │\n        ▼\nplain, human-readable files  ──►  data/shared/notes.md · data/recipes/*.md · data/*.ics · data/members.json\n        │\n        ▼\nJSON  ──►  custom React UI: the reply, the \"agent working\" cards, and live views\n```\n\n### A turn, step by step\n\n1. **A family member types a message.** The UI tells the agent who is talking, so it can address them and respect their preferences.\n2. **The orchestrator builds the prompt.** It injects today's date, the family roster, and everyone's food preferences, then calls `complete(messages, tools)`.\n3. **The model may call tools.** If it does, MiniFam runs each one, appends the result, and loops (up to 12 rounds). 
Every tool error "
  },
  {
    "repo_id": "build-small-hackathon/storybear",
    "title": "Storybear",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/storybear",
    "zone": "Creative Studio",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "track",
      "sponsor",
      "achievement",
      "storybear",
      "summary",
      "available"
    ],
    "difficulty": "Casual",
    "quest": "🎨 Creative Studio: Find one surprising or delightful result.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🎨",
    "readme_text": "---\ntitle: Storybear\nemoji: 🐨\ncolorFrom: blue\ncolorTo: pink\nsdk: gradio\nsdk_version: 6.18.0\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: From science to fairytale via agent-assisted storytelling\nthumbnail: >-\n  https://raw.githubusercontent.com/latticetower/storybear/dev/assets/storybear-logo.png\nmodels:\n- black-forest-labs/FLUX.2-klein-4B\n- openbmb/MiniCPM-V-4.6\n- sentence-transformers/all-MiniLM-L6-v2\n- facebook/esm2_t6_8M_UR50D\n- RaphaelMourad/Mistral-DNA-v1-138M-bacteria\n- DeepChem/ChemBERTa-10M-MLM\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# STORYBEAR: from science to fairytale via agent-assisted storytelling.\n\n![project logo](./assets/storybear-logo.png)\n\n## Features & Limitations\n* The project is an autoEDA framework with a rich user experience.\n* The resulting plots are meant to be 'redrawn' by AI with some effects (and distortions!).\n* Any non-standard plots appearing in the report are courtesy of the author's mind.\n* In the end, it should process some specific bioinformatics data formats (protein sequences, SMILES strings), because I myself am too lazy to do explicit exploratory analyses by hand every time.\n* Default captions are hard-coded.\n* The report header is designed to add some amount of exaggeration.\n* The plots are generated 'as is' and then processed with LLMs. LLMs might hallucinate. I also might hallucinate while coding the plots. This means that any results might not correctly represent the input data. **Use at your own risk**.\n* ZeroGPU local inference is not supported yet; the pipeline can only be run from the console with a local GPU/MPS. 
This is available via the GitHub repo (and described in its README); in this mode the Gradio app can also be run with local inference (but it doesn't work with ZeroGPU).\n\n## Links\n* LinkedIn Post: https://www.linkedin.com/posts/tatiana-malygina-56b95425_storybear-from-science-to-fairytale-via-activity-7472431307227308034-L4Kc\n* Medium Post: https://medium.com/@latticetower/storybear-from-science-to-fairytale-via-agent-assisted-storytelling-b63721477d37\n* Main pipeline code is available at: https://github.com/latticetower/storybear/tree/dev\n* Video: https://drive.google.com/file/d/1OjpYw4ts1izV9pfIg9OIJ-4Lq-04YTU-/view?usp=sharing"
  },
  {
    "repo_id": "build-small-hackathon/gitopadesh",
    "title": "Gitopadesh",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/gitopadesh",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "gitopadesh",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Gitopadesh\nemoji: 🪔\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.11'\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: Multilingual Gita advisor with a published local 1.5B GGUF\nthumbnail: https://huggingface.co/spaces/build-small-hackathon/gitopadesh/resolve/main/banner.jpg\ntags:\n  - build-small-hackathon\n  - backyard-ai\n  - bhagavad-gita\n  - rag\n  - llama-cpp\n  - fine-tuned\n  - gradio\n  - track:backyard\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:fieldnotes\n  - achievement:demo\n---\n\n<p align=\"center\">\n  <a href=\"https://www.loom.com/share/5f2d6134fdd7457f8d438def0667fc7a\" title=\"Watch the 90-second demo\">\n    <img src=\"banner.jpg\" alt=\"GITOPADESH — the Bhagavad Gita as a living advisor\" width=\"100%\">\n  </a>\n</p>\n\n<h1 align=\"center\">🪔 GITOPADESH</h1>\n<p align=\"center\"><b>A private, multilingual Bhagavad Gita advisor with a published fine-tuned 1.5B model.</b><br>\nSpeak the struggle you carry. 
Krishna answers in your language, grounds the guidance<br>\nin the most relevant shloka, and can run locally as a GGUF through llama.cpp.</p>\n\n<p align=\"center\">\n  <a href=\"https://www.loom.com/share/5f2d6134fdd7457f8d438def0667fc7a\"><img src=\"https://img.shields.io/badge/▶_Watch_the_90s_Demo-FF0000?style=for-the-badge&logo=loom&logoColor=white\"></a>\n  <a href=\"https://huggingface.co/spaces/build-small-hackathon/gitopadesh\"><img src=\"https://img.shields.io/badge/🪔_Open_the_App-FF8C00?style=for-the-badge\"></a>\n  <a href=\"https://x.com/jmadhanai/status/2066438838059467068\"><img src=\"https://img.shields.io/badge/Read_the_Story-000000?style=for-the-badge&logo=x&logoColor=white\"></a>\n</p>\n\n<p align=\"center\">\n  <img src=\"https://img.shields.io/badge/Track-Backyard_AI-FF8C00?style=for-the-badge\">\n  <img src=\"https://img.shields.io/badge/Model-Qwen2.5--1.5B-D4A017?style=for-the-badge\">\n  <img src=\"https://img.shields.io/badge/Runs-on_a_laptop-2E7D32?style=for-the-badge\">\n  <img src=\"https://img.shields.io/badge/Languages-EN_·_हिंदी_·_తెలుగు-B0303A?style=for-the-badge\">\n</p>\n\n<p align=\"center\"><b>▶ <a href=\"https://www.loom.com/share/5f2d6134fdd7457f8d438def0667fc7a\">Watch the demo</a> &nbsp;·&nbsp; 🪔 <a href=\"https://huggingface.co/spaces/build-small-hackathon/gitopadesh\">Open the app</a> &nbsp;·&nbsp; 🐦 <a href=\"https://x.com/jmadhanai/status/2066438838059467068\">The story on X</a></b></p>\n\n---\n\n## ⚡ Why judges should care\n\n| The problem | The build-small answer |\n|---|---|\n| Spiritual dilemmas are deeply private | A **Qwen2.5-1.5B GGUF** can run locally through llama.cpp |\n| Generic advice can invent scripture | Semantic RAG grounds every response in **701 real Gita verses** |\n| Wisdom should not require English | Full guidance in **English, हिंदी, and తెలుగు** |\n| Small models often lose personality | A focused **LoRA fine-tune on Modal** teaches Krishna's voice and response structure |\n\n**Verifiable proof:** 
[LoRA adapter](https://huggingface.co/jmadhanplacement/gitopadesh-krishna-1.5b-lora) ·\n[merged model](https://huggingface.co/jmadhanplacement/gitopadesh-krishna-1.5b-merged) ·\n[Q4_K_M GGUF](https://huggingface.co/jmadhanplacement/gitopadesh-krishna-1.5b-gguf) ·\n[field notes](FIELD_NOTES.md) · [training data](train_data.jsonl)\n\n## 🏆 Prize targets and proof\n\n| Category | Why GITOPADESH qualifies |\n|---|---|\n| **Backyard AI** | Solves a personal, daily-life problem with a local-capable model and private on-device path. |\n| **Best Use of Codex** | Codex-attributed commits cover tests, CI, inference hardening, accessibility, repository cleanup, README strategy, and deployment. |\n| **Best Use of Modal** | Modal A10 training produced the public LoRA, merged model, and Q4_K_M GGUF artifacts. |\n| **Off Brand** | A fully custom sacred visual system, landing experience, shloka cards, and chapter map beyond stock Gradio. |\n| **Best Demo** | Public 90-second Loom demo plus the linked launch story/social post. |\n| **Field Notes** | A public, technical build report with decisions, failures, and lessons learned. |\n\nThe live Space keeps the reliable Hugging Face cloud backend available for judging.\nThe repository also contains the complete local `llama.cpp` path for the published\n1.5B GGUF; set `KRISHNA_BACKEND=local` when running on suitable local CPU hardware.\n\n## ⏱️ Judge it in 60 seconds\n\n1. [Open the Space](https://huggingface.co/spaces/build-small-hackathon/gitopadesh) and choose a real dilemma.\n2. Switch between English, हिंदी, and తెలుగు; inspect the cited Sanskrit shloka.\n3. Try **Copy guidance** and download the generated shloka card.\n4. Open the [GGUF repository](https://huggingface.co/jmadhanplacement/gitopadesh-krishna-1.5b-gguf) to verify the local 1.5B artifact.\n\n## 🌙 Why this exists — a tool I built for myself\n\nI was stuck on the biggest decision of my life. 
I kept turning to the Bhagavad\nGita — but at 1am, paralyzed, nobody can hunt through 700 verses to find the one\nthat fits *their* exact situation. So I built the thing I ne"
  },
  {
    "repo_id": "build-small-hackathon/Rizz-Therapy",
    "title": "Rizz Therapy",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Rizz-Therapy",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "rizz",
      "therapy",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Rizz Therapy\nemoji: 🫰\ncolorFrom: pink\ncolorTo: red\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: Anxious? Get some Rizz therapy 💀\ntags:\n  - track:wood\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# Rizz Therapy 🫰\n\nRizz Therapy is a show where two AIs bring their rizz problems to a therapist. The therapist then breaks down their issues and guides them to improve their rizz.\n\nDemo: [YouTube](https://youtu.be/6YjPRCt0MZw)\n\nBlog: [Community Article](https://huggingface.co/blog/build-small-hackathon/rizz-therapy)\n\nSocial Media Post: [LinkedIn](https://www.linkedin.com/posts/inventwithdean_hugging-face-and-gradio-asked-us-to-build-activity-7469668912960503808-TIVa)\n\n#### **Important**: Make sure you're on the latest Chrome/Firefox/Brave browser. As of now, Safari and Edge [don't fully support](https://docs.unity3d.com/2018.1/Documentation/Manual/webgl-browsercompatibility.html) WebGL 2, which this Unity build uses.\n\n## 📽️ Features\n- **Entertaining**: Creative scripts written by Gemma 4 26B-A4B.\n- **Expressive Delivery**: Top-tier emotional text-to-speech with OmniVoice TTS 0.6B.\n- **Bilingual**: Supports generation in both English and Hindi.\n- **Expressions**: Characters show emotions (using blendshapes).\n- **Precise Gaze Changes**: Word-level gaze changes, thanks to mms-1b-all, which is used for forced alignment.\n- **Lots of Costumes**: Each character has 5 costumes.\n- **Instantly Shareable**: Episodes become shareable with friends the moment they finish generating.\n\n## ⚡ Speed (ZeroGPU)\n\n### A full episode gets generated in *under 1 minute*.\n- llama.cpp runs Gemma 4 26B-A4B at ~110 tok/sec on ZeroGPU.\n- OmniVoice 0.6B, being a diffusion-based TTS, generates audio over 10x faster than real time.\n\n### What does this mean?\n- **Widely Accessible**: Even unauthenticated users get 2 minutes of 
ZeroGPU per day, so they can generate one episode per day and still have half their quota left.\n- **Costs**: $0, thanks to open-source and ZeroGPU.\n\n\n## 🧠 Architecture\n\n- **Episode Generation Pipeline**: Gemma generates the script, OmniVoice outputs audio, mms-1b-all extracts visemes and timings for look_at tags.\n\n- **Compression**: Everything gets zipped and saved to disk (or uploaded to Cloudflare R2). A full episode (3 minutes) is just ~6MB on disk, thanks to high-ratio ZIP compression of raw audio bytes and JSON.\n\n- **Off-Grid**: The ZeroGPU version uses Cloudflare R2 for distribution speed. A local clone uses disk storage instead, since the repo ships with the required Unity build files and the server already mounts everything. To run locally, set the global variable **online** to False in **app.py** before running.\n\n![architecture](architecture.png)\n\nAnd here is the architecture of the Unity side:\n\n![architecture_unity](architecture_unity.png)\n\n## ❤️ Open Source\n\n| Model | License | Runtime |\n| -- | -- | -- |\n| [google/gemma-4-26B-A4B-it](https://huggingface.co/google/gemma-4-26B-A4B-it) | apache-2.0 | llama.cpp |\n| [k2-fsa/OmniVoice](https://huggingface.co/k2-fsa/OmniVoice) | apache-2.0 | PyTorch |\n| [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) | cc-by-nc-4.0 | PyTorch |\n\n- PyTorch: https://pytorch.org/\n- llama.cpp: https://github.com/ggml-org/llama.cpp\n- llama.cpp Python bindings: https://github.com/abetlen/llama-cpp-python\n\nNOTE: If we only supported English, we could have used the apache-2.0 licensed [facebook/wav2vec2-base-960h](https://huggingface.co/facebook/wav2vec2-base-960h) instead of mms-1b-all.\n\n\n## 🧘 Why Rizz Therapy?\nLet's be real for a second.\nAI companions are having a moment, but they aren't a replacement for real human connection. They never will be (unless we literally hack our brains with neural chips lol). 
Nature spent millions of years evolving human-to-human dynamics for a reason. Trying to build an AI to be your permanent romantic partner isn't just technologically uphill, it's fighting against nature itself.\n\n**Rizz Therapy** exists because we believe AI shouldn't be an alternative to human relationships; it should be a mirror for them.\n\nWatching AIs fumble through their rizz problems and get comedic guidance from a therapist triggers a thought loop in your own brain. It's a low-stakes proxy designed to make you laugh and cringe, while also giving you a tiny spark of confidence to actually pursue a human connection."
  },
  {
    "repo_id": "build-small-hackathon/paint_match",
    "title": "Paint_Match",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/paint_match",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "paint_match",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Paint Match\nemoji: 🎨\ncolorFrom: yellow\ncolorTo: red\nsdk: gradio\nsdk_version: 6.16.0\napp_file: server.py\nlicense: apache-2.0\nshort_description: Paint codes from kit photos — finetuned 1B VLM, no cloud\npinned: false\nmodels:\n  - build-small-hackathon/paint-match-minicpm\ntags:\n  - vision\n  - llama-cpp\n  - scale-modelling\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# Paint Match\n\n**A finetuned 1-billion-parameter vision model running on a $50 ARM board identifies paint codes from scale model instruction sheets — no cloud API, no GPU.**\n\nUpload a photo of an instruction sheet or a shop screenshot. Paint Match extracts the paint codes, converts Humbrol references to Tamiya equivalents, and shows shop links. Paints you already own are flagged automatically.\n\n## Who it's for\n\nI build plastic scale models. Every kit ships an instruction sheet listing paint codes in a manufacturer's own system (Humbrol, Tamiya, Meng), and matching those to what's actually on the shelf — and to what I already own — is tedious manual cross-referencing before every build. Paint Match is the tool I use on my own kits: photograph the sheet, get a Tamiya shopping list, with paints already in my Google Sheets inventory flagged so I don't re-buy them.\n\nThe UI is styled after Airfix Dogfighter — a 2000 PC game I played as a kid. 
The aesthetic felt right for a tool that's part of the same hobby.\n\n## How it works\n\n- Photo is resized to 960px (A/B tested: 640px drops codes on small print; 1280px adds 30s with no recall gain)\n- [MiniCPM-V-4.6](https://huggingface.co/openbmb/MiniCPM-V-4.6) finetuned on paint instruction data, runs via llama.cpp on a Radxa Dragon Q6A — 4 CPU cores, no GPU\n- Fine-tuned on Modal.com H100 (16 min, ~391 training examples); training data scraped from Airfix instruction sheets and paper photos\n- JSON schema constraint forces structured output; no regex fragility\n- Humbrol→Tamiya conversion from a hand-curated CSV; inventory check via Google Sheets\n- Inference: ~64s average (down from ~190s with the base InternVL model)\n\n## Benchmark (MiniCPM-V-4.6 finetuned, running on Radxa CPU)\n\n| Metric | Score |\n|--------|-------|\n| F1 — benchmark (10 images) | **0.935** |\n| F1 — shop holdout (53 images) | **0.927** |\n| F1 — paper holdout (7 images) | **0.928** |\n| Avg latency | ~64s |\n\nBase model comparison (InternVL3.5-2B): F1=0.873, ~190s latency. Gemma 4B timed out on every instruction sheet image on this hardware.\n\n## Architecture\n\nAll inference is 100% local, on hardware I own. This HF Space is a thin proxy UI only — it holds no model and calls no hosted LLM API. Photos are forwarded over a Cloudflare Tunnel to a private Radxa SBC, where the finetuned 1B model runs on CPU via llama.cpp; the Space just renders the result. Off-grid where it counts: the AI never leaves the board.\n\n## Demo & social\n\n- Demo video - https://youtu.be/mNMjyIMAUuo\n- Social post - https://x.com/szwendacz/status/2064679708877291630\n- Field notes - [blog post](https://llmops.pl/posts/paint_match_first_hackathon_first_vlm_on_a_board/) \n"
  },
  {
    "repo_id": "build-small-hackathon/PaperProf",
    "title": "Paperprof",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/PaperProf",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "paperprof",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: PaperProf\nemoji: 📄\ncolorFrom: purple\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: false\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\nCheck out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference\n\n---\n\n# PaperProf — AI Study Buddy\n\n## Demo\n\nVideo walkthrough: https://youtu.be/eyoXrGMjXWc\n\nLinkedIn post: https://www.linkedin.com/posts/ryad-gazenay_buildsmallhackathon-huggingface-gradio-ugcPost-7471900513991729152-Th-Y/\n\n## Models used\n\n- [build-small-hackathon/MiniCPM4-8B-PaperProf](https://huggingface.co/build-small-hackathon/MiniCPM4-8B-PaperProf) — QLoRA fine-tune of openbmb/MiniCPM4-8B on SQuAD, used for question generation and answer evaluation\n- [black-forest-labs/FLUX.1-schnell](https://huggingface.co/black-forest-labs/FLUX.1-schnell) — FLUX.2-klein-4B, used for session image generation\n\n## Sponsor prize categories\n\n- OpenBMB (MiniCPM4.1-8B)\n- Black Forest Labs (FLUX.2-klein-4B)\n\nPaperProf turns any course PDF into an interactive study session.\nUpload your lecture notes or textbook, receive auto-generated questions drawn\ndirectly from the material, type your answers, and get instant, constructive\nfeedback powered by a local LLM (MiniCPM4-8B).\n\n---\n\n## How it works\n\n```\nPDF upload\n    └─► core/parser.py      — extract raw text with PyMuPDF\n         └─► core/chunker.py — split text into thematic chunks\n              └─► core/questioner.py — LLM generates a question from a chunk\n                   └─► student answers\n                        └─► core/evaluator.py — LLM evaluates & explains\n```\n\nThe LLM (loaded once at startup via `model/llm.py`) handles both question\ngeneration and answer evaluation.  
Everything runs locally — no API keys needed.\n\n---\n\n## File structure\n\n```\nPaperProf/\n├── app.py                  # Gradio UI — entry point\n├── requirements.txt        # Python dependencies\n├── README.md               # This file\n├── core/\n│   ├── __init__.py\n│   ├── parser.py           # PDF → plain text  (PyMuPDF)\n│   ├── chunker.py          # plain text → thematic chunks\n│   ├── questioner.py       # chunk → study question  (LLM)\n│   └── evaluator.py        # (question, chunk, answer) → feedback  (LLM)\n└── model/\n    ├── __init__.py\n    └── llm.py              # singleton LLM wrapper  (MiniCPM4-8B / Transformers)\n```\n\n### File roles\n\n| File | Role |\n|---|---|\n| `app.py` | Builds the Gradio interface and wires the pipeline together. |\n| `core/parser.py` | Opens the PDF with PyMuPDF (`fitz`) and extracts plain text page by page. |\n| `core/chunker.py` | Splits the raw text on paragraph boundaries, merging short paragraphs and capping chunk size so the LLM isn't overloaded. |\n| `core/questioner.py` | Sends a chunk to the LLM with a professor-style prompt and returns one open-ended question. |\n| `core/evaluator.py` | Sends the question, source chunk, and student answer to the LLM, which returns a structured verdict + model answer. |\n| `model/llm.py` | Loads `openbmb/MiniCPM4-8B` once via Transformers, exposes a `generate(prompt)` method, and caches the instance as a singleton. |\n| `requirements.txt` | Pins all Python dependencies needed to run the project. |\n\n---\n\n## Setup\n\n```bash\n# 1. Create a virtual environment\npython -m venv venv\nsource venv/bin/activate   # Windows: venv\\Scripts\\activate\n\n# 2. Install dependencies\npip install -r requirements.txt\n\n# 3. (Optional) override the model or device\nexport PAPERPROF_MODEL=\"openbmb/MiniCPM3-4B\"   # smaller model for testing\nexport PAPERPROF_DEVICE=\"cuda\"                  # cuda | mps | cpu | auto\n\n# 4. 
Launch\npython app.py\n```\n\nThe Gradio app will open at `http://localhost:7860`.\n\n---\n\n## Usage\n\n1. Click **Upload course PDF** and choose your file.\n2. Click **Load PDF** — PaperProf parses the document and reports how many\n   chunks were found.\n3. Click **New Question** to get a question generated from a random chunk.\n4. Type your answer in the **Your Answer** box.\n5. Click **Submit Answer** to receive structured feedback.\n\nRepeat steps 3–5 as many times as you like to practice the full material.\n\n---\n\n## Requirements\n\n- Python ≥ 3.10\n- A GPU with ≥ 10 GB VRAM is recommended for MiniCPM4-8B in bfloat16.\n  CPU inference works but is slow; set `PAPERPROF_MODEL` to a 4B variant for\n  faster CPU runs.\n"
  },
  {
    "repo_id": "build-small-hackathon/neilA",
    "title": "Neila",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/neilA",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "neila",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: First Contact\nemoji: 🛸\ncolorFrom: green\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.16.0\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Teach an alien that knows words but has never lived a life.\ntags:\n- track:wood\n- sponsor:modal\n- achievement:offgrid\n- achievement:offbrand\n---\n\n# First Contact\n\nA small-model game for the **Build Small** hackathon — *An Adventure in Thousand\nToken Wood* track. You teach\nan alien that knows human *words* but has never experienced human life. It acts in\na tiny sandbox world, accumulates *concepts* as you teach them, and eventually\n**generalizes** a learned concept to a brand-new situation on its own. That\n\"it finally understood me\" moment is the payoff.\n\nThe model never learns in the weights sense. The alien's growing understanding\nlives in a plain-Python **concept ledger** injected into the prompt every turn.\nThe model is a stateless function: given *(ledger + world + your words)* it returns\n*(one action + an in-character reply + structured notes)*. The host code applies\nthe action deterministically, checks the win condition **mechanically** (never the\nmodel judging \"success\"), and gates whether a new concept is learned. That loop —\nnot the model — is the game. See [`SPEC.md`](SPEC.md) for the full contract.\n\n## Links\n\n- **Demo video + post (X):** https://x.com/MrChonkyboi/status/2066654526963081589\n\n## How to play\n\n1. Read the current challenge at the top.\n2. Type instructions to the alien in plain language.\n3. It can only *do* one thing from a small, closed action set, but it can *say*\n   anything — and it tells you honestly what it could **not** understand.\n4. When it proposes a new concept, confirm \"it learned that\" to add it to its\n   ledger. 
Later challenges test whether it can apply what it learned **without\n   being re-taught**.\n\n## Architecture\n\n```\ngr.State (per session)  ──►  build_prompt  ──►  Brain.respond  (@spaces.GPU)\n   ledger / world / challenge        │                 │ strict JSON\n        ▲                            │                 ▼\n        └──── learn (gated) ◄─ check_win ◄─ apply_action ◄─ parse + validate\n                                (mechanical)  (deterministic)  (retry once → safe wait)\n```\n\n| module | role |\n|--------|------|\n| `game/models.py` | dataclasses: Concept, Obj, Agent, WorldState, Action, Challenge, GameSession |\n| `game/world.py` | `apply_action` (deterministic), `check_win` (mechanical), initial world |\n| `game/ledger.py` | seed primitives, gated concept add, `times_applied` tracking |\n| `game/challenges.py` | the 5-challenge arc + win predicates (2 generalization beats) |\n| `game/prompt.py` | `build_prompt(ledger, world, challenge, utterance)` |\n| `game/parsing.py` | tolerant JSON extract + validate + §4 retry / safe fallback |\n| `game/brain.py` | `Brain` protocol + `StubBrain` \\| `LocalBrain` \\| `ModalBrain` |\n| `game/engine.py` | the turn loop (Gradio-free, fully testable) |\n| `app.py` | Gradio Blocks UI + wiring (the Space entrypoint) |\n\n## The model is swappable (protect GPU quota)\n\nSelected via the `BRAIN` env var:\n\n- `stub` *(default locally)* — deterministic, **zero GPU**. The entire loop and\n  the whole challenge arc are playable and testable against it.\n- `local` *(set this on the Space)* — a ≤32B instruct model loaded onto `cuda` at\n  module level; inference runs inside `@spaces.GPU`.\n- `modal` — optional dev/serving endpoint. Never the submission path; `requests`\n  is imported lazily so Modal is never a hard dependency.\n\nPick the local model with `MODEL_ID` (default `Qwen/Qwen2.5-14B-Instruct`) and the\nsampler heat with `LOCALBRAIN_TEMPERATURE` (default `0.9`; `0` = greedy). 
Both\ndefaults come from the bake-off below: the JSON envelope held 100% at *every*\ntemperature for every candidate, so the model pick was decided by arc completion\nplus concept invention (14B was the only one strong at both), and 0.9 buys\nnear-peak voice at zero measured reliability cost.\n\n## Develop / test (no GPU)\n\n```bash\n# run the full test suite (loop, parsing/fallback, world) against StubBrain\nuv run --with pytest pytest -q\n\n# run the app locally on the stub brain\nuv run --with gradio python app.py\n```\n\n## Model selection (bake-off)\n\n`bakeoff.py` picks the local model empirically — which ≤32B model emits clean,\nschema-valid JSON *reliably* — without burning quota blind. It calls `respond()`\nfor raw text and parses **once, with no retry** (the §4 retry path would mask the\nfailures we're counting).\n\n```bash\npython bakeoff.py --self-test                      # prove the scorer (zero GPU)\npython bakeoff.py --make-battery battery.jsonl      # battery from the arc (zero GPU)\n# on the Space (or via a Modal endpoint with --brain modal):\npython bakeoff.py --models <id1>,<id2> --brain local --repeats 5 --arc\npython bakeoff.py --models <id> --brain local --temps 0.0,0.3,0.5,0.7,1.0 --repeats 5\npython bakeoff.py --models <id> --brain local --arc-transcript   # eyeball the arc\n```\n\nThe `--temps` sweep is the decision tool: per"
  },
  {
    "repo_id": "build-small-hackathon/Omniscient-Novel-Reader",
    "title": "Omniscient Novel Reader",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Omniscient-Novel-Reader",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "track",
      "omniscient",
      "novel",
      "reader",
      "summary"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 1,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Omniscient Novel Reader\nemoji: 📚\ncolorFrom: indigo\ncolorTo: yellow\nsdk: docker\napp_port: 7860\npinned: false\nlicense: mit\nshort_description: Break the fourth wall with the Omniscient Novel Reader\ntags:\n  - track:backyard\n  - track:wood\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# Blog, Social media post and Demo\n\nBlog link: https://medium.com/@parvmittal31757/chronicle-the-ai-novel-reader-01a6883e17f3\n\nSocial Media post: https://x.com/parv_mittal07/status/2065337040028450846?s=20\n\nDemo link: https://www.youtube.com/watch?v=6n9lhJjK10U\n# Novel Reader\n\nParameters size:\n\n| Model | Parameters |\n|-------|------------|\n| Gemma 4 31b | 31b |\n| BAAI/bge-base-en-v1.5 | 109M |\n| BAAI/bge-reranker-base | 278M |\n| Chatterbox-Turbo | 350M |\n\n### Total parameters: 31.74b (within the hackathon's specified limit)\n\n### Note: The TTS takes about a minute for first start because of Modal. \n### If you want to test, prefer epub format, pdf format's support is not very good. \n\nA self-hosted novel reader with a clean reading UI, automatic novel **parsing**\n(character extraction, dialogue attribution, emotion tagging, voice-actor\nassignment), and per-character **text-to-speech narration** powered by\nChatterbox-Turbo.\n\nThe system is made of three cooperating pieces:\n\n| Component | Tech | Role |\n|-----------|------|------|\n| **Main app** (`app.py`, `novel_reader/`) | FastAPI + Gradio | Reading UI, library, ingestion, TTS orchestration |\n| **Parser** (`novel_parser/`) | PostgreSQL + LLM | Extracts characters, dialogue, emotions, assigns voices |\n| **TTS service** (`tts_service/`) | FastAPI + Chatterbox-Turbo (GPU) | Turns text + a voice sample into speech |\n\nThe TTS service runs as a **separate container/process** because it has heavy,\npinned ML dependencies (torch 2.6, transformers 5.2, …). 
The main app talks to\nit over HTTP, so you can run it locally now and move it to **Modal** later by\nonly changing one URL.\n\n---\n\n## Architecture\n\n```\n                ┌──────────────────────────────────────────────┐\n   Browser ───► │  Main app  (FastAPI + Gradio)  :8060          │\n                │                                                │\n                │   /dashboard   /reader   /tts/*                │\n                │        │            │         │                │\n                │     SQLite      SQLite     Orchestrator        │\n                │   (library)   (sections)   (novel_reader/tts)  │\n                └──────────────────┬──────────────┬─────────────┘\n                                   │              │ HTTP\n                          ┌────────▼─────┐   ┌────▼──────────────┐\n                          │ PostgreSQL    │   │ TTS service :8070 │\n                          │ (parsed data) │   │ Chatterbox-Turbo  │\n                          └───────────────┘   │ (GPU)             │\n                                   ▲          └────────┬──────────┘\n                                   │                   │ reads\n                          ┌────────┴─────┐    ┌────────▼──────────┐\n                          │ novel_parser  │    │ voice_samples/    │\n                          │ (LLM passes)  │    │ *.wav references  │\n                          └───────────────┘    └───────────────────┘\n```\n\n- **Reader data** (sections, reading progress, bookmarks) lives in SQLite at\n  `data/reader.sqlite3`.\n- **Parsed data** (characters, `dialogue_entries` with speaker/emotion/voice)\n  lives in **PostgreSQL**. 
The two are linked by the novel's `uuid`.\n- **TTS** consumes only `(text, voice_ref)` and returns a WAV — it never touches\n  either database.\n\n---\n\n## Prerequisites\n\n- **Python 3.10+** (the project venv uses 3.10.12)\n- **Docker + Docker Compose** (for PostgreSQL and the TTS service)\n- **An NVIDIA GPU + [NVIDIA Container Toolkit](https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/install-guide.html)** — required for the TTS container (CPU works but is very slow)\n- **An LLM** for parsing — either a local [Ollama](https://ollama.com) or a cloud key (Groq / OpenRouter / Gemini). Parsing is **optional**; reading works without it.\n\n---\n\n## 1. Environment setup (main app)\n\n```bash\ngit clone <your-repo-url>\ncd \"novel reader\"\n\n# Create and activate a virtual environment\npython3 -m venv .venv\nsource .venv/bin/activate\n\n# Install the main app dependencies (lightweight — no ML libs)\npip install -r requirements.txt\n```\n\nCopy the example environment file and edit as needed:\n\n```bash\ncp .env.example .env\n```\n\nKey variables (see `.env.example` for the full list):\n\n| Variable | Default | Purpose |\n|----------|---------|---------|\n| `DATABASE_URL` | `postgresql://novel_reader:changeme@localhost:5432/novel_reader` | PostgreSQL for the parser |\n| `ENABLE_NOVEL_PARSING` | `false` | Turn on automatic parsing after upload |\n| `LLM_PROVIDER` / `LLM_BASE_URL` / `LLM_MODEL` | `ollama` / local | LLM used for parsing |\n| `VOICE_SAMPLES_DIR` | `./voice_samples` | Reference voice clips |\n| `TT"
  },
  {
    "repo_id": "build-small-hackathon/octopus-ai",
    "title": "Octopus Ai",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/octopus-ai",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "octopus",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Octopus AI — Stress Test the Octopus\nemoji: 🐙\ncolorFrom: purple\ncolorTo: green\nsdk: gradio\nsdk_version: 5.33.0\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Can you break a self-monitoring modular AI?\ntags:\n  - track:wood\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# 🐙 Octopus AI — Stress Test the Octopus\n\n**Can you break a self-monitoring modular AI?**\n\n## What it does\n\nOctopus is a modular AI architecture where a central brain (Mistral 7B) \nconnects to 4 specialized arms: code generation, testing, code review, \nand CI/CD. Each arm trains independently. Remove one — zero impact on \nthe others.\n\nThe system monitors its own structural health in real-time using a \nFragility Index (FI). When an arm degrades, the brain detects it, \nreroutes traffic, and continues operating.\n\n## Try it\n\n1. Click **Execute Mission** with a coding prompt\n2. Watch the brain decompose, route, and generate code across 4 arms\n3. Click **Run Failure Simulation** — watch an arm go down\n4. Run the same mission again — the system survives\n5. 
**Restore All Arms** — full recovery\n\n## Tech stack\n\n- **Brain**: Mistral 7B Instruct, fine-tuned with Structurally \n  Adaptive Learning (SAL) — 2000 steps on StarCoder Python\n- **Arms**: 4 LoRA adapters (code_gen, testing, code_review, cicd), \n  each instruction-tuned on CodeAlpaca/StarCoder\n- **Monitoring**: Real-time Fragility Index, 5-stage self-regulation \n  lifecycle (Healthy → Degraded → Isolated → Fallback → Recovery)\n- **Inference**: Live on ZeroGPU, Mistral 7B 4-bit quantized\n\n## Demo video\n\n🎬 [Watch on YouTube](https://www.youtube.com/watch?v=1qnp8cfIybI)\n\n## Social posts\n\n- [LinkedIn](https://www.linkedin.com/posts/applied-ai-ch_buildsmall-huggingface-gradio-activity-7469858393814450176-k4Qx)\n- [X/Twitter](https://x.com/AI_Owner_Book/status/2064090324285149470)\n\n## Built by\n\n[Cognitive Engineering](https://cognitive-engineering.dev) 🇨🇭\n[appliedai.ch](https://appliedai.ch)\n"
  },
  {
    "repo_id": "build-small-hackathon/1000-Rooms",
    "title": "1000 Rooms",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/1000-Rooms",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "1000",
      "rooms",
      "summary",
      "available"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: 1000 Rooms\nemoji: 🗝️\ncolorFrom: yellow\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: false\nshort_description: LLM-generated escape room\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n(Social media post)(https://www.linkedin.com/in/alexandru-buia-881a70173/?skipRedirect=true)\n\n(Demo Video)(https://youtu.be/G5OHbmP2rCE)\n\n(Presentation)(https://youtu.be/fvFnO3HiVgo)\n\n## Bonus badges:\n\n1. Off the Grid\n2. Well-Tuned https://huggingface.co/build-small-hackathon/Nemotron-nano-4b-escape-room\n3. Off-Brand\n4. ~ Llama Champion (the text model runs in llama cpp)\n5. Sharing is Caring (shared the codex traces until I ran out of credits) https://huggingface.co/datasets/build-small-hackathon/1000-Rooms-traces\n6. Field notes https://huggingface.co/blog/build-small-hackathon/1000-rooms-post\n\n## Sponsor tracks:\n\n1. used OpenBMB's VoxCPM2\n2. used Nvidia's Nemotron 3 nano 4b\n3. used Black Forest Labs' Flux 2 Klein 4b\n4. used Modal's cloud compute for fine-tuning Nemotron\n5. used OpenAI's Codex and ChatGPT for the space (GitHub repo)(https://github.com/Ferrariedhgs/LLR)"
  },
  {
    "repo_id": "build-small-hackathon/BuzzwordsMisdemeanors",
    "title": "Buzzwordsmisdemeanors",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/BuzzwordsMisdemeanors",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "buzzwordsmisdemeanors",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 2,
    "sdk": "docker",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Buzzwords & Misdemeanors\nemoji: ⚖️\ncolorFrom: indigo\ncolorTo: red\nsdk: docker\napp_port: 7860\npinned: false\nshort_description: Courtroom deduction game on one 1B model, local on CPU\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - off-the-grid\n  - well-tuned\n  - off-brand\n  - llama-champion\n  - sharing-is-caring\n  - field-notes\n  - track:wood\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# ⚖️ Buzzwords & Misdemeanors\n\nYou wake up in a courtroom. A judge, a prosecutor and a defense counsel argue your case\n— burying you in dense, barely-comprehensible **jargon you picked yourself**. That jargon\nis a *smokescreen*: it has nothing to do with what you actually did. See through it, then\nguess your **real profession and the charge against you**. A model scores you 0–100% and\nreveals the hidden truth.\n\nBuilt for the Hugging Face **Build Small** hackathon — small models only, fully off-grid.\n\n## What this is (hackathon submission)\n\n**Track — 🍄 Thousand Token Wood.** A delightful, AI-native game: a whole courtroom\nhearing is improvised by one ~1B model, and the fun *is* the model doing the work — it\ndirects the trial, acts every role in dense jargon, drops the clues, and grades your plea.\n\n**The idea.** You pick a jargon (aviation, medical, corporate…) that becomes a\n*smokescreen*; the court buries your real (unrelated) profession and charge under it.\nYou read past the buzzwords and the evidence board, then plead in plain English. 
A model\nscores you and the truth is revealed.\n\n**The tech.** One **MiniCPM5-1B** base in 4-bit GGUF, served by a single\n**`llama-server`** (llama.cpp, AVX2) on **pure CPU** — a free `cpu-basic` Space, no GPU.\nIt wears small LoRA adapters we fine-tuned and published: one distilled *director* and\none per jargon *style*. Code (GBNF grammars + deterministic guards in\n`buzzwords/contracts.py`) enforces the rules; the models provide the flavor.\n\n**Merit badges we're going for:**\n\n| Badge | Why we qualify |\n|---|---|\n| 🔌 **Off the Grid** (local-first) | No cloud APIs — the whole game runs on CPU in the Space. |\n| 🎯 **Well-Tuned** (fine-tuned) | Our own LoRAs, published on the Hub (director + 8 styles). |\n| 🎨 **Off-Brand** (custom UI) | A hand-built courtroom front-end, well past the default Gradio look. |\n| 🦙 **Llama Champion** (llama.cpp) | Every token runs through a `llama-server` (llama.cpp) runtime. |\n| 📡 **Sharing is Caring** (open trace) | 64 full agent traces published as a dataset. |\n| 📓 **Field Notes** | A complete build log: [`docs/FIELD_NOTES.md`](docs/FIELD_NOTES.md). 
|\n\n**Links** — Blog: [`docs/FIELD_NOTES.md`](docs/FIELD_NOTES.md) ·\nAdapters: [director](https://huggingface.co/BastienHot/buzzwords-director-lora),\n[styles](https://huggingface.co/BastienHot/buzzwords-style-loras) ·\nTraces: [dataset](https://huggingface.co/datasets/BastienHot/buzzwords-agent-trace) ·\nDemo video: [YouTube](https://www.youtube.com/watch?v=0eOhEZv1-Nk) ·\nSocial post: [LinkedIn](https://www.linkedin.com/posts/hottelet-bastien_buildsmall-smallmodels-llm-share-7472374792478765056-RUkf)\n\n## How it works\n\nThe whole game is **one ~1B model** (MiniCPM5-1B) wearing small LoRA adapters, served by\na single **llama-server** (llama.cpp, AVX2) on **pure CPU** — it runs on a free\n`cpu-basic` Space, no GPU anywhere.\n\n- **The truth is sampled in code** from a curated pool (`buzzwords/pools.py`): a\n  profession + a domain-matched fault, with the jargon's own domain excluded — the\n  smokescreen holds by construction.\n- **Game Master** — the base + a distilled **director** LoRA. It writes the oblique\n  clue facts, then per beat emits a GBNF-constrained JSON decision (who speaks, beat\n  type, which clue to surface, intensity, stage direction, wrap-up) and doubles as the\n  scoring judge.\n- **Deterministic guards** (`buzzwords/contracts.py`) enforce the courtroom invariants\n  the model only *mostly* learned: beat/speaker compatibility (only the defense\n  pleads), never the same speaker three times running, the defense heard by mid-game,\n  and every clue fact forced out before the hearing ends.\n- **Actors** — the same base + **one LoRA per jargon style**. Three roles = three\n  system prompts. 
Actors see the last two public lines, the stage direction, and the\n  clue to weave in — **never the truth**, so they cannot leak it.\n- **No waiting room** — the hearing starts on the first generated beat; the rest is\n  generated in the background while you read.\n\nEvery prompt, grammar, and rule lives in **`buzzwords/contracts.py`** — the single\nsource of truth shared verbatim with the training pipeline, so the training\ndistribution *is* the inference distribution.\n\nThe LoRAs are trained offline on Modal — see [`training/`](training/README.md).\n\n## Run locally\n\nCPU-only. You need a **llama-server** binary built with AVX2 (the Dockerfile does this\nfor the Space; locally, grab a [l"
  },
  {
    "repo_id": "build-small-hackathon/thousand-token-wood-sim",
    "title": "Thousand Token Wood Sim",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/thousand-token-wood-sim",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "wood",
      "sponsor",
      "thousand",
      "token",
      "modal"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Thousand Token Wood\nemoji: 🍄\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.16.0\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Five small models run a market drama; you are the financier\ntags:\n- track:wood\n- sponsor:openbmb\n- sponsor:nvidia\n- sponsor:modal\n- achievement:welltuned\n- achievement:offbrand\n- achievement:sharing\n- achievement:fieldnotes\n- thousand token wood\n- off-brand\n- well-tuned\n- best demo\n- best agent\n- field notes\n- sharing is caring\n- minicpm\n- nemotron\n- modal\n---\n\n# 🍄 Thousand Token Wood\n\nA small-model **finance drama** for the [Build Small Hackathon](https://huggingface.co/build-small-hackathon).\nFive woodland creatures, each driven by a *different lab's* small model, trade goods\nfor pebbles, gossip, scheme, and panic. You are the **Patron of the Wood**, a shadow\nfinancier: you lend, whisper tips, short the market, bribe, and broker alliances, while\na magistrate watches for cheats. 
The chart is the scoreboard; the story is the people.\n\n## Watch and read\n- 🎬 **Demo video**: [YouTube (100s)](https://youtu.be/ugFyFumUCgs)\n- 📣 **Social post**: [on X](https://x.com/RealLesterLeong/status/2064904967391961483)\n- 📓 **Field notes** (the build, in four parts):\n  [1 — emergent market drama](https://huggingface.co/blog/build-small-hackathon/thousand-token-wood-sim) ·\n  [2 — the five-lab council](https://huggingface.co/blog/build-small-hackathon/thousand-token-wood-sim-v2) ·\n  [3 — the crash that vanished](https://huggingface.co/blog/build-small-hackathon/thousand-token-wood-sim-v3) ·\n  [4 — the wood fights back](https://huggingface.co/blog/build-small-hackathon/thousand-token-wood-sim-v4) ·\n  [Medium mirror](https://medium.com/@LesterLeong/thousand-token-wood-emergent-market-drama-from-3-billion-parameter-agents-22545d5982bf)\n- 🧠 **Fine-tuned model** (one of the five minds): [AdmiralTaco/ttw-trader-0.5b](https://huggingface.co/AdmiralTaco/ttw-trader-0.5b)\n- 📡 **Open agent traces** on the Hub: [dataset](https://huggingface.co/datasets/build-small-hackathon/thousand-token-wood-traces)\n\n## Five labs, five minds\nEvery creature thinks with a different small model, so the council is a live argument\nbetween four labs, all under the 32B cap and served with vLLM on **Modal**:\n\n| Creature | Model | Lab |\n|---|---|---|\n| Oona (owl) | gpt-oss-20b | OpenAI |\n| Bramble (squirrel) | MiniCPM3-4B | OpenBMB |\n| Fenn (fox) | Nemotron-Mini-4B | NVIDIA |\n| Mossback (tortoise) + Pip (mouse) | ttw-trader-0.5b (fine-tuned) | ours |\n\nThe 0.5B was distilled from cleaned traces of a 3B teacher (self-buy mistakes stripped),\nso a model one-sixth the size trades *more* cleanly than its teacher: zero self-buys,\n100% valid offers. gpt-oss-20b doubles as the Narrator that writes the wood's headlines.\n\n## Why \"small\" is load-bearing\nA living economy needs *many* agents thinking *many* times per turn. Frontier models are\ntoo slow and costly for that. 
Small models are what make a real-time council of traders\nfeasible, and the heterogeneity (four labs) is what makes the drama: each model has its\nown temperament, so the market is an emergent negotiation, not a script.\n\n## How Modal powers the wood\nThe entire model layer runs on **Modal**, end to end:\n- **Four vLLM engines, four Modal apps** — one per lab's model, each on its own GPU,\n  invoked per turn with **per-engine batching** (every creature on the same engine shares\n  one batched call). All apps **scale to zero** when idle, so the wood costs ~nothing\n  between sessions and the whole hackathon ran on a fraction of the starter credits.\n- **Fine-tuning on Modal too**: `AdmiralTaco/ttw-trader-0.5b` was LoRA-trained on a Modal\n  A10G from cleaned traces of its 3B teacher, then merged and pushed to the Hub — build,\n  train, eval, and serve, all on one platform.\n- **Cold-start craft**: a warmup ping precedes recorded runs, the UI streams a \"waking the\n  wood\" frame during the first cold call, and the attract reel plays instantly with zero\n  GPU so the Space is never blank while engines wake.\n\n## Sponsor models in the council\n- **OpenBMB MiniCPM3-4B** is Bramble the squirrel — the council's scrappy haggler.\n- **NVIDIA Nemotron-Mini-4B** is Fenn the fox — the opportunist who buys the dip.\n- **OpenAI gpt-oss-20b** is Oona the owl *and* the Wood Street Journal narrator.\nEach model's temperament is visible in its creature's 💭 thoughts and raw JSON (both\nexposed in the UI), so you can watch four labs' small models argue a market into being.\n\n## What you can do (you are the Patron)\n- **Power moves**: lend pebbles at interest, whisper a tip, short a good, bribe a creature,\n  broker an alliance, or fund a corner. Every move costs you purse, reputation, or heat,\n  and the creatures react.\n- **The information war**: a tip can be *true* (real foresight of the next Wood Legend, your\n  insider edge) or *false* (bait). 
Profit from an insider tip and your **heat** rises;\n  cross the line and **Magistrate Heron** opens an investigation that ends in a fine, froz"
  },
  {
    "repo_id": "build-small-hackathon/picklld",
    "title": "Picklld",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/picklld",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "picklld",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Pickldd\nemoji: 🥒\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: \"6.17.3\"\napp_file: app.py\npinned: false\ntags:\n  - build-small-hackathon\n  - track:wood\n  - achievement:offbrand\n  - achievement:tinytitan\n  - gradio\n  - pickle\n  - huggingface\n  - vision\n---\n\n# Pickldd 🥒 — Pickle Rater\n\nThe internet's most *serious* pickle review platform. Rate pickles by crunchiness, sourness, garlic level, and spice. The AI Pickle Sommelier writes tasting notes. Scan a jar photo to auto-detect brand and style.\n\n**🔗 Links**\n- 🚀 [Live Space](https://huggingface.co/spaces/build-small-hackathon/picklld)\n- 💻 [GitHub](https://github.com/penelopeg/picklld)\n- 📣 [Social post](https://x.com/penelope_tg/status/2066646172379369775)\n\n## Demo\n\n<video src=\"https://github.com/penelopeg/picklld/raw/main/demo.webm\" controls width=\"100%\"></video>\n\n## AI features (Tiny Titan — all models ≤4B)\n\n| Feature | Model | Params |\n|---|---|---|\n| 🍷 Pickle Sommelier | `Qwen/Qwen2.5-3B-Instruct` | 3B |\n| 📸 Jar photo scan | `google/gemma-3-4b-it` | 4B |\n\nBoth served via [HF Inference Providers](https://huggingface.co/docs/inference-providers) (featherless-ai). Requires `HF_TOKEN` set as a Space secret.\n\n## Setup\n\n**Requirements:** Python 3.9 or later.\n\n```bash\ngit clone https://github.com/penelopeg/picklld\ncd picklld\npython3 -m venv .venv\nsource .venv/bin/activate\npip install -r requirements.txt\npython app.py\n```\n\nThe app runs at `http://localhost:7860`. Set `HF_TOKEN` in your environment to enable AI features.\n"
  },
  {
    "repo_id": "build-small-hackathon/Myco",
    "title": "Myco",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Myco",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "myco",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Myco - Tiny AI Mushroom Companion\nemoji: 🍄\ncolorFrom: red\ncolorTo: green\nsdk: gradio\nsdk_version: 6.16.0\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Build Small Hackathon\ntags:\n  - thousand-token-wood\n  - track:wood\n  - sponsor:openai\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\ndatasets:\n  - build-small-hackathon/Myco\n---\n\n# ❋ Thousand Token Wood Submission\n*The Whimsical Track: Whimsical, delightful, AI-native apps that push the boundaries of fun. Wander somewhere stranger and show off what small models can dream up.*\n\n---\n\n## 🎖️ Achieved Merit Badges\n*Stack 'em on your sash! Hover a patch to see it lift.*\n\n<div style=\"display: flex; flex-wrap: wrap; gap: 15px; margin: 20px 0;\">\n\n  <div style=\"flex: 1; min-width: 200px; padding: 15px; border: 1px solid #e5e7eb; border-radius: 12px; transition: transform 0.2s ease, box-shadow 0.2s ease; cursor: pointer; background: #fff;\" onmouseover=\"this.style.transform='translateY(-5px)'; this.style.boxShadow='0 10px 15px -3px rgba(0,0,0,0.1)'\" onmouseout=\"this.style.transform='translateY(0)'; this.style.boxShadow='none'\">\n    <div style=\"font-size: 24px; margin-bottom: 5px;\">🐜</div>\n    <strong>Tiny Titan</strong>\n    <p style=\"font-size: 12px; color: #6b7280; margin: 5px 0 0 0;\"><strong>Genuinely tiny:</strong> Myco's brain is <code>google/gemma-3-1b-it</code> — a 1B-parameter open-weight model, well under the 4B bar.</p>\n  </div>\n\n  <div style=\"flex: 1; min-width: 200px; padding: 15px; border: 1px solid #e5e7eb; border-radius: 12px; transition: transform 0.2s ease, box-shadow 0.2s ease; cursor: pointer; background: #fff;\" onmouseover=\"this.style.transform='translateY(-5px)'; this.style.boxShadow='0 10px 15px -3px rgba(0,0,0,0.1)'\" onmouseout=\"this.style.transform='translateY(0)'; this.style.boxShadow='none'\">\n    <div style=\"font-size: 24px; margin-bottom: 
5px;\">🎯</div>\n    <strong>Off the Grid</strong>\n    <p style=\"font-size: 12px; color: #6b7280; margin: 5px 0 0 0;\"><strong>Local-first:</strong> No cloud APIs. <code>google/gemma-3-1b-it</code> loads via <code>transformers</code> and runs entirely local on the host environment.</p>\n  </div>\n\n  <div style=\"flex: 1; min-width: 200px; padding: 15px; border: 1px solid #e5e7eb; border-radius: 12px; transition: transform 0.2s ease, box-shadow 0.2s ease; cursor: pointer; background: #fff;\" onmouseover=\"this.style.transform='translateY(-5px)'; this.style.boxShadow='0 10px 15px -3px rgba(0,0,0,0.1)'\" onmouseout=\"this.style.transform='translateY(0)'; this.style.boxShadow='none'\">\n    <div style=\"font-size: 24px; margin-bottom: 5px;\">🎨</div>\n    <strong>Off-Brand</strong>\n    <p style=\"font-size: 12px; color: #6b7280; margin: 5px 0 0 0;\"><strong>Custom UI:</strong> A hand-built forest scene, status panels, and live AI log — well past default Gradio styling.</p>\n  </div>\n\n  <div style=\"flex: 1; min-width: 200px; padding: 15px; border: 1px solid #e5e7eb; border-radius: 12px; transition: transform 0.2s ease, box-shadow 0.2s ease; cursor: pointer; background: #fff;\" onmouseover=\"this.style.transform='translateY(-5px)'; this.style.boxShadow='0 10px 15px -3px rgba(0,0,0,0.1)'\" onmouseout=\"this.style.transform='translateY(0)'; this.style.boxShadow='none'\">\n    <div style=\"font-size: 24px; margin-bottom: 5px;\">🧠</div>\n    <strong>Codex Co-Author</strong>\n    <p style=\"font-size: 12px; color: #6b7280; margin: 5px 0 0 0;\"><strong>Best Use of Codex:</strong> Developed and structured using Codex for workspace orchestration and environment testing.</p>\n  </div>\n\n  <div style=\"flex: 1; min-width: 200px; padding: 15px; border: 1px solid #e5e7eb; border-radius: 12px; transition: transform 0.2s ease, box-shadow 0.2s ease; cursor: pointer; background: #fff;\" onmouseover=\"this.style.transform='translateY(-5px)'; this.style.boxShadow='0 10px 15px -3px 
rgba(0,0,0,0.1)'\" onmouseout=\"this.style.transform='translateY(0)'; this.style.boxShadow='none'\">\n    <div style=\"font-size: 24px; margin-bottom: 5px;\">📡</div>\n    <strong>Sharing is Caring</strong>\n    <p style=\"font-size: 12px; color: #6b7280; margin: 5px 0 0 0;\"><strong>Open trace:</strong> Shared comprehensive agent execution traces on the Hub for open community learning.</p>\n  </div>\n\n</div>\n\n---\n\n## 🌐 Community Links\n* **📺 Video Walkthrough:** [Watch the Myco Demo on YouTube](https://youtu.be/uTlIWqJA9RQ)\n* **🔗 Read the LinkedIn post:** [Read the full project post on LinkedIn](https://www.linkedin.com/posts/noriko-kono_huggingface-gradio-gemma3-share-7472374564023427072-C64t/?utm_source=share&utm_medium=member_desktop&rcm=ACoAADV81lIBHfqnWPcrqTwi8q3nrm4-wpvkldE)\n\n---\n\n# 🍄 Myco — Tiny AI Mushroom Companion\n\nMyco is a small Gradio AI Companion built for Hugging Face's **Build Small Hackathon**, entering **Chapter Two — \"An Adventure in Thousand Token Wood.\"** Wander a glowing forest with Myco, a tiny sentient mushroom companion. Discover strange mushrooms, chat with Myco about them, study and collect them, and "
  },
  {
    "repo_id": "build-small-hackathon/voice-sales-logger",
    "title": "Voice Sales Logger",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/voice-sales-logger",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "modal",
      "sponsor",
      "voice",
      "sales",
      "logger"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Voice Sales Logger\nemoji: 😻\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.15.2\napp_file: app.py\npinned: false\nhardware: cpu-basic\ntags:\n- backyard-ai\n- tiny-titan\n- off-brand\n- nemotron\n- bonus-quest-champion\n- modal\n- track:backyard\n- sponsor:nvidia\n- sponsor:modal\n- achievement:offgrid\n- achievement:offbrand\n- achievement:fieldnotes\nthumbnail: >-\n  https://cdn-uploads.huggingface.co/production/uploads/6421f850eaad1bcb28b2561c/DWWCv3LCVnc1I_d-gVGKV.png\nshort_description: small ai to solve big problems, one step at a time.\n---\n\n# 🥦 Produce Sales Tracker\n\n**Track: Backyard AI &nbsp;·&nbsp; Build Small Hackathon**\n\nA hands-free, voice-driven sales logger built for the oldest continuously operating outdoor produce market in America — Philadelphia's 9th Street Italian Market, open since 1873.\n\n---\n\n### [View Full Presentation →](https://marketmon.github.io/sales-tracker-presentation/)\n\n**[Demo Video →](https://youtu.be/MCdxrAyBASU)** &nbsp;|&nbsp; **[Blog Post →](https://huggingface.co/blog/angrysquash/hands-free-small-ai-solution)**\n\n---\n\n## The Story\n\n<img src=\"readme_materials/tran_from_trans_produce.webp\" width=\"600\">\n\n**Meet Tran.** She's owned her produce stand on 9th Street for over 40 years, since moving to Philadelphia after the Vietnam War. Her staff serves customers in Spanish and English. She runs everything in Vietnamese. Every sale is tracked by mental math and a pocket calculator — no records, no data, no way to know what sold best last Tuesday.\n\nThis app gives her that, completely hands-free.\n\nCash transactions leave no paper trail — and no trail means no loan, no grant, no SBA application. Vendors like Tran have decades of revenue that simply doesn't exist on paper. Every sale logged here is timestamped, itemized proof of business that she can hand to a lender. 
For the first time, the market has a record.\n\n---\n\n## How It Works\n\n```\nMic → Nemotron ASR (0.6B) → Qwen 2.5 (1.5B) → Confirm → SQLite\n```\n\n1. **Record** — tap mic, speak the order in any supported language\n2. **Transcribe** — [nvidia/nemotron-3.5-asr-streaming-0.6b](https://huggingface.co/nvidia/nemotron-3.5-asr-streaming-0.6b) converts speech to text\n3. **Parse** — [Qwen/Qwen2.5-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) extracts items, quantities, and prices in whatever language was spoken\n4. **Confirm** — vendor reviews and adjusts before saving\n5. **Log** — confirmed sale written to SQLite with full line items\n\n<img src=\"readme_materials/screenshot_capture.png\" width=\"600\">\n\n<img src=\"readme_materials/screenshot_confirm.png\" width=\"600\">\n\n<img src=\"readme_materials/screenshot_dashboard.png\" width=\"600\">\n\n---\n\n## Technical Details\n\n| | |\n|---|---|\n| **Speech-to-text** | nvidia/nemotron-3.5-asr-streaming-0.6b (0.6B) |\n| **Order parsing** | Qwen/Qwen2.5-1.5B-Instruct (1.5B) |\n| **Total parameters** | 2.1B |\n| **Languages** | Vietnamese · Spanish · English |\n| **UI** | Gradio 6.15.2 + custom Svelte wizard component |\n| **Storage** | SQLite |\n| **Inference** | Modal GPU containers (nemotron-asr + qwen-parse) |\n| **Space hardware** | cpu-basic — all heavy compute offloaded to Modal |\n\n---\n\n## Prizes Targeted\n\n| Prize | Why We Qualify |\n|---|---|\n| **Tiny Titan** ($1,500) | 2.1B total params (≤ 4B cap) |\n| **Off Brand** ($1,500) | Fully custom Svelte wizard component — not default Gradio UI |\n| **Bonus Quest Champion** ($2,000) | Stacking Tiny Titan + Off Brand + Nemotron + Modal |\n| **NVIDIA Nemotron** (RTX 5080 × 2) | ASR on nvidia/nemotron-3.5-asr-streaming-0.6b |\n| **Modal** ($20,000 credits) | Modal is used for the development and runtime of this app — ASR and order parsing run on Modal GPU containers during local development (`modal_asr.py`, `modal_qwen.py`) |\n\n---\n\n## Try 
It\n\n**[Live Demo →](https://huggingface.co/spaces/build-small-hackathon/voice-sales-logger)**\n\n> First order takes ~60s (model loading). Every order after is ~5s."
  },
  {
    "repo_id": "build-small-hackathon/mycelium",
    "title": "Mycelium",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/mycelium",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "mycelium",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Mycelium\nemoji: 🍄\ncolorFrom: red\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.13'\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: Stop losing ideas. Capture, connect, recall.\ntags:\n  - build-small-hackathon\n  - backyard-ai\n  - tiny-titan\n  - nvidia-nemotron\n  - off-brand\n  - off-the-grid\n  - field-notes\n  - track:backyard\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n# Mycelium — Personal Knowledge Agent\n\n> Capture fast. Think later. Let the system surface what matters.\n\nMycelium is a local-first AI knowledge companion that closes the loop between saving something and actually learning from it. No more screenshot graveyards or forgotten browser tabs.\n\n## Demo\n\n📹 **[Demo video](https://www.youtube.com/watch?v=Kr7LxRm0JBs)**\n\n📝 **[LinkedIn post](https://www.linkedin.com/posts/ajit3259_mycelium-stop-losing-ideas-capture-connect-share-7472039183830810624-TtU_/)**\n\n📓 **[Field Notes — what I built, what broke, what I learned](https://huggingface.co/blog/build-small-hackathon/mycelium)**\n\n## The problem\n\nEveryone has the same graveyard: saved links, screenshots, notes-to-self — all gone dark in a week. The capture habit exists. The recall loop doesn't.\n\nMycelium fixes the loop.\n\n## What it does\n\n- **Capture** notes, URLs, and images — each processed into a structured summary with intent classification (`learn` / `act` / `reference` / `ephemeral`) and semantic tags\n- **ASK** — semantic search across your knowledge base with LLM synthesis, follow-up questions, and Feynman self-testing\n- **BRIEF** — daily digest of what you saved, with synthesis across captures and a weekly thread\n- **REVIEW** — spaced repetition (SM-2) targeting specific claims from your own notes, not generic flashcards\n- **GRAPH** — visual map of how your ideas connect via embedding similarity\n\n## How it works\n\n1. 
You capture a note, URL, or image\n2. **NVIDIA Nemotron-Mini-4B** extracts the core insight, classifies intent, generates tags and recall questions\n3. **Qwen2.5-VL-7B** handles image captures — describe a whiteboard, diagram, or screenshot\n4. **BGE-base-en-v1.5** embeds summaries into a 768-dim vector space\n5. Related captures link automatically via cosine similarity\n6. The surface engine resurfaces what you should revisit, weighted by intent and time\n\n## Tech\n\n- **LLM**: `nvidia/Nemotron-Mini-4B-Instruct` via HF Transformers + ZeroGPU\n- **Vision**: `Qwen/Qwen2.5-VL-7B-Instruct` for image capture\n- **Embeddings**: `BAAI/bge-base-en-v1.5` (768-dim, top MTEB retrieval)\n- **Backend**: FastAPI + SQLite (persistent at `/data/mind.db`)\n- **Frontend**: React + TypeScript + Tailwind CSS\n\n## Prizes targeting\n\n- **Backyard AI** track — practical tool solving a real daily problem\n- **Tiny Titan** badge — Nemotron-Mini-4B drives all text intelligence (4B parameters)\n- **NVIDIA** — built on `nvidia/Nemotron-Mini-4B-Instruct`\n\n## Built with\n\nBuilt for the [Build Small Hackathon](https://huggingface.co/build-small-hackathon) · June 2026\n"
  },
  {
    "repo_id": "build-small-hackathon/color-ux-access",
    "title": "Color Ux Access",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/color-ux-access",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "color",
      "access",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Color-UX-Access\nemoji: 🔍\ncolorFrom: red\ncolorTo: green\nsdk: gradio\nsdk_version: \"6.17.3\"\napp_file: app.py\npython_version: \"3.12\"\ndependencies: requirements.txt\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:modal\n  - achievement:offgrid\n---\n\n# Color-UX-Access\n\n**HF Build Small Hackathon** · Track: Backyard AI · ≤32B parameters · Gradio + HF Space\n\n> 🔍 Test any webpage screenshot for colorblind accessibility issues — 10 CVD simulations + WCAG 2.1 report via 32B VLM.\n\n**Live:** [salgadev-color-ux-access.hf.space](https://salgadev-color-ux-access.hf.space)\n**Code:** [github.com/salgadev/color-ux-access](https://github.com/salgadev/color-ux-access)\n**Built for:** [NARWALL](https://narwall.tech) — automated accessibility testing.\n\n---\n\n## How It Works\n\n```\nScreenshot (file upload or URL capture)\n       │\n       ▼\nStage 1: CVD Simulation (CPU) — 10 variants via DaltonLens\n       │\n       ▼\nStage 2: VLM Inference (GPU) — CohereLabs/aya-vision-32b → WCAG 2.1 JSON\n       │\n       ▼\nStage 3: WCAG Report — Severity · Criterion · Description · Remediation\n```\n\n**VLM:** [CohereLabs/aya-vision-32b](https://huggingface.co/CohereLabs/aya-vision-32b) via HF Router (OpenAI-compatible API).\n**CVD:** 10 types via DaltonLens (Machado2009, Vienot1999, Brettel1997) + Rec.709 grayscale for achromatopsia.\n\nSee `docs/ARCHITECTURE.md` for detailed pipeline internals.\n\n---\n\n## CVD Types Supported\n\n| Type | Description | Prevalence |\n|------|-------------|-----------|\n| Deuteranopia | Red-green (green-deficient) | ~1% males |\n| Protanopia | Red-green (red-deficient) | ~1% males |\n| Tritanopia | Blue-yellow | rare |\n| Deuteranomaly | Red-green (green-weak) | ~5% males |\n| Protanomaly | Red-green (red-weak) | ~1% males |\n| Tritanomaly | Blue-yellow (weak) | rare |\n| Severe Deuteranopia | Full green-deficient | — |\n| Severe Protanopia | Full red-deficient | — |\n| Achromatopsia | Complete 
grayscale | ~0.003% |\n| Achromatomaly | Partial grayscale | rare |\n\n---\n\n## Project Structure & Constraints\n\n- All tests under `tests/` (e.g., `tests/test_*.py`)\n- No root-level fix scripts (`apply_*fixes*.py`, `*_fix*.py`, `insert_cache.py`, etc.)\n- Runtime logic in `app.py`, `server_app.py`, or clearly named helper modules\n- TDD mandatory: write tests first, then implementation\n\n---\n\n## Quick Start\n\n```bash\ngit clone https://github.com/salgadev/color-ux-access.git\ncd color-ux-access\nuv venv --python 3.12\nsource .venv/Scripts/activate\nuv pip install -e \".[dev]\"\ncp .env.example .env   # add HF_TOKEN\npytest -m smoke         # verify setup\npython app/app.py       # launch locally\n```\n\nSee `docs/DEVELOPMENT.md` for full setup, dependency groups, and git workflow.\n\n---\n\n## Development workflow (agents & humans)\n\n**Run the app**\n```bash\nuv run app.py\n```\nGradio does **not** auto-reload on code changes. After any edit to `app.py` or UI-affecting modules, stop the server (`Ctrl+C`) and restart with the command above.\n\n**E2E test flow (Nous Browser)**\n```bash\n# 1. Start the app (see above)\n# 2. In another terminal, run the browser test against the local instance\npytest -m e2e --base-url=http://127.0.0.1:7860\n```\nTest steps: upload `tests/fixtures/UR.webp` → click **Analyze** → verify the CVD grid renders and the WCAG report panel shows criteria rows.\n\n> Detailed contributor rules live in `agents.md`. 
 This section covers the mechanical loop only.\n\n---\n\n## Documentation\n\n| Doc | What it covers |\n|-----|----------------|\n| `docs/ARCHITECTURE.md` | System design, CVD pipeline, VLM prompt, Gradio 6 compat |\n| `docs/DEVELOPMENT.md` | Setup, uv venv, dep groups, git workflow, code style |\n| `docs/TESTING.md` | Test markers, fixtures, TDD pattern |\n| `docs/DEPLOYMENT.md` | HF Space + Modal deploy, known issues |\n| `docs/EVALUATION.md` | Sponsor prize matrix (full detail) |\n| `docs/CVD_USER_AUDIT_MODEL.md` | CVD audit methodology (EN/ES) |\n\n---\n\n## Sponsor Prize Eligibility\n\n| Sponsor                 | Prize | Status                   |\n|-------------------------|-------|--------------------------|\n| HuggingFace             | $15,000 | ✅ Eligible — top project |\n| OpenBMB (MiniCPM-V 4.6) | $10,000 | ✅ Already deployed       |\n| Modal                   | $250 credits | ✅ Already deployed       |\n\n**Required:**\n- ✅ Gradio App: [Space](https://huggingface.co/spaces/build-small-hackathon/color-ux-access)\n- ✅ Demo video: [YouTube](https://www.youtube.com/watch?v=ynwuZNcqRtY)\n- ✅ Social Media Post: [LinkedIn](https://www.linkedin.com/posts/salgadev_build-small-hackathon-build-small-hackathon-share-7472421346992476161--5sG/)\n\n---\n\n## Environment\n\nUse `uv` for isolated venvs — prevents Hermes Agent interference. Install via pyproject.toml groups:\n\n```bash\nuv pip install -e \".\"          # core only (pillow, daltonlens, numpy)\nuv pip install -e \".[dev]\"     # + playwright, pytest\nuv pip install -e \".[space]\"   # + gradio, spaces, torch (for HF Space)\nuv pip install -e \".[all]\"     # everything\n```\n\n## Examples\nVLM should come to similar conclusions than the s"
  },
  {
    "repo_id": "build-small-hackathon/wpl-discovery",
    "title": "Wpl Discovery",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/wpl-discovery",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "wpl",
      "discovery",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Find one implementation idea worth remixing.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Worcestershire Libraries Live Assistant\ncolorFrom: green\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npinned: true\nlicense: mit\nshort_description: Live, source-cited answers about your local library\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:fieldnotes\n---\n\n# Team Members\n\nJulian Elliott ( GameTheory )\n\n# Worcestershire Libraries - Live Assistant\n\nA small-model (**≤ 32B**) agent that answers real questions about Worcestershire\nLibraries from official data **at every granularity** - service pages → every\npage of the Hive's site → catalogue items → the individual copies on a branch's\nshelf - mined live at question time wherever the source allows it.\n\n> Built for the [Build Small Hackathon](https://huggingface.co/build-small-hackathon)\n> · **Backyard AI** track. The \"someone I know\" is **Jack, a library resources\n> manager in Worcester**, whose recurring complaint is that *the library never\n> shouts loudly enough about everything it offers*. 
This is the megaphone.\n\nPrimary Video Demo: [Youtube](https://youtu.be/0JoOqYBQv_4)\n\nPrimary Video Demo: [Screen Studio](https://preview.screen.studio/share/Td6VxRdL?state=uploading)\n\nBackup Video Demo: [Arcade Demo](https://app.arcade.software/share/p5cZQyn7OZUXmR6Kqxbk)\n\n\n## What it does\n\nAsk in plain English and it routes your question to a live tool, reads what comes\nback from the council's own systems, and explains it:\n\n| You ask… | It mines… | Source |\n|---|---|---|\n| “How do I **get** Wolf Hall?” | the item's **copy-level holdings** - which branch has it on the shelf *now*, free reservation, or the eBook on BorrowBox tonight | `wcc.ent.sirsidynix.net.uk` |\n| “Do you have Harry Potter audiobooks?” | the **SirsiDynix catalogue** (books, eBooks, audio, DVDs) | `wcc.ent.sirsidynix.net.uk` |\n| “What can I do at The Hive?” | **every page of the Hive's site** - archives & archaeology, 800+ study spaces, room hire, open 8:30am–10pm daily | `thehiveworcester.org` |\n| “When does the mobile library visit Abberley?” | the **mobile-library timetable** (154 villages) | `worcestershire.gov.uk` |\n| “What's on this week?” | **events & activities** | `worcestershire.gov.uk` |\n| “How do I print from my phone?” | **Print Your Way** steps & prices | `worcestershire.gov.uk` |\n\nEvery reply carries a **“checked live just now” footer** with the tool used and a\nlink back to the official page.\n\nEach answer is designed around a customer journey - what the person is trying\nto do and the next action the reply must hand them. See **[JOURNEYS.md](JOURNEYS.md)**.\n\nAs a trusted hub within the local community, libraries often act as the public face for many wider council functions. They work with tight budgets and a mix of old systems, all while offering a huge range of services; everything from baby activity sessions to business support.\n\nThis simple library chatbot helps cut through this complexity by giving people one clear, modern way to get information. 
Instead of fighting through outdated interfaces, residents can ask questions in plain language and receive quick, accurate answers that reflect everything the library provides.\n\nBecause this low-cost tool can be built to run as a website, add-on or app, it becomes an easy, always-available point of contact. Each response can be tailored to automatically highlight related services using analytics-driven prompts, helping surface both well-used resources and lesser-known parts of the library offer. The result is a lightweight, affordable upgrade that improves access, boosts discovery, and supports libraries without requiring major system changes.\n\n## The honest small-model fit\n\nThe model never invents library facts. The intelligence lives in the **live\nretrieval**; a 7-8B model is more than enough to *route* the question and *phrase*\nthe answer. That's a genuine fit with the brief - not a 32B model pretending to\nknow a catalogue it was never trained on.\n\n- **Model:** `meta-llama/Llama-3.1-8B-Instruct` by default, with an in-app\n  selector for OpenBMB MiniCPM 3 4B and Qwen 2.5 7B (set `MODEL_ID` to swap up\n  to any ≤32B model).\n- **Graceful degradation:** with no `HF_TOKEN`, the app still works in *no-LLM\n  mode* - deterministic keyword routing + the raw live data. The demo never\n  breaks.\n\n## The Hive, with provenance\n\nI originally excluded thehiveworcester.org because parts of it run stale. The\nre-think: **include it page-by-page, but tag every fact with its source** -\nbecause the Hive's site is the *only* place much of Worcester's extended offer\nis described (Explore the Past archives & archaeology, Europe's first joint\nuniversity+public library, 800+ study spaces, room hire, the BIPC, the Youth\nHub). The rules that keep answers trustworthy:\n\n- every Hive fact carries its **source page + crawl date** (`hive_kb.json`,\n  rebuilt by `build_hive_kb.py`)\n- where Hive and council pages conflict (hours, prices, membership), the\n  **council "
  },
  {
    "repo_id": "build-small-hackathon/repair-guy",
    "title": "Repair Guy",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/repair-guy",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "openbmb",
      "nvidia",
      "repair",
      "guy"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Repair Guy\nemoji: 🔧\ncolorFrom: gray\ncolorTo: blue\nsdk: gradio\n# 6.17.3 is the newest gradio that allows huggingface-hub<1.0, which\n# transformers 4.57.x (required by the MiniCPM/ColEmbed remote code) pins;\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: false\npreload_from_hub:\n  - openbmb/MiniCPM4.1-8B # 8b, performs well quantized to int8 but take perf hit. need to try finetuning 1b.\n  - openbmb/MiniCPM-V-4_5 # 8b\n  - nvidia/llama-nemotron-embed-vl-1b-v2 # 1b\ntags:\n  - backyard-ai\n  - openbmb\n  - nvidia\n  - off-the-grid\n  - off-brand\n  - sharing-is-caring\n  - field-notes\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n# Repair Guy: Hands-Free Manual Navigator\n<p align=\"center\">\n  <img src=\"frontend/assets/app_screenshot.png\" alt=\"Repair Guy screenshot\" width=\"700\">\n</p>\n\n**▶️ [Watch the Demo Video](https://www.youtube.com/watch?v=PFUOHoVQUsI)**  \n**🐦 [Social Media Post](https://www.linkedin.com/posts/ray-martinez1_i-spent-the-past-week-working-on-the-build-ugcPost-7472117118377918464-X7Bb/?utm_source=share&utm_medium=member_desktop&rcm=ACoAAB255oEBSd0o7PIGG3mbO4BhG43UTxgV0u0)**  \n**📝 [Read the Field Notes Blog Post](https://raymartinez.dev/blog/repair-guy-field-notes/)**\n\n## 💡 The Problem & Solution\nMechanics with greasy hands can't scroll through 500-page PDFs. **Repair Guy** is a fully local, voice-activated manual navigator. \nIt visually highlights exact diagrams and troubleshooting steps, and allows for precise page navigation, all hands-free.\n\n## ⚙️ The Tech Stack (All <32B Parameters)\n*   **Agent Model:** `openbmb/MiniCPM4.1-8B` (8B) - Handles core logic. 
*(A ~1B brain is on the roadmap for on-device use.)*\n*   **Vision Model:** `openbmb/MiniCPM-V-4_5` (9B) - Handles visual reasoning, component pinpointing, and generating table/image descriptions. *(A smaller VLM is on the roadmap for on-device use.)*\n*   **Parsing Model:** `nvidia/NVIDIA-Nemotron-Parse-v1.2` (0.9B) - Turns dense Toyota Forklift and Hyundai Genesis manual pages into structured elements (sections, tables, figures with bounding boxes) that feed the text-parsed index. *(Runs only in the cloud indexing pipeline.)*\n*   **Embedding Model:** `nvidia/llama-nemotron-embed-vl-1b-v2` (1B) - Produces the dense chunk/query embeddings used for retrieval over the parsed manuals.\n*   **Speech Model:** `moonshine/tiny` (27M) - Runs directly in-browser for ultra-fast, real-time Speech-to-Text.\n*   **Infrastructure:** `Modal` - Powers the batch indexing pipeline and automated model evaluations. *(Note: Indexing was offloaded to the cloud as a time-saving measure and to prevent heavy battery drain on edge devices).*\n*   **Observability:** `Langfuse/In App` - Stores agent execution traces (for future finetuning) and app displays a diagnostic tab.\n\n## 🎛 Other Features (mostly for engineers that want to experiment)\n*   **Speak Responses:** Toggle voice readouts for true hands-free feedback \n*   **Careful Pointing:** Forces the VLM to reason before circling components, increasing accuracy on complex diagrams. (Increased latency but, if used with speak responses, you can get a ping when it's done)\n*   **Dynamic Indices:** Swap between text-parsed indexing (best for specs/tables) and visual ColEmbed indexing (best for diagrams) for fun to see the difference ;)\n*   **Model Swapping:** Swap between different models for the agent brain\n*   **VRAM Logging:** Built-in logging to monitor GPU memory during model load/evict cycles.\n\n## 🏆 Bonus Quests Achieved\n*   **Off the Grid:** 100% local execution. 
Zero external cloud APIs used.\n*   **Off-Brand:** Custom frontend architecture using `gr.Server`.\n*   **Sharing is Caring:** Built-in UI Diagnostic Tab and Langfuse integration for agent traces.\n*   **Field Notes:** Detailed write-up covering the architecture.\n\n## 🚀 How to Test It\n1. Select the **Toyota Forklift** or **Hyundai Genesis** manual.\n2. Click the audio stream or type a message.\n3. Use commands like: \n   * *\"Show me the oil change procedure\"*\n   * *\"Troubleshoot slipping clutch\"*\n   * *\"Go to the next page\"* / *\"Go back a page\"*\n   * *\"Go to page 512\"*\n"
  },
  {
    "repo_id": "build-small-hackathon/global-leaders",
    "title": "Global Leaders",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/global-leaders",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "nvidia",
      "sponsor",
      "achievement",
      "global",
      "leaders",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Global Leaders\nemoji: 🌍\ncolorFrom: green\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npinned: false\nlicense: mit\nshort_description: Govern a real 2025 world leader; a small model runs it all\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - track:wood\n  - nemotron\n  - nvidia\n  - sponsor:nvidia\n  - sponsor:modal\n  - off-brand\n  - off-the-grid\n  - achievement:offbrand\n  - achievement:offgrid\n  - best-agent\n  - best-demo\n  - community-choice\n---\n\n<div align=\"center\">\n\n# 🌍 GLOBAL LEADERS\n\n### *Take the chair. Hold the line. Survive 2025.*\n\n**A political-strategy game where a small language model runs the world —**\n**and you govern a real leader through the real headlines of 2025.**\n\n`🇺🇸 Trump` · `🇧🇷 Lula` · `🇷🇺 Putin` · `🇨🇳 Xi` · `🇦🇷 Milei` · `🇫🇷 Macron`\n\n![hackathon](https://img.shields.io/badge/Build_Small-Thousand_Token_Wood-33ff88?style=for-the-badge)\n![model](https://img.shields.io/badge/NVIDIA_Nemotron-≤32B-76b900?style=for-the-badge&logo=nvidia&logoColor=white)\n![gradio](https://img.shields.io/badge/Gradio-6.x-ffb000?style=for-the-badge&logo=gradio&logoColor=black)\n![local](https://img.shields.io/badge/Runs-100%25_Local_capable-7fd1ff?style=for-the-badge)\n\n</div>\n\n## 🏆 Build Small Hackathon — submission\n\n- **Track:** Adventure in Thousand Token Wood\n- **🎬 Demo video:** [73s walkthrough on YouTube](https://youtu.be/Zfci2XcYfJE)\n- **▶️ Play it live:** https://huggingface.co/spaces/build-small-hackathon/global-leaders\n- **📣 Social post:** [on LinkedIn](https://www.linkedin.com/posts/leonardo-camilo_buildsmall-thousandtokenwood-nvidia-share-7472435234341527552-zZil/)\n- **🧠 Model:** NVIDIA Nemotron 30B-A3B (≤32B, 3B active) — self-hosted on Modal (L40S), or fully local via Ollama\n\n---\n\n```\n╔══════════════════════════════════════════════════════════════════════╗\n║  ● GLOBAL LEADERS            FRANCE · EMMANUEL MACRON         JUL 2025  
║\n╠══════════════════════════════════════════════════════════════════════╣\n║  ▸ EU-US TRADE DEAL COLLAPSES AMID TARIFFS                             ║\n║    Washington slaps 20% on European exports. Brussels wants you to     ║\n║    retaliate; your industries want a deal; the markets want calm.      ║\n║                                                                        ║\n║    🔴 Le Pen: \"Let his government crumble — we inherit the wreckage.\"  ║\n║    🟡 EU Commission: \"Hold the line, or the bloc fractures.\"           ║\n║                                                                        ║\n║    ▶ Pivot to strategic autonomy   ▶ Seek a US exemption   ✎ your move ║\n╚══════════════════════════════════════════════════════════════════════╝\n```\n\nYou take over a **real world leader on 1 January 2025** and govern for **twelve months**, reacting to\nthe real events of that year. A small model (**NVIDIA Nemotron**, ≤32B) is the game master: it writes\nyour objectives, voices your cabinet and your rivals, narrates each crisis and judges your decisions.\nPick a suggested move **or type your own** — it interprets anything you throw at it.\n\n> 🏆 Built for the **Build Small / Thousand Token Wood** hackathon. The whole point: do something rich,\n> reliable and *fun* with a small, cheap, **local-capable** model.\n\n---\n\n## ⚙️ Why this is a *small-model* project (the secret sauce)\n\nLLM games usually fail because the model has to *be* the rules engine — and small models are bad at\narithmetic, state and consistency. **We invert it:**\n\n| | |\n|---|---|\n| 🧠 **The code is the source of truth** | A deterministic Python engine owns the 8 indicators, hidden faction meters, the dice, win/lose logic and every guardrail. |\n| ✍️ **The model only narrates & proposes** | Always through a **validated JSON schema** — parsed, validated, **retried** on failure. 
|\n| 🛡️ **Guardrails clamp creativity** | The engine clamps proposed effects to legal ranges, enforces a *no-free-lunch* trade-off, rolls an uncertainty die, then applies. The model can be wild; it can't break the game. |\n| 🪶 **Token-frugal by design** | Reasoning off (`think:false`), history compressed to a rolling digest, tight role-specific prompts. The header shows your live **token count**. |\n| 🔌 **Never crashes** | No key? A deterministic `FakeLLM` produces schema-valid output, so the demo always runs — perfect for offline judging. |\n\nThe payoff: a **≤32B model reliably runs a 6-country political sim** with named real figures, branching\nconsequences, hidden coups and early game-overs.\n\n---\n\n## 🎮 What you can do\n\n- **🪑 Pick your chair** — 6 leaders, each with a curated deck of **real 2025 events** (domestic *and*\n  international) and an 8–12 person cast of real figures.\n- **⚖️ Make case-method calls** — no single right answer, incomplete information, conflicting stakeholders.\n- **♟️ Play the game theory** — every figure has its own utility vector and a written persona (in\n  [`engine/prompts/countries/`](engine/prompts/countries)); they reward or punish you based on *their*\n  interests, not yours.\n- **🍽️ Take rivals to lunc"
  },
  {
    "repo_id": "build-small-hackathon/Tiny-Debate-Club",
    "title": "Tiny Debate Club",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Tiny-Debate-Club",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "tiny",
      "debate",
      "club",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 5,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Tiny Debate Club\nemoji: 🎭\ncolorFrom: purple\ncolorTo: pink\nsdk: gradio\nsdk_version: \"6.17.3\"\napp_file: app.py\npinned: false\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - gradio\n  - computer-vision\n  - multi-agent\n  - tiny-model\n  - ai-agents\n---\n\n# 🎭 Tiny Debate Club\n\n> What happens when multiple anime-inspired AI agents see the same image… and completely disagree about what it means?\n\nTiny Debate Club is a chaotic multi-agent AI experience where different personalities debate uploaded images in real time.\n\nUpload any image and watch:\n\n- 🧪 **Dr. Stone (Senku):** Breaks down the physics and calls magic 10 billion percent impossible.\n- 🍜 **Naruto Uzumaki:** Finds the hidden pain and power of friendship in every pixel.\n- 🍎 **Light Yagami:** Reframes the scene as a chess piece in his perfect game of justice.\n- 👊 **Saitama:** Undercuts the drama because he's late for a supermarket bargain sale.\n- ⚖️ **The Grand Arbiter:** A theatrically pompous judge who decides the winner based on pure logic.\n\nBuilt for the 🍄 Thousand Token Wood track of the Hugging Face Build Small Hackathon.\n\n---\n\n# 🎬 Demo Video\n\n[▶️ Watch the Demo Video Here](https://www.youtube.com/watch?v=b1gtegN4__A)\n\n---\n\n# 📱 Social Post\n\n[View the LinkedIn Post](YOUR_LINKEDIN_POST_URL)\n\n---\n\n# ⚡ Features\n\n- Multi-agent AI debates\n- Image understanding using Google's Gemma 3 4B\n- Dynamic personality interactions\n- Real-time conversational chaos\n- Fun, weird, and surprisingly believable arguments\n\n---\n\n# 🛠 Tech Stack\n\n- Gradio\n- Python\n- Google Gemma 3 4B (Vision-Language Model)\n- Modal (Serverless Inference)\n- Hugging Face Spaces\n\n---\n\n# 🍄 Hackathon Track\n\n**An Adventure in Thousand Token Wood**\n\nBuild something delightful that wouldn't exist without AI.\n\n---\n\n# 🤖 Why AI?\n\nTiny Debate Club only works because multiple AI personalities interpret both:\n\n1. the uploaded image\n2. 
each other’s arguments\n\nThe experience becomes unpredictable, chaotic, and weirdly human.\n\n---\n\n# 🧠 Small Model Philosophy\n\nLarge models try to sound correct.\n\nTiny models confidently start arguments.\n\nThat makes them perfect for Tiny Debate Club.\n\n---\n\n# 🚀 Try It Yourself\n\nUpload an image and witness artificial personalities spiral into intellectual chaos in real time.\n"
  },
  {
    "repo_id": "build-small-hackathon/TurboSkillSlug",
    "title": "Turboskillslug",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/TurboSkillSlug",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "turboskillslug",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: TurboSkillSlug\nemoji: 🐌\ncolorFrom: purple\ncolorTo: yellow\nsdk: gradio\nsdk_version: \"6.16.0\"\npython_version: \"3.12\"\napp_file: app.py\npinned: false\nshort_description: Turn a coding session into a skill, recap, and shell.\ntags:\n  - hackathon\n  - build-small-hackathon\n  - track:wood\n  - sponsor:openai\n  - sponsor:modal\n  - achievement:welltuned\n  - achievement:offbrand\n  - achievement:fieldnotes\nmodels:\n  - legendarydragontamer/slugvoice-qwen2.5-1.5b-lora\n  - legendarydragontamer/slugextract-qwen2.5-1.5b-lora\n  - Qwen/Qwen2.5-1.5B-Instruct\n  - openai/whisper-large-v3-turbo\ndatasets:\n  - legendarydragontamer/turboskillslug-groundedness-eval\n---\n\n# TurboSkillSlug\n\n**Feed it a coding session. Get back a reusable SKILL.md, a grounded spoken\nrecap, and a procedural shell that encodes the whole session as art.**\n\nThe session goes in one of two ways: narrate it aloud, or drop a real Claude Code\nor Codex CLI session log (`.jsonl`). A fine-tuned 1.5B model extracts what you\ntried, what failed, and what finally worked. Total active pipeline: **~2.6B\nparameters**, measured to match a 7B on groundedness at a third the size.\n\nThen the slug gives you four things:\n\n1. **A SKILL.md** another LLM can actually use: the non-obvious gotchas\n   (symptom → cause → fix), the approaches that fail and why, the breakthrough.\n   Built to give a frontier model real uplift, not a session summary.\n2. **A spoken recap** in a fine-tuned \"slug voice,\" every line grounded in\n   something that happened, never reciting invented numbers.\n3. **A shell** whose spiral, knots, jewels, and colors all derive from your\n   session, born on screen as a scroll that unrolls along its own arm, with a\n   byōbu-style battle inked across it (dead ends are fallen warriors, the\n   breakthrough is a dragon).\n4. 
**A receipt** like a thermal printout: approaches tried, dead ends, mood.\n\n### The slug witnesses every kind of session, not just debugging\n\nMost coding sessions are not bug hunts. They are exploring an unfamiliar repo,\nwriting docs, setting up tooling, building a feature. A witness that only has\neyes for \"what broke and what fixed it\" leaves those sessions with a hollow\nshell.\n\nSo the slug detects the session's genre (debugging, exploration, authoring,\nfeature, refactor, setup) and witnesses the right thing for each:\n\n- debugging → the struggle: dead ends and the breakthrough\n- exploration → the discoveries: the non-obvious facts learned about the codebase\n- authoring → the decisions, and the false assumptions caught before they became wrong docs\n- feature / refactor / setup → what was built or changed, and what would break if done naively\n\nThe shell's vocabulary adapts with it: for an exploration session the rim jewels\nare discoveries, the aperture is the clearest insight. Genre detection is pure\npattern-matching: no model call, no added latency.\n\nWhy this matters concretely: on a real exploration trace, the slug surfaced that\na project's checkpoint mirror uses a custom git ref namespace\n(`refs/entire/...`) that a standard `git fetch --all` will miss. That is exactly\nthe kind of private, non-derivable knowledge a SKILL.md exists to carry, and it\ncame from a session that had no \"bug\" at all.\n\nEvery shell is unique because every session is unique.\n\n## Try it in one click\n\nTwo tabs, two sample inputs:\n\n- **narrate aloud** — a sample build session (audio)\n- **drop a session trace** — a sample Claude Code `.jsonl` trace\n\nOr bring your own: upload a recording, or drag a real session log from\n`~/.claude/projects/.../*.jsonl` or `~/.codex/sessions/.../*.jsonl`. 
Judges can\nfeed their own agent logs and watch the slug read them.\n\n## Demo\n\nWatch the demo: **[youtu.be/qSP9olWRv7o](https://youtu.be/qSP9olWRv7o)**\n\n## Social\n\nThe launch post: **[x.com/anubhav27071997](https://x.com/anubhav27071997/status/2063970171010826540)**\n\n## Why this is hard the right way\n\nThe slug's entire promise is a witness that only says what it saw. That makes\n**groundedness** the core engineering problem: a small model that invents facts\nis worthless here. So we measured it, honestly, and published the data.\n\n### Groundedness: does the small model hallucinate more than the 7B it replaced?\n\nOn 25 held-out transcripts, comparing the shipped fine-tuned 1.5B against the\nQwen-7B it replaced and its own un-tuned 1.5B base:\n\n| system | semantic groundedness | lexical | parse | facts |\n|---|---:|---:|---:|---:|\n| prompted 7B | 0.716 | 0.576 | 24/25 | 272 |\n| prompted 1.5B base | 0.565 | 0.390 | 21/25 | 140 |\n| **fine-tuned 1.5B LoRA** | **0.762** | 0.378 | 21/25 | 195 |\n\nThe fine-tuned 1.5B **matches and slightly exceeds the 7B (0.76 vs 0.72) at a\nthird of the active size.** It does this by paraphrasing rather than copying:\nlowest lexical overlap, highest semantic groundedness, the signature of a model\nthat restates meaning instead of echoing words.\n\nReported with its costs, not spun: the LoRA produces valid JSON less often\n(21/25 vs 24/25), and the semantic metric passed 5/6 hand-labele"
  },
  {
    "repo_id": "build-small-hackathon/Tianwen",
    "title": "Tianwen",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Tianwen",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "tianwen",
      "summary",
      "available",
      "yet.",
      "gradio",
      "build-small-hackathon"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Tianwen\nemoji: 🔮\ncolorFrom: indigo\ncolorTo: pink\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: mit\ntags:\n  - build-small-hackathon\n  - track-thousand-token-wood\n  - badge-off-the-grid\n  - badge-well-tuned\n  - badge-sharing-is-caring\n  - badge-field-notes\n  - prize-openbmb\n  - prize-modal\n  - prize-tiny-titan\n  - minicpm\n  - llama-cpp\n  - on-device\n---\n\n# 天问 · Tianwen\n\n**An entirely on-device Eastern self-reflection app — a 1B model that translates 3,000 years of\nChinese metaphysical language into one small, doable step for today.**\n\n> Built for the **Build Small Hackathon**. Named after Qu Yuan's ancient poem *Tianwen* (《天问》,\n> \"Questions to Heaven\") — a reminder that the most worthwhile questions are the ones we ask ourselves.\n\n🔗 **Demo video:** [8-min walkthrough](https://huggingface.co/spaces/build-small-hackathon/Tianwen/resolve/main/video/Tianwen%20Demo%20Video%208min.mp4) ·\n🔗 **Social post:** [r/huggingface](https://www.reddit.com/r/huggingface/comments/1u6x7ui/i_finetuned_minicpm51b_to_turn_chinese_astrology/) ·\n🤗 **Model:** `build-small-hackathon/tianwen-minicpm5-1b` ·\n🤗 **Dataset:** `build-small-hackathon/tianwen-distill`\n\n<video controls src=\"https://huggingface.co/spaces/build-small-hackathon/Tianwen/resolve/main/video/Tianwen%20Demo%20Video%208min.mp4\" title=\"Tianwen — 8-min demo\"></video>\n\n---\n\n## What it is\n\nTianwen takes a deeply esoteric domain — Chinese BaZi (八字) astrology and I-Ching (六爻) divination —\nand turns it into a gentle, gamified instrument for **self-reflection**. 
It never tells fortunes; it\nreframes ancient symbols as everyday psychological language and always ends with one concrete action.\n\nFour readings, from the widest lens to the closest moment:\n\n- **命盘 Chart** — your whole-life portrait (BaZi four pillars)\n- **年运 Year** — this year's tone, plus twelve months one at a time\n- **日盘 Today** — today's energy + a daily sigil wallpaper + a 30-day mood trend\n- **请事 Oracle** — toss three coins six times for an I-Ching reading on a question you hold\n\nLanguages: **简体 / 繁體 / English**. Themes: Ru-kiln White / Xuanwu Black. Installable PWA.\n\n## The idea & the tech (write-up)\n\nThe hackathon's thesis is *\"the future of AI doesn't have to live in someone else's data center.\"*\nTianwen is a full app where **everything that matters runs on your own machine**:\n\n| Layer | How |\n|---|---|\n| **Charting** | Deterministic — `lunar-python` computes BaZi / liunian / liuyue / hexagrams locally. No model guesses dates. |\n| **Interpretation** | A **fine-tuned MiniCPM5-1B** (1B params) served via **llama.cpp**, OpenAI-compatible. Distilled from a teacher model into our house voice (plain, second-person, anti-doom). |\n| **Safety** | A **deterministic** guardrail (`safety/intercept`) — crisis words trip an instant circuit-breaker to hotlines; ominous symbols are re-mapped to growth language. Never left to a 1B model's probabilities. |\n| **Fallback** | If the LLM is unreachable, a rules engine still produces a full reading — so it works **offline**. |\n\nWe didn't prompt our way to the voice — we **taught** it. 
See the\n**[Fine-Tuning Field Notes](docs/FINETUNE_REPORT.md)** for the full distillation → LoRA → GGUF →\ndeployment story (and every bug we hit).\n\n**Stack:** MiniCPM5-1B · LLaMA-Factory (LoRA, bf16) · Modal (A100 training) · llama.cpp (GGUF Q4_K_M) ·\nHF ZeroGPU (inference) · FastAPI + Gradio (backend) · React + shadcn/ui + Tailwind (frontend) · lunar-python.\n\n## Tracks & badges we're going for\n\n- **Track — 🌲 Thousand Token Wood**: an esoteric professional domain made playful and self-reflective with a tiny model.\n- 🏅 **Off the Grid** — fully local, zero external network requests, installable offline PWA.\n- 🏅 **Well-Tuned** — custom distilled dataset + LoRA fine-tune of MiniCPM5-1B, healthy loss curve.\n- 🏅 **Sharing is Caring** — open dataset, open training/distillation scripts, and a textbook-grade build log.\n- 🏅 **Field Notes** — a detailed, honest construction log with the full bug chain.\n- 🏆 **OpenBMB** — built on MiniCPM5-1B.\n- 🏆 **Modal** — distillation/training pipeline ran on Modal A100.\n- 🏆 **Tiny Titan** — a 1B model doing what you'd expect a 4B+ to do.\n\n## How it meets the requirements\n\n| # | Requirement | How Tianwen meets it |\n|---|---|---|\n| REQ-01 | Under 32B | MiniCPM5-**1B** (and the rules fallback uses no model at all). |\n| REQ-02 | Gradio app on the org | Deployed as a Gradio Space (`app_file: app.py`) under `build-small-hackathon`. |\n| REQ-03 | Demo video | Linked at the top of this README. |\n| REQ-04 | Social post | Linked at the top of this README. |\n| REQ-05 | ≤10 ZeroGPU apps | One app Space + one ZeroGPU model Space. |\n| REQ-06 | Tagged README | Track + badge tags in the frontmatter; write-up above. |\n\n## Run locally\n\n```bash\n# 1) Build the React frontend (the backend serves this build)\ncd frontend && pnpm install && pnpm build && cd ..\n\n# 2) Start the backend\npython app.py                      # → http://127.0.0.1:7860/"
  },
  {
    "repo_id": "build-small-hackathon/podify",
    "title": "Podify",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/podify",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "podify",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Podify - AI Podcast Generator\nemoji: 🎙️\ncolorFrom: indigo\ncolorTo: purple\nsdk: gradio\nsdk_version: 5.27.0\napp_file: app.py\npython_version: \"3.10\"\nhardware: zero-a10g\nsuggested_hardware: zero-a10g\npinned: false\nshort_description: Research a topic and turn it into a voiced podcast\ntags:\n  - track:backyard\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\n---\n\n![Podify banner](assets/readme-banner.png)\n\n# 🎙️ Podify — AI Podcast Generator\n\nTurn any topic into a finished, voiced podcast in two phases:\n\n1. **Content** — research agents (LangGraph) use a HuggingFace-hosted LLM plus live\n   DuckDuckGo web search to research the topic and write a speaker-tagged script.\n2. **Audio** — the self-hosted **Fish Audio / OpenAudio S1-mini** model speaks the\n   script, with selectable preset voices and zero-shot **voice cloning** from an\n   uploaded clip or a live mic recording.\n\nEverything runs inside this single Gradio Space; the TTS model runs on **ZeroGPU**.\n\n## Architecture\n\n```\nTopic ─▶ LangGraph: plan ─▶ DDG search ─▶ outline ─▶ write ─▶ Script\nScript ─▶ Fish Audio S1-mini (@spaces.GPU): per-line synth ─▶ stitched podcast WAV\n```\n\n- `app.py` — Gradio Blocks UI (two tabs) wiring both phases.\n- `research/` — `llm.py` (HF Inference client), `search.py` (DuckDuckGo), `graph.py`\n  (LangGraph research graph).\n- `tts/` — `engine.py` (model load + GPU synthesis + multi-speaker stitching),\n  `voices.py` (preset voice registry).\n\n## Configuration\n\nSet these as **Space secrets / variables** (Settings → Variables and secrets):\n\n| Name        | Required | Purpose                                                        |\n|-------------|----------|----------------------------------------------------------------|\n| `HF_TOKEN`  | ✅       | LLM inference (Inference Providers) + model download.          
|\n| `LLM_MODEL` | optional | Override the content LLM (default `Qwen/Qwen2.5-14B-Instruct`, <32B). |\n| `TTS_MODEL_REPO` | optional | Override the TTS model repo (default `fishaudio/openaudio-s1-mini`). |\n\n**ZeroGPU** requires a HuggingFace **PRO** account for the Space owner.\n\n## Run locally\n\n```bash\npip install -r requirements.txt\nexport HF_TOKEN=hf_xxx          # PowerShell: $env:HF_TOKEN=\"hf_xxx\"\npython app.py\n```\n\nPhase 1 (research + script) runs on CPU. Phase 2 (TTS) needs a GPU and the\n`fish-speech` package; on CPU-only machines the UI loads but synthesis is disabled.\n\n## Models Used\n- **Qwen/Qwen2.5-7B-Instruct** for research and script generation\n- **fishaudio/openaudio-s1-mini 0.5b** for audio generation\n\n## Deploy to a Space\n\n```bash\nhuggingface-cli login\nhuggingface-cli upload <user>/podify . --repo-type=space\n# or: git push to the Space remote (preset .wav files tracked via Git LFS)\n```\n\n## Credits / assets\n\n- **Voice samples** (`tts/voices/`): derived from [CMU ARCTIC](http://festvox.org/cmu_arctic/)\n  (free for research and commercial use). Rebuild with `scripts/build_voice_samples.py`.\n- **Background-music loops** (`tts/music_loops/`): [FreePD](https://freepd.com/) by Kevin\n  MacLeod, 100% public domain (CC0). Rebuild with `scripts/build_music_loops.py`.\n  A procedural numpy fallback in `tts/music.py` is used if the loops are absent.\n\n## Contributors\n- **nvipin63**\n- **jayaspjacob**\n\n\n#backyard-ai\n- Blog: [Article](https://huggingface.co/blog/build-small-hackathon/podify)\n- Social Media Post: [Post](https://substack.com/@nvipin63/note/c-276881572?r=637t58&utm_source=notes-share-action&utm_medium=web)\n- Demo: [Video](https://youtu.be/DRVf_Q8IoOI)"
  },
  {
    "repo_id": "build-small-hackathon/arcana",
    "title": "Arcana",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/arcana",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "arcana",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: Arcana\nemoji: 🔮\ncolorFrom: purple\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.14.0\napp_file: app.py\npinned: false\nlicense: mit\ntags:\n  - track:wood\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:llama\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# 🔮 Arcana — a themed concept-tarot generator & reader\n\nName any **domain or theme** — *thermodynamics*, *Lord of the Rings*, *breakfast\nfoods*, *the French Revolution* — and Arcana invents a whole tarot deck for it,\npaints the 22 cards, and reads your fortune in them.\n\nBuilt for the **HF / Gradio Build Small Hackathon** — track *Adventure in\nThousand Token Wood*.\n\n### Submission\n\n- 🎥 **Demo video:** https://youtu.be/j-yLxVL9WPY\n- 📝 **Blog post** (the build, in depth): https://huggingface.co/blog/build-small-hackathon/arcana-blog\n- 📣 **Social post:** https://www.linkedin.com/posts/jeremy-e-b-guntoro-8406a5174_huggingface-buildsmallhackathon-ai-share-7472048547522134016-2LBD/\n- 🔍 **Agent traces** (Hub dataset): https://huggingface.co/datasets/build-small-hackathon/arcana-deliverables\n\nBadges claimed: *Off the Grid* (no cloud APIs), *Off-Brand* (custom UI), *Llama\nChampion* (llama.cpp), *Sharing is Caring* (open agent traces), *Field Notes*\n(write-up).\n\n## How it works — invent the meanings, then reason over them\n\nThe load-bearing AI move is **not** the art (plenty of tools draw tarot cards).\nIt's the **semantic invention** — deciding what each domain concept *means\narchetypally* — and then the **progressive reasoning over a spread** of those\nconcepts in combination. 
Both are language tasks, so a small ≤32B text model is\nthe right core.\n\n```\ntheme + visual style\n      │\n      ▼\n Deck Designer (Qwen3-14B)     maps the 22 Major Arcana → in-theme concepts (+ why)\n      │\n      ▼\n Scribe (Qwen3-14B)            invents each card's meaning + art brief from the concept\n      │                        (fanned across the GPU slots — many cards at once)\n      ├─► image model (FLUX.2-klein) → 22 central arts + one deck-back (parallel)\n      │        │\n      │        ▼\n      │   Compositor (Pillow)  art + reusable gold frame + crisp title/numeral\n      ▼\n Reader / Oracle (Qwen3-14B)   draws a spread and turns the cards one at a time —\n                               a partial read as each lands, the synthesis held to the end\n```\n\nFor example, mapping *thermodynamics*: **Entropy → Death**, **Absolute Zero →\nThe Hermit**, **The Critical Point → The Tower**. The fit has to feel\n*clever-but-right*, never arbitrary — that quality is the whole point.\n\n## Off the grid — open models, ≤32B, running on the model in front of you\n\nNo closed/frontier models, and **no cloud inference APIs**. Everything runs\n**locally on the Space's own GPU** (Hugging Face ZeroGPU):\n\n- **Language — [Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B)** (Apache-2.0,\n  ~14.8B), one model for all three roles (mapping, scribe, reading), served\n  in-process via **llama.cpp** (a `Q4_K_M` GGUF).\n- **Image — [FLUX.2-klein](https://huggingface.co/black-forest-labs/FLUX.2-klein-9B)**\n  (~9B, distilled, 4 steps), served in-process via **diffusers**.\n\nThe scribe and the painter are each **fanned out across the ZeroGPU slots** so the\n22 cards' meanings and images are produced in parallel.\n\n## Using it\n\n- **Generate New Deck** — type a theme, pick a visual style (Rider-Waite-Smith,\n  Thoth, Marseille, Playful, or your own *Custom* look), conjure the deck. 
The\n  deck view fills in card-by-card; download it as a self-contained **`.zip`**.\n- **View Deck / Read from Existing Deck** — upload a deck `.zip` you downloaded\n  earlier; it's reconstructed and opened (portable, no server-side storage).\n- A reading turns the cards face-up one at a time with a partial interpretation\n  each, then weaves them into one synthesis.\n\n## Configuration (Space defaults to fully local)\n\n| env var | value on the Space | meaning |\n|---|---|---|\n| `LLM_BACKEND` | `local` | run the LLM in-process on the GPU (llama.cpp) |\n| `IMAGE_BACKEND` | `local` | run the image model in-process on the GPU (diffusers) |\n| `LOCAL_IMAGE_MODEL` | `black-forest-labs/FLUX.2-klein-9B` | the image model |\n| `HF_TOKEN` | — | Hub access for the model weights (Space **secret**) |\n\nThe LLM defaults to `unsloth/Qwen3-14B-GGUF`. There is no cloud-inference path —\nevery model runs in-process on the Space's own GPU. It's \"off the grid\".\n\n## Assets & licensing\n\nApp code is MIT. Text is generated by **Qwen3-14B** (Apache-2.0). Card art is\ngenerated by **FLUX.2-klein-9B** — *\"This FLUX Model is licensed by Black Forest\nLabs Inc. under the FLUX.2 Non-Commercial License\"* — so generated decks are for\n**non-commercial** use. Title/numeral type is\n[Cinzel](https://fonts.google.com/specimen/Cinzel) (OFL).\n"
  },
  {
    "repo_id": "build-small-hackathon/quantsafe-certifier",
    "title": "Quantsafe Certifier",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/quantsafe-certifier",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "modal",
      "quantsafe",
      "certifier",
      "summary"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Inspect how the app uses models, traces, or datasets.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: QuantSafe Certifier\nemoji: 🛡️\ncolorFrom: red\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npinned: false\nlicense: apache-2.0\nshort_description: Signed release-gate records for quantized small models.\ntags:\n  - track:backyard\n  - sponsor:openai\n  - sponsor:modal\n  - sponsor:nvidia\n  - sponsor:openbmb\n  - achievement:offbrand\n  - achievement:welltuned\n  - achievement:sharing\n  - achievement:fieldnotes\n  - achievement:llama\n  - safety\n  - safety-evaluation\n  - quantization\n  - llm\n  - refusal\n  - text-classification\n  - modernbert\n  - gradio\n  - backyard-ai\n  - model-evaluation\n  - agents\n  - multi-agent\n  - ed25519\n  - cryptography\n  - attestation\n  - provenance\n  - model-supply-chain\n  - release-gating\n  - arxiv:2606.10154\n  - llama-cpp\n  - gguf\n  - modal\n  - codex\nmodels:\n  - Qwen/Qwen3-0.6B\n  - Qwen/Qwen3-1.7B\n  - Qwen/Qwen2.5-1.5B-Instruct\n  - meta-llama/Llama-3.2-1B-Instruct\n  - Qwen/Qwen3-8B\n  - HuggingFaceTB/SmolLM3-3B\n  - Qwen/Qwen3Guard-Gen-0.6B\n  - ibm-granite/granite-guardian-3.3-8b\n  - nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3\n  - openbmb/MiniCPM4.1-8B\n  - Crusadersk/quantsafe-refusal-modernbert\n---\n\n# QuantSafe Certifier\n\n**QuantSafe creates a release-target-bound, Ed25519-signed, tamper-evident release-screen record for a published quantized model.** For the 11 published AWQ/GPTQ checkpoints in the measured matrix, record v2 signs a publisher-linked Hub revision plus a content-addressed manifest of the frozen matrix, validation report, judge results, scorer, artifact mapping, and signing policy.\n\nThe signature proves issuer identity and payload integrity. It does **not** prove that a model is safe. RTSI is a study-internal triage signal that decides whether a configuration clears this screen, needs review, or must be routed to direct safety evaluation.\n\nThe historical study did not retain cryptographic weight digests. 
The signed\nrevision is therefore an explicit release target linked by the publisher, not\nproof that those exact weights generated the historical measurement.\n\n**Research basis:** Sahil Kadadekar, [*Quality Is Not a Safety Proxy Under Quantization*](https://arxiv.org/abs/2606.10154), arXiv:2606.10154 (2026 preprint).\n\n**Who uses it.** I publish 11 public GPTQ/AWQ 4-bit checkpoints. QuantSafe is the release gate I built for that catalog after a retrospective audit found that ordinary quality results could hide severe refusal damage.\n\n| Audited artifact | Immutable revision | Finding | Release-gate action |\n|---|---|---|---|\n| [`phi-2-gptq-4bit`](https://huggingface.co/Crusadersk/phi-2-gptq-4bit) | [`6385e88d733f…`](https://huggingface.co/Crusadersk/phi-2-gptq-4bit/tree/6385e88d733fe95b67dc6d18f264b83c6462e681) | RTSI `0.6199` (`HIGH`) | `ROUTE` |\n| [`qwen2.5-1.5b-gptq-4bit`](https://huggingface.co/Crusadersk/qwen2.5-1.5b-gptq-4bit) | [`4e1c7d4d78a3…`](https://huggingface.co/Crusadersk/qwen2.5-1.5b-gptq-4bit/tree/4e1c7d4d78a3fbb82742207baa7ac305bd836cb5) | RTSI `0.7864` (`HIGH`, matrix maximum) | `ROUTE` |\n\n[Open the Space](https://huggingface.co/spaces/build-small-hackathon/quantsafe-certifier) · [Watch the 36-second judge demo](demo/quantsafe-demo.webm) · [Download the social-ready MP4](demo/quantsafe-demo.mp4) · [Browse the GitHub source](https://github.com/Sahil170595/huggingface-RTSI) · [Browse the Space source](https://huggingface.co/spaces/build-small-hackathon/quantsafe-certifier/tree/main) · [Read the paper](https://arxiv.org/abs/2606.10154) · [Field notes](FIELD_NOTES.md) · [Agent trace](AGENT_TRACE.md) · [Judge benchmark dataset](https://huggingface.co/datasets/Crusadersk/quantsafe-judge-benchmark) · [Adversarial audit](SECURITY_AUDIT.md) · [Launch post](https://www.linkedin.com/posts/sahilkadadekar_quantsafe-certifier-a-hugging-face-space-activity-7472355496486711296-Rgl9) · [Launch 
thread](https://x.com/KadadekarSahil/status/2066592448172720210) · [Launch article](https://huggingface.co/blog/build-small-hackathon/quantsafe)\n\n**Built & audited in the open.** The full agent build/audit trace is published at [Crusadersk/quantsafe-agent-trace](https://huggingface.co/datasets/Crusadersk/quantsafe-agent-trace).\n\n## Sponsors, prizes & badges at a glance\n\nEvery partner below is a load-bearing runtime or build dependency, not a metadata mention; deeper sourced evidence for each is in the sections further down.\n\n| Partner | How it is load-bearing here | Prize fit |\n|---|---|---|\n| **Modal** | Authenticated A10G endpoints run the live constitutional debate and regenerate the judge cache — Modal powers **both development and runtime** | Best Use of Modal |\n| **OpenBMB** | `MiniCPM4.1-8B` is a live debater (flips `DEPLOY → ROUTE` after critique) and a benchmarked guard on the external N=400 BeaverTails set | Best MiniCPM Build |\n| **NVIDIA** | `Nemotron-Safety-Guard-8B-v3` is one of three independent-family judge models (native BF16 through the Modal `/judge` backend) | Nemotron Hardware Prize |\n| **OpenAI** | Codex co-built + hardened major lanes — pa"
  },
  {
    "repo_id": "build-small-hackathon/karate-wiener",
    "title": "Karate Wiener",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/karate-wiener",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "karate",
      "wiener",
      "summary",
      "available"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 4,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Karate Wiener\nemoji: 🥋\ncolorFrom: green\ncolorTo: gray\nsdk: gradio\nsdk_version: 6.9.0\napp_file: app.py\npython_version: \"3.10\"\nsuggested_hardware: zero-a10g\npinned: true\nlicense: apache-2.0\nhf_oauth: true\nshort_description: Create your own karate moves with a sausage man\ntags:\n  - track:wood\n  - sponsor:nvidia\n  - sponsor:openbmb\n  - achievement:offbrand\n  - achievement:welltuned\n  - achievement:fieldnotes\n  - achievement:tinytitan\n  - achievement:bestdemo\nmodels:\n  - nvidia/Kimodo-SMPLX-RP-v1\n  - McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp\n  - McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised\n  - openbmb/VoxCPM-0.5B\n  - black-forest-labs/FLUX.2-klein-4B\n  - nvidia/nemotron-3-nano-30b-a3b\n  - polats/weiner-klein-lora\n---\n\n# Karate Wiener 🥋\n\nA whimsical voice-chat dojo where **Karate Wiener** — a wise hotdog sensei —\nteaches you kata. Talk to him by voice, and he generates and demonstrates\nkarate moves on a 3D character in real time, then builds you a custom dojo to\ntrain in.\n\n**Build Small Hackathon** · Track: *Thousand Token Wood* ·\nSponsors: NVIDIA (Nemotron), OpenBMB (VoxCPM) ·\nAchievements: Off Brand, Well Tuned, Field Notes, Tiny Titan, Best Demo\n\n🚀 **Live Space:** https://huggingface.co/spaces/build-small-hackathon/karate-wiener\n📹 **Demo video:** https://www.youtube.com/watch?v=t1j7ZGs03ps\n📣 **Social post:** https://www.linkedin.com/feed/update/urn:li:activity:7472190937620951040/\n\n## Tracks & badges\n\nWhat each tag in the YAML block above means:\n\n- **`track:wood`** — Thousand Token Wood (whimsical, fun).\n- **`sponsor:nvidia`** — Nemotron (Skill Forge); powers Karate Wiener's chat as the fallback text model via NVIDIA NIM.\n- **`sponsor:openbmb`** — VoxCPM, the MiniCPM-based voice-cloning TTS for Wiener's replies.\n- **`achievement:offbrand`** — fully custom Three.js + hand-written CSS/HTML UI, no default Gradio styling.\n- **`achievement:welltuned`** — the 
Karate Wiener LoRA (`polats/weiner-klein-lora`, trained on FLUX.2-klein-4B) is published on HF.\n- **`achievement:fieldnotes`** — build write-up in this README (see [Build notes](#build-notes)).\n- **`achievement:tinytitan`** — built on ≤4B models (VoxCPM ~0.5B, FLUX.2-klein-4B).\n- **`achievement:bestdemo`** — demo video + social post (linked above).\n\n## Tech stack\n\nCustom Gradio + Three.js UI (no default Gradio styling) over a set of small\nmodels, all **< 32B parameters**:\n\n| Role | Model / Space |\n|---|---|\n| Text-to-motion (kata + moves) | `nvidia/Kimodo-SMPLX-RP-v1` |\n| Karate Wiener chat persona + dojo prompt suggestion/refinement | Cohere **Tiny Aya** (sidecar Space), with hosted **Nemotron** (`nvidia/nemotron-3-nano-30b-a3b`, NVIDIA NIM) as fallback |\n| Voice cloning TTS for Wiener's replies | **VoxCPM** ~0.5B — MiniCPM-based (sidecar Space) |\n| Prompt→motion cache embeddings | `McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp` |\n| Dojo scene image generation | **`black-forest-labs/FLUX.2-klein-4B`** isometric render (via the Klein sidecar Space) |\n| Dojo scene image→3D | **TripoSplat** Gaussian-splat conversion |\n\nThe image/3D/TTS/LLM models run as remote ZeroGPU \"sidecar\" Spaces reached via\n`gradio_client`, so the same code path runs locally and on the Space.\n\n## Generation model\n\nThe Space generates on the **SMPL-X 22-joint** skeleton via `kimodo-smplx-rp`\n(set `KIMODO_MODEL` to override). This matches the skeleton used by the kimodo\nweb/kata viewers, so generated clips can be retargeted directly onto skinned\ndisplay characters with no joint remapping.\n\nGenerated animations are saved into the configured Hugging Face Dataset store so\nthey survive Space rebuilds and can be reused by the kata maker. 
Animation IDs\nare deterministic from the normalized prompt, model, seconds, denoising steps,\nseed, and generation schema; regenerating with the same settings returns the\ncached animation instead of spending GPU time again.\n\n> Note: `nvidia/Kimodo-SMPLX-RP-v1` is distributed under NVIDIA's research/R&D\n> model license, which is more restrictive than the NVIDIA Open Model license\n> used by the SOMA models. Review it before public/commercial use.\n\n## Dojo scenes\n\nThe dojo a clip plays in is generated on demand: **Tiny Aya** expands a few\nkeywords into a full scene description, **FLUX** (via the Klein Space) renders an\nisometric image of that dojo, and **TripoSplat** lifts the image into a 3D\nGaussian splat you can orbit around your character.\n\n## Display characters\n\nThe 3D viewer can play a clip as the procedural skeleton or retarget it onto a\nskinned rig (an s&box Citizen ships in `assets/`). The picker in the viewer HUD\nswitches between them; retargeting is rest-pose alignment driven by each clip's\n`global_quats_xyzw`. Additional rigs can be added to `_CHARACTER_CATALOG` in\n`app.py` as `{ id, label, mapping, glb_b64 }`.\n\n## Build notes\n\nA short field report from building Karate Wiener under the < 32B constraint:\n\n- **Everything heavy is a sidecar.** Rather than load every model in one Space,\n  the GPU work (FLUX image gen, TripoS"
  },
  {
    "repo_id": "build-small-hackathon/documind",
    "title": "Documind",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/documind",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "documind",
      "summary",
      "available",
      "yet.",
      "gradio",
      "track"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: DocuMind\nemoji: 🧠\ncolorFrom: blue\ncolorTo: purple\nsdk: gradio\napp_file: app.py\npinned: true\nlicense: mit\nsdk_version: 6.18.0\n\ntags:\n  - track:backyard\n  - achievement:offgrid\n---\n\n# 🧠 DocuMind - Smart Document Analyzer\n\n> AI-powered document analysis and OCR tool | AI驱动的文档分析与OCR工具\n\n## ✨ Features | 功能特性\n\n| Feature | Description | Technology (技术) |\n|---------|-------------|------|\n| 📄 Document Analysis | Parse PDF, Word, TXT files | PyPDF2, python-docx |\n| 🤖 AI Summary | Generate intelligent summaries | GPT-2 (1.5B) |\n| ✍️ AI Rewrite | Rewrite with custom prompts | GPT-2 (1.5B) |\n| 💬 Document Q&A | Answer questions about the document | GPT-2 + TF-IDF |\n| 🖼️ Image Description | AI image captioning | BLIP (0.4B) |\n| 🔍 OCR Recognition | Extract text from images | EasyOCR |\n| 🌐 Bilingual | Chinese & English support | - |\n\n## 🎬 Demo Video | 演示视频\n\n- YouTube: https://youtu.be/1iR65sq0HmA\n\n## 📱 Social Media | 社交媒体\n\n- Twitter (Video): https://x.com/bingeaigc/status/2064999685476257999\n- Twitter (Project): https://x.com/bingeaigc/status/2064999682582196665\n\n## 🔗 Links | 链接\n\n- **Live Demo**: https://huggingface.co/spaces/build-small-hackathon/documind\n- **GitHub**: https://github.com/Binge-666/documind\n\n## 🚀 Quick Start | 快速开始\n\n```bash\npip install -r requirements.txt\npython app.py\n```\n\nOpen http://localhost:7860 | 打开 http://localhost:7860\n\n## 📋 Requirements | 要求\n\n- ✅ AI Model ≤ 32B params (GPT-2: 1.5B + BLIP: 0.4B)\n- ✅ Hugging Face ecosystem\n- ✅ Deployed on HF Spaces\n\n## 🛠️ Tech Stack | 技术栈\n\n- UI: Gradio\n- NLP: GPT-2 (Hugging Face Transformers)\n- Q&A: TF-IDF + Cosine Similarity (scikit-learn)\n- Image AI: BLIP (Hugging Face Transformers)\n- OCR: EasyOCR\n- Document: PyPDF2, python-docx\n\n## 📄 License\n\nMIT"
  },
  {
    "repo_id": "build-small-hackathon/thousand-token-theater",
    "title": "Thousand Token Theater",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/thousand-token-theater",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "openbmb",
      "thousand",
      "token",
      "theater",
      "summary"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Thousand-Token Theater\nemoji: 🎭\ncolorFrom: indigo\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npinned: false\nlicense: mit\nsuggested_hardware: zero-a10g\nshort_description: MiniCPM improv troupe with a hard 1,000-token memory\ntags:\n  - build-small-hackathon\n  - thousand-token-wood\n  - minicpm\n  - openbmb\n  - zerogpu\n  - tiny-titan\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:fieldnotes\nmodels:\n  - openbmb/MiniCPM5-1B\n  - openbmb/VoxCPM2\n---\n\n# 🎭 Thousand-Token Theater\n\nA troupe of small-model actors improvises a one-act play that **you** direct —\nand now you can **hear** it: every actor performs each line aloud in their own\nvoice. The twist that gives the piece its name: the troupe's *entire* shared\nmemory of the play is hard-capped at **1,000 tokens**, measured by the model's own\ntokenizer. As the script grows past the cap, the oldest beats are **forgotten** —\nand the actors carry on with only what still fits.\n\nThe forgetting is the point. You watch the play drift: a plot the cast cared\nabout ten beats ago quietly falls out of memory, a character reappears with no\nidea who they once were, a secret you planted dissolves into the dark. It is a\nliving demonstration of what a small context window does to a story.\n\nBuilt for the **Build Small Hackathon** · track *Adventure in Thousand Token Wood*.\n\n## What's new\n\n- **🔊 Every actor has a voice.** Each line is spoken aloud in a distinct,\n  consistent voice via [`openbmb/VoxCPM2`](https://huggingface.co/openbmb/VoxCPM2).\n  Voices are *designed* per character (grand badger baritone, breathless little\n  wren, croaky toad impresario, velvet narrator) and then cloned per line so each\n  actor sounds like themselves across the whole play. 
Toggle it with **🔊 Voices**.\n- **🧠 The memory meter now drains.** It shows how much of the 1,000-token memory\n  is still free, draining toward empty as the scene fills; when it hits zero the\n  troupe is at capacity and every new line visibly buries an older one.\n- **⭐ Save highlights.** Bookmark a favourite line or a whole scene — saved\n  moments survive even after the troupe forgets them, and you can download them.\n- **🪶 Tiny actors.** The whole troupe now runs on **`openbmb/MiniCPM5-1B`**, a\n  1-billion-parameter model — small enough to leave the GPU free for the voice\n  model, and proof that a *tiny* model can carry an entire improv show.\n\n## Submission Evidence\n\n- Live Space: https://huggingface.co/spaces/build-small-hackathon/thousand-token-theater\n- Public GitHub evidence repo: https://github.com/himanshu748/build-small-hackathon-thousand-token-theater\n- Demo video + social post: https://x.com/i/status/2064354192748110158\n- Field Notes / build report: https://github.com/himanshu748/build-small-hackathon-thousand-token-theater/blob/main/docs/field-notes.md\nNo public traces are claimed for this project yet.\n\n## Hackathon Fit\n\n- Track: Adventure in Thousand Token Wood.\n- Build surface: custom Gradio `Blocks` app hosted as a Hugging Face Space.\n- Model rule: the actors run on `openbmb/MiniCPM5-1B` and the voices on `openbmb/VoxCPM2` — both OpenBMB small models under the `≤32B` limit.\n- OpenBMB angle: two OpenBMB models drive the show end-to-end — MiniCPM5-1B writes every line and counts the 1,000-token memory, and VoxCPM2 speaks every line aloud.\n- Tiny Titan angle: the entire improv troupe is powered by a single **1B** model (`MiniCPM5-1B`) — a genuinely tiny model carrying a whole live performance.\n- Off-Brand angle: custom theater/playbill UI with stage, director controls, per-actor voices, a draining memory meter, a Forgotten panel, and saveable highlights.\n- Off the Grid angle: the app avoids external cloud model APIs; both 
generation and speech run through models loaded on the Space runtime.\n\nNot claimed: OpenAI Codex, Sharing is Caring, Llama Champion, Modal, Well-Tuned, or Best Agent.\n\n## How to play\n\n1. Pick a **setting** (woodland fable, noir alley, derelict starship, royal banquet) and an optional **premise**.\n2. **Raise the curtain** — the Narrator opens the scene, and you hear it.\n3. Press **Play it ▶** to let the next actor improvise (and speak), or **Surprise me 🎲** for a random shock.\n4. Type a **stage direction** in the Director box to steer the play (it, too, can later be forgotten).\n5. Watch the **Troupe memory** meter **drain** toward empty — and the **Forgotten** panel catch what slips away.\n6. Tap **⭐ Save last line** or **🎬 Save this scene** to keep favourite moments; they survive the forgetting and can be downloaded. Use **🔊 Voices** to turn speech on or off.\n\n## What actually runs (no hand-waving)\n\n- **Actors:** [`openbmb/MiniCPM5-1B`](https://huggingface.co/openbmb/MiniCPM5-1B) — an OpenBMB small model (≤32B) — loaded with `trust_remote_code=True` and run **on the Space's ZeroGPU (A10G)** via `@spaces.GPU`. Every line on stage is generated live; nothing is pre-written.\n- **Voices:** [`openbmb/VoxCPM2`](https://huggingface.co/"
  },
  {
    "repo_id": "build-small-hackathon/The-Eighth-Exit",
    "title": "The Eighth Exit",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/The-Eighth-Exit",
    "zone": "Builder Workshop",
    "track": "Backyard AI",
    "tags": [
      "eighth",
      "exit",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Builder-Focused",
    "quest": "🛠 Builder Workshop: Check what small-model technique this project demonstrates.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛠",
    "readme_text": "---\ntitle: The Eighth Exit\nemoji: 👀\ncolorFrom: gray\ncolorTo: blue\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: false\nshort_description: A Text-Based Anomaly Horror Game\ntrack: An Adventure in Thousand Token Wood\ntags:\n    - Thousand Token Wood\n    - Small Model (Qwen/Qwen3-1.7B)\n    - Off the Grid\n    - Off-Brand\n    -  Tiny Titan\n---\n\n# The Eighth Exit\n\nA minimalist text-based anomaly detection game inspired by observation, memory, and pattern recognition.\n\nYour objective is simple:\n\nMemorize the baseline.\n\nDetect anomalies.\n\nReach Exit 8.\n\nEscape.\n\n---\n\n## How to Play\n\n### Rules\n\n* If you see an anomaly, **Turn Back**.\n* If there is no anomaly, **Move Forward**.\n* A wrong decision resets you to Exit 0.\n* Reach Exit 8 to escape.\n\n### Buttons\n\n* **Start** — Begin the current run.\n* **Move Forward** — Use when the corridor matches the baseline.\n* **Turn Back** — Use when you detect an anomaly.\n* **New Game** — Generate a new baseline and start over.\n\n---\n\n## Example\n\nBaseline:\n\n```text\nDOG\nCAT\nFOX\nOWL\nBAT\n```\n\nCorridor:\n\n```text\nDOG\nCAT\nFOX\nOWLX\nBAT\n```\n\nAnomaly detected.\n\nTurn Back.\n\n---\n\n## Anomaly Types\n\nThe game contains multiple anomaly categories:\n\n* Character additions\n* Character removals\n* Character replacements\n* Letter transpositions\n* Duplicate characters\n* Duplicate entries\n* Entry movement\n* Ordering violations\n* Sequence distortions\n\nSome anomalies are obvious.\n\nOthers are designed to be subtle.\n\n---\n\n## Technology\n\nBuilt with:\n\n* Python\n* Gradio\n* Hugging Face Spaces\n* Qwen (model: Qwen/Qwen3-1.7B)\n\n---\n\n## Play Online\n\nSpace:\n\n**[Play here](https://huggingface.co/spaces/build-small-hackathon/The-Eighth-Exit)**\n\n---\n\n## Video\n\nYouTube:\n\n**[Demo](https://youtu.be/bw0ucKZI5_k?si=HxNbGTWkn3zBi2d7)**\n\n---\n\n## Social\n\nX / 
Twitter:\n\n**[Post](https://x.com/nharshav/status/2065082676760162720?s=20)**\n\n---\n\n## Challenge\n\nCan you reach Exit 8 without making a mistake?\n\nGood luck.\n"
  },
  {
    "repo_id": "build-small-hackathon/stage-whisper",
    "title": "Stage Whisper",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/stage-whisper",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "stage",
      "whisper",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Stage Whisper\nemoji: 🎭\ncolorFrom: gray\ncolorTo: purple\nsdk: gradio\nsdk_version: \"6.16.0\"\napp_file: app.py\npinned: false\nlicense: apache-2.0\nsecrets:\n  - ELEVENLABS_API_KEY\n  - HF_TOKEN\n---\n\n# 🎭 Stage Whisper\n\nAn AI-powered interactive theater. Choose a preset drama or describe your own — watch scenes unfold with generated images and voiced dialogue. Interrupt at any time as the Director or as a Character to reshape the story.\n\nBuilt for the [Build Small Hackathon](https://huggingface.co/build-small-hackathon)\n\n**Playwright:** Qwen3-27B · **Images:** FLUX.1-pro · **TTS:** ElevenLabs · **Framework:** Gradio 6.16.0\n"
  },
  {
    "repo_id": "build-small-hackathon/Fabella",
    "title": "Fabella",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Fabella",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "sponsor",
      "achievement",
      "fabella",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Find one way this prototype could become more useful.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Fabella\nemoji: 📖\ncolorFrom: green\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.18.0\napp_file: app.py\npinned: true\nhf_oauth: true\nlicense: apache-2.0\nshort_description: Small words for big questions.\ndatasets:\n  - build-small-hackathon/fabella-traces\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - sponsor:modal\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n---\n\n# Fabella\n\n**Small words for big questions.** Tell Fabella what's going on in a sentence or two. She drafts a short, kind, age-appropriate explanation you can read aloud — a second small model checks it against a six-criterion rubric before you see it.\n\n**Submission for the [Build Small Hackathon](https://huggingface.co/spaces/build-small-hackathon/README) · Track I · Backyard AI.**\n\n[Live demo](https://build-small-hackathon-fabella.hf.space) · [Public GitHub repo](https://github.com/Kiy-K/Fabella) · [HF Space repo](https://huggingface.co/spaces/build-small-hackathon/Fabella) · [Modal app](https://modal.com/apps/khoitruong071510/main/deployed/fabella)\n\n## Demo video\n\n[Watch on YouTube](https://youtu.be/dAoy1GRbEV8)\n\nThe 90-second walkthrough shows the parent flow (situation → age → tone → validated draft → read aloud), the 3-model pipeline (Gemma 4 E4B drafter · Nemotron 3 Nano judge · VoxCPM2 read-aloud), the HF Bucket memory layer, and the anonymized trace dataset. Narration is ElevenLabs (`eleven_multilingual_v2`, voice `Roger`); caption timings are derived from a Whisper `small.en` pass over the synthesized audio.\n\n**Social post:** [X / Twitter](https://x.com/Kiy_K127/status/2066356328466202914?s=20)\n\nSource code: [`Kiy-K/Fabella`](https://github.com/Kiy-K/Fabella)\n\n---\n\n## The neighbor next door\n\nThis is the person I built Fabella for: a parent I know, at 9 p.m., trying to explain to a 6-year-old that the family dog was not coming back. 
She had already had a hard day. She did not have the words she wanted, and she did not have the bandwidth to draft them. She needed a second pair of eyes that could read what she was about to say and tell her whether it would land.\n\n**Backyard AI** is exactly that brief: solve a real problem for someone you actually know. Fabella solves it for a parent in the moment they need help most — translating a hard adult situation into language a small child can hear, then having a second model double-check the draft before a human reads it.\n\n---\n\n## What it does\n\nA parent types one or two sentences about the situation: a parent's hospitalization, a house move, a pet dying, a refusal to buy a phone. They pick the child's age, the child's name, and a tone (gentle, matter-of-fact, playful). The app drafts an explanation in the shape **Opener → Body → Closer → optional \"if they ask more\"**, then a second small model judges the draft against a rubric. The parent reads it, clicks **New version** if it isn't right, or clicks **Read aloud** for VoxCPM2 narration.\n\nThe rubric the judge scores against (six checks, all hard-coded in `judge.py`):\n\n1. All three primary sections (opener, body, closer) are present and non-empty\n2. Body length is appropriate (1–3 short paragraphs, not a wall of text)\n3. Vocabulary matches the child's age\n4. No moralizing, no lecturing, no \"you should feel...\"\n5. No scary or violent content beyond what the situation requires\n6. No invented facts — only what the parent actually said\n\nThe parent sees the validated draft, not a raw model output. If the judge rejects, the drafter gets one revision pass. 
If it still fails, the rule-based fallback runs in `agent.py` so the parent always gets *something* usable.\n\n---\n\n## The two-model pipeline\n\n| Layer | Model | Size | Runtime | Why this model | Why this execution |\n|---|---|---|---|---|---|\n| **Drafter** | `google/gemma-4-E4B-it` | 4B | Modal A10G · vLLM | Apache 2.0, fast on short empathetic text, native tool calling | **LangGraph ReAct** — needs the state machine (draft → validate → revise → end) with tool calls and middleware-driven early exit |\n| **Judge** | `nvidia/NVIDIA-Nemotron-3-Nano-4B-BF16` | 4B | Modal A10G · vLLM | Follows structured-output instructions reliably | **Pydantic v2** + one LLM call + one repair retry — task is bounded, no agent loop needed |\n| **Read aloud** | `openbmb/VoxCPM2` | ~2B | Modal L4 · FastAPI | Apache 2.0, 48 kHz, voice-description control | Separate FastAPI server; only called when the user clicks **Read aloud** |\n| **Voice note** | `nvidia/nemotron-3.5-asr-streaming-0.6b` | 0.6B | Modal T4 · NeMo | Small multilingual streaming ASR with language prompts | Optional **Record** button: transcribes a short parent voice note into the textbox for review before drafting |\n\nThe split is deliberate. The drafter needs agentic machinery (state machine, tool calls, conditional edges, jump-to-end). The judge doesn't — its job is \"receive rubric + draft, return a structured verdict.\" Pydantic gives disciplined output, type safety, and a one-shot repair retry. "
  },
  {
    "repo_id": "build-small-hackathon/noticecheck",
    "title": "Noticecheck",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/noticecheck",
    "zone": "Safety Shield",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "noticecheck",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Useful",
    "quest": "🛡 Safety Shield: Check if the warning or recommendation is understandable.",
    "likes": 2,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🛡",
    "readme_text": "---\ntitle: NoticeCheck\nemoji: 🔎\nauthor: kingabzpro\ncollaborators:\n- Codex\ncolorFrom: indigo\ncolorTo: red\nsdk: gradio\nsdk_version: 6.17.3\napp_file: app.py\npython_version: 3.12\npinned: true\nlicense: mit\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - sponsor:nvidia\n  - achievement:offgrid\n  - achievement:offbrand\n  - achievement:sharing\n  - achievement:fieldnotes\n  - transformers\n  - minicpm5-1b\n  - nemotron-parse\n  - zerogpu\n  - scam-detection\n  - online-safety\n  - pakistan\n  - english\n\nshort_description: Review suspicious Pakistani messages before you act.\n---\n\n# NoticeCheck\n\nThis repository is the local version of the\n[Pakistan Notice Helper Hugging Face Space](https://huggingface.co/spaces/build-small-hackathon/pakistan-notice-helper).\nIt keeps the same notice-checking purpose with a redesigned English interface.\nThe hosted demo uses Hugging Face ZeroGPU, while Docker Compose runs the models\non a local NVIDIA GPU.\n\n**[Try the live demo](https://huggingface.co/spaces/build-small-hackathon/noticecheck)**\n\n![NoticeCheck demo](docs/app-demo.gif)\n\n## Project Links\n\n- [Live Hugging Face Space](https://huggingface.co/spaces/build-small-hackathon/noticecheck)\n- [GitHub repository](https://github.com/kingabzpro/local-notice-check)\n- [LinkedIn project post](https://www.linkedin.com/posts/1abidaliawan_huggingfacehackathon-huggingface-ai-ugcPost-7471594790506192896--_53/)\n- [Demo GIF](docs/app-demo.gif)\n- [Field notes: making NoticeCheck fully local](docs/field-notes.md)\n- [Privacy-safe trace dataset](https://huggingface.co/datasets/build-small-hackathon/pakistan-notice-helper-traces)\n\nNoticeCheck is a safety assistant for suspicious Pakistani messages, bills,\nbank alerts, challans, courier notices, and screenshots. 
It returns:\n\n- a risk label\n- a short explanation based on visible evidence\n- warning signs and safer next actions\n- a brief reply draft when replying is appropriate\n\nNoticeCheck does not verify the sender and does not provide legal or financial\nadvice. Find official contact details independently before paying, clicking,\nreplying, or sharing personal information.\n\n## Runtime\n\n```text\nText or screenshot\n        |\n        v\nCustom Gradio Server frontend\n        |\n        +--> Nemotron-Parse v1.2 for screenshot text\n        |\n        v\nMiniCPM5-1B through Transformers on ZeroGPU\n        |\n        v\nStructured risk assessment\n```\n\n- **Reasoning:** `openbmb/MiniCPM5-1B` through Transformers\n- **OCR:** `nvidia/NVIDIA-Nemotron-Parse-v1.2` through Transformers\n- **Compute:** Hugging Face Spaces ZeroGPU\n- **Interface:** redesigned custom HTML, CSS, and JavaScript\n- **Language:** English only\n\nThe application does not use a remote model API and has no heuristic assessment\nfallback. Model and OCR failures are returned explicitly.\n\nBoth models run through Transformers on the Hugging Face ZeroGPU deployment.\n\n## Run Locally With Docker and CUDA\n\nThe included Compose setup runs the same Transformers pipeline entirely on a\nlocal NVIDIA GPU. 
It does not use ZeroGPU or a remote inference API.\n\nPrerequisites:\n\n- Docker Engine with Docker Compose 2.30 or newer\n- a supported NVIDIA GPU and current NVIDIA driver\n- NVIDIA Container Toolkit configured for Docker Engine on Linux\n- Git LFS for downloading the interface images\n- enough GPU memory for MiniCPM5-1B and Nemotron-Parse v1.2\n\nClone the GitHub repository and download its Git LFS assets:\n\n```bash\ngit clone https://github.com/kingabzpro/local-notice-check.git\ncd local-notice-check\n\ngit lfs install\ngit lfs pull\n```\n\nBefore building, confirm that NVIDIA Container Toolkit exposes CUDA and the GPU\ninside Docker:\n\n```bash\ndocker run --rm --gpus all \\\n  pytorch/pytorch:2.9.1-cuda12.8-cudnn9-runtime \\\n  python -c \"import torch; print(torch.cuda.is_available(), torch.cuda.get_device_name(0))\"\n```\n\nThe command must print `True` and the NVIDIA GPU name. If it fails, configure\nNVIDIA Container Toolkit before starting the application.\n\nBuild and start NoticeCheck:\n\n```bash\ndocker compose up --build\n```\n\nOpen <http://localhost:7860>. The first startup takes longer because both model\nrepositories are downloaded. 
Downloads are retained in the\n`huggingface-cache` Docker volume.\n\nOptional environment overrides can be placed in a local `.env` file:\n\n```dotenv\nNOTICECHECK_PORT=7860\nTRANSFORMERS_MODEL_REPO=openbmb/MiniCPM5-1B\nMODEL_ENABLE_THINKING=0\nHF_TOKEN=\n```\n\nStop the application without deleting downloaded models:\n\n```bash\ndocker compose down\n```\n\nTo also remove the model cache and trace volumes:\n\n```bash\ndocker compose down --volumes\n```\n\n## Repository Layout\n\n```text\napp.py                 Thin Space launcher\nDockerfile             Local CUDA image\ncompose.yaml           Local NVIDIA GPU deployment\napp/\n  cli.py               CLI and startup\n  config.py            Environment configuration\n  model_endpoint.py    Space Transformers inference\n  ocr.py               Nemotron-Parse adapter\n  server.py            Gradio/FastAPI routes\n  service.py           Assessment o"
  },
  {
    "repo_id": "build-small-hackathon/legislation-explainer",
    "title": "Legislation Explainer",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/legislation-explainer",
    "zone": "Learning Library",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "legislation",
      "explainer",
      "summary",
      "available",
      "yet.",
      "gradio"
    ],
    "difficulty": "Useful",
    "quest": "📚 Learning Library: Try one learning prompt and check if the explanation is clear.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "📚",
    "readme_text": "---\ntitle: Legislation Explainer\nemoji: 📚\ncolorFrom: gray\ncolorTo: indigo\nsdk: gradio\nsdk_version: 6.17.3\npython_version: '3.12'\napp_file: app.py\npinned: false\nshort_description: Simplify complex legislation that affects you!\ntags:\n  - track:backyard\n  - sponsor:openai\n  - achievement:fieldnotes\nmodels:\n  - Qwen/Qwen3-14B\n  - sentence-transformers/all-MiniLM-L6-v2\n---\n\n# Legislation Explainer\n\nLegislation Explainer is a Gradio app for helping regular people understand public-interest legislation and how it affects them.\n\n## Links\n\n- Hugging Face Space: https://huggingface.co/spaces/build-small-hackathon/legislation-explainer\n- Live app: https://build-small-hackathon-legislation-explainer.hf.space/\n- GitHub repo: https://github.com/KayO-GH/legislation-explainer\n- Demo video: https://www.loom.com/share/ed7b682ab6774d5499fca4c06c290bfc\n- Social post: https://www.linkedin.com/posts/kwadwo-agyapon-ntra_rag-huggingface-gradio-activity-7470763368237244416-JseD/\n- Blog post: https://kayo-gh.github.io/building-legislation-explainer/\n\n## Motivation\n\nIn May 2026, the Ghana tech community on X/Twitter reacted strongly to the contents of a draft National Information Technology Authority bill proposed by the Ministry of Communication, Digital Technology and Innovations. The draft bill, and 14 others, had been public for months, but the legal language made it hard for many people in the developer ecosystem to evaluate quickly. 
When people noticed proposals around licensing fees, revenue-based charges, and restrictions affecting company formation, the discussion became urgent.\n\nThis project is my attempt to help civic groups, technologists, journalists, students, entrepreneurs, policy watchers, and regular people understand legislation as soon as they come across it.\n\nIt is created for the Hugging Face Build Small Hackathon under the `Backyard AI` track: a practical, small-model assistant for people who need to understand a real bill quickly without reading every clause first.\n\n## Hackathon Fit\n\n- Track: `Backyard AI`\n- Real user: Ghanaian citizens and digital-policy stakeholders who need a clearer view of a bill's practical effects.\n- Small-model constraint: each model used by the app is individually below the hackathon's `<= 32B` cap.\n- Required surface: Gradio app, ready for Hugging Face Spaces through `app.py`.\n\n## Try The Demo\n\n1. Open the live app.\n2. Paste this public NITA bill URL into `Document URL`: `https://moc.gov.gh/wp-content/uploads/2023/03/NITA-NATIONAL-INFORMATION-TECHNOLOGY-AUTHORITY-BILL_-10-07-25.pdf`.\n3. Click `Run analysis` to load the precomputed example analysis.\n4. Ask a follow-up question such as `What should startup founders pay attention to first?`.\n\n_**Note:** The ministry's website does not have consistent uptime. In the event that the file cannot be accessed from the site, you can [download it from here](https://drive.google.com/file/d/1P-cvgp-bX42QU2zijRPpmnE-MNPgXuGD/view?usp=sharing). Also, feel free to experiment with any relevant documents you have. 
Nothing is saved beyond a session._\n\n## What It Does\n\n- Ingests PDF, DOCX, TXT, Markdown, or document URLs.\n- Produces a structured policy brief:\n  - executive summary\n  - bill summary\n  - implementation implications\n  - critique and recommendations\n  - SWOT analysis\n- Supports follow-up Q&A over the generated analysis.\n- Offers deeper full-document answering when the summary is not enough.\n- Includes bundled example-bill assets for faster demos.\n\n## Model And Provider Notes\n\nThe hackathon-safe default is Qwen3 14B through the Hugging Face router, with a small embedding model for retrieval.\n\n- Default generator: `Qwen/Qwen3-14B`\n- Embeddings and chunk retrieval: `sentence-transformers/all-MiniLM-L6-v2`\n- Default credential path: `HF_TOKEN`\n- Default provider: `qwen`\n- Parameter disclosure: `Qwen3-14B` is under 32B, and `all-MiniLM-L6-v2` is far below the cap at ~22.7M.\n\n_A smaller Qwen model could have been used successfully, but given the real-world importance of the task, a model with a higher capacity for reasoning is preferred._\n\n## Local Run\n\n```bash\n# activate virtual env\npip install -r requirements.txt   # alternatively, uv sync to install from pyproject.toml\nsource .venv/bin/activate\ngradio app.py                     # alternatively, uv run gradio app.py\n```\n\nSet these environment variables for the default Qwen path:\n\n- `HF_TOKEN`\n\n## Space Deployment\n\nThis directory is structured as a Hugging Face Space:\n\n- `app.py` exposes `demo` and `app`.\n- `requirements.txt` lists runtime dependencies.\n- `assets/` contains bundled example documents and precomputed artifacts.\n- `services/` contains ingestion, provider, and RAG logic.\n\nUpload the contents of `legislation-explainer/` to a Gradio Space under the hackathon organization.\n"
  },
  {
    "repo_id": "build-small-hackathon/ink-witch",
    "title": "Ink Witch",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/ink-witch",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "ink",
      "witch",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: InkWitch\nemoji: 🪄\ncolorFrom: indigo\ncolorTo: purple\nsdk: gradio\nsdk_version: 6.16.0\npython_version: '3.12'\napp_file: app.py\npinned: false\ntags:\n  - track:wood\n  - sponsor:openbmb\n  - achievement:offgrid\n  - achievement:offbrand\n---\n\n# 🪄 InkWitch\n\n> **Build Small Hackathon submission**. *An Adventure in Thousand Token Wood* Track\n\nYour research grant is running dry, and the loan shark has sent you an ultimatum for the debt. \nHe repossessed your lab and turned it into a witchcraft store. \n\nTo keep your roof and clear the debt, you serve the customers who wander in — each wants an object with a few specific\nqualities, *a key with wings*, *a mug with a cat face*, etc.\n\nSketch glyphs and transmute them to fulfill the customers' orders, so you can receive payment and build your reputation. But watch out, every stroke drains your mana. Reinvest wisely, sketch sparingly, and DON'T GO BANKRUPT!\n\n## Model Pipeline\n1. **Glyph appraisal** — `MiniCPM-V-4.6` looks at your raw glyph and scores it\n   for *recognition* (is it the right object at all?) plus a confidence on each\n   requested *quality*.\n2. **Transmutation** — if the object is recognized, your glyph is fed as a\n   scribble control image into an `SDXL-Flash + Scribble-ControlNet` pipeline,\n   which renders a finished illustration that follows your lines.\n3. **Re-check** — the VLM then re-judges the *transmuted image*, but only on the\n   qualities your glyph **failed**.\n   Your gold and reputation reflect this final artwork — so a rough glyph that\n   the transmutation fleshes out faithfully can win back reputation it missed.\n\nIn other words, you're judged not just on what you drew, but on what your glyph\n*becomes*. 
The AI is load-bearing twice over: once as the judge, once as the artist.\n\n## Models (all running on the Space — Off the Grid)\n\n| Role | Model | Params |\n|---|---|---|\n| Vision judge | [`openbmb/MiniCPM-V-4.6`](https://huggingface.co/openbmb/MiniCPM-V-4.6) | ~1.3B |\n| Image generator | [`sd-community/sdxl-flash`](https://huggingface.co/sd-community/sdxl-flash) | ~2.6B |\n| Scribble control | [`xinsir/controlnet-scribble-sdxl-1.0`](https://huggingface.co/xinsir/controlnet-scribble-sdxl-1.0) | ~1.25B |\n| VAE | [`madebyollin/sdxl-vae-fp16-fix`](https://huggingface.co/madebyollin/sdxl-vae-fp16-fix) | ~84M |\n\nThe image pipeline is adapted from\n[`linoyts/scribble-sdxl-flash`](https://huggingface.co/spaces/linoyts/scribble-sdxl-flash).\nNo external inference APIs are called — both heavy models are eagerly loaded at\nSpace startup and every appraisal + transmutation happens inside one `@spaces.GPU`\nacquisition.\n\n## Tech Notes\n\n- **Custom frontend on `gr.Server`** — `/` serves a hand-written `index.html` +\n  `app.js` + `style.css` (the drawing canvas, shop, and night phase), driven\n  through the official `@gradio/client` so ZeroGPU per-user quota is attributed\n  correctly. `app.launch(ssr_mode=False)` keeps Gradio's SSR from shadowing it.\n- **Per-user state** is keyed on `gr.Request.session_hash`, so simultaneous\n  players get independent games.\n- **One GPU call per Transmute** — glyph appraisal, transmutation, and re-check\n  are fused into a single `@spaces.GPU` function.\n- **Hidden test mode** — click **Reset** 5 times in a row to flip on a test session: \n  unlimited gold/mana, the full request pool, and the image-gen tuning panel. 
\n  Refresh the page to return to a normal run.\n\n## Team Member\n- [gizzleon](https://huggingface.co/gizzleon)\n\n## 🎥 Demo video\n\nhttps://youtu.be/3R139SneUzA?si=zGK5RLvHhFSDOs44\n\n## 📣 Social post\n\nhttps://www.reddit.com/r/aigamedev/comments/1u55h2u/webbased_ai_doodle_game/\n\n## Run it locally\n\nOnly the **image generation** stage (SDXL-Flash + ControlNet) requires a CUDA\nGPU — it does not run on CPU/MPS. The MiniCPM judge runs fine without one, so on\na machine with no CUDA GPU, disable image-gen with `WITCH_WOOD_GEN_ENABLED=0`.\n\n```bash\n# Set up the project with uv (https://github.com/astral-sh/uv)\nuv sync\n\n# Full pipeline — real MiniCPM judge + image generation (image-gen needs a CUDA GPU):\npython app.py\n\n# No CUDA GPU? Disable image-gen only — the real MiniCPM judge still runs\n# (on CPU/MPS) and the SDXL pipe is never loaded:\nWITCH_WOOD_GEN_ENABLED=0 python app.py\n\n# Fully model-free — skips ALL model calls (mock judge, no image generation):\nWITCH_WOOD_MOCK=1 python app.py\n```\n\nRuns on Hugging Face Spaces **ZeroGPU** in production.\n"
  },
  {
    "repo_id": "build-small-hackathon/Cook_with_a_LLM",
    "title": "Cook_With_A_Llm",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/Cook_with_a_LLM",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "cook_with_a_llm",
      "summary",
      "available",
      "yet.",
      "gradio",
      "backyard-ai"
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Explore the Space and write one improvement idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Cook With A LLM\nemoji: 🍲\ncolorFrom: red\ncolorTo: yellow\nsdk: gradio\nsdk_version: 6.15.2\npython_version: '3.12'\napp_file: app.py\npinned: false\nlicense: apache-2.0\ntags:\n  - backyard-ai\n  - well-tuned\n  - off-brand\n  - sharing-is-caring\n  - field-notes\n---\n\n# 🍲 Cook With Me — Multimodal Sous-Chef\n\n> *Snap your fridge. Pick a dish. Cook step by step. Check your progress with a photo.*\n\nA closed-loop multimodal cooking assistant built for the **Hugging Face Small Models / Big Adventures Hackathon (June 2026)**.\n\n---\n\n# Contributors\n\n1. **eldinosaur** - Carlos Castañeda Mora\n1. **Fred1e4** - Fredin Vazquez\n\n---\n\n## 🔗 Links\n\n- 🎥 **Demo video:** https://youtube.com/shorts/c3PikNvKAjQ\n- 📱 **Social post:** https://www.instagram.com/fd_albert14/p/DZnz-oaGorr/\n- 🤗 **Live Space:** https://huggingface.co/spaces/build-small-hackathon/Cook_with_a_LLM\n- 🧠 **Fine-tuned planner:** https://huggingface.co/eldinosaur/cook-with-me-planner-8b\n- 📊 **SFT dataset:** https://huggingface.co/datasets/eldinosaur/cook-with-me-recipes-sft\n\n---\n\n## How it works\n\n```\n📸 Fridge photo  ──▶  [Vision Agent]          identify ingredients\n                            │\n                            ▼\n                      [Recipe Planner]         propose 3 dishes → full recipe JSON\n                            │\n                            ▼\n                      [Nutrition Engine]       per-serving macros (lookup, no hallucination)\n                            │\n                            ▼\n📸 Progress photo ──▶  [Progress Validator]    go / wait / fix verdict\n```\n\n1. **Snap** your fridge or pantry — the fine-tuned vision model identifies every ingredient.\n2. **Pick** one of three AI-suggested dishes tailored to what you have.\n3. **Cook** step by step with a generated recipe and per-serving nutrition info.\n4. 
**Check** your progress by uploading a photo of your pan — the model tells you *go*, *wait*, or *fix*.\n\n---\n\n## Models\n\n| Role | Model | Params | Runtime |\n|---|---|---|---|\n| Vision — ingredients + progress validation | `openbmb/MiniCPM-V-4.6` (fine-tuned) | ~4.6B | `transformers` / ZeroGPU |\n| Recipe planner — dishes + recipe JSON | `openbmb/MiniCPM4.1-8B` → [`eldinosaur/cook-with-me-planner-8b`](https://huggingface.co/eldinosaur/cook-with-me-planner-8b) (fine-tuned) | ~8B | Modal (transformers 4.x) |\n| Step illustrator — per-step images | `FLUX.2-klein-9B` (SDXL-Turbo fallback) | ~9B | Modal (L4) |\n\n**Total: ~21.6B parameters** (≤ 32B cap ✓)\n\n**Two models are fine-tuned:** the vision model on fridge/pantry photos for ingredient\ndetection, and the planner on **2,046 recipe pairs** for reliable recipe-JSON generation.\nThe planner and illustrator run on dedicated **Modal** GPU endpoints (the planner needs\n`transformers` 4.x while the vision model needs 5.x, so they live in separate containers).\n\n---\n\n## Badges targeted\n\n| Badge | Status | How |\n|---|---|---|\n| 🎯 Well-Tuned | ✓ | **Two** fine-tuned models on Hub: MiniCPM-V-4.6 (ingredient detection) + MiniCPM4.1-8B (recipe planner, SFT on 2,046 pairs) |\n| 🎨 Off-Brand | ✓ | Custom recipe-card UI with bespoke CSS components (chips, dish cards, step cards, nutrition pills) |\n| 📡 Sharing is Caring | ✓ | Agent traces shared on Hub |\n| 📓 Field Notes | ✓ | Blog post: \"Building a closed-loop visual cooking coach\" |\n\n---\n\n## Architecture highlights\n\n- **Specialized small models, one pipeline:** a fine-tuned vision model for ingredients/progress, a separately fine-tuned 8B planner for recipe JSON, and a diffusion model for step images — each on the runtime it needs (ZeroGPU + Modal endpoints).\n- **Closed-loop visual validation:** the planner writes the steps → the illustrator renders each step → user cooks → the vision model compares the pan photo and returns *go / wait / fix* — a real 
agent loop, not a wrapper.\n- **Hallucination-free nutrition:** macros come from a lookup table, not LLM arithmetic.\n- **Robust JSON extraction:** multi-strategy parser handles markdown fences, single quotes, and trailing commas so generation failures degrade gracefully.\n\n---\n\n## Track\n\n**Chapter One — Backyard AI** · \"Build something for someone you actually know.\"\n\nSubmission for the Hugging Face Hackathon · June 5–15, 2026.\n"
  },
  {
    "repo_id": "build-small-hackathon/workbench",
    "title": "Workbench",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/workbench",
    "zone": "Career Camp",
    "track": "Backyard AI",
    "tags": [
      "achievement",
      "sponsor",
      "workbench",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Useful",
    "quest": "💼 Career Camp: Find one output that would help someone applying for a job.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "💼",
    "readme_text": "---\ntitle: OpenBMB Local AI Workbench\ncolorFrom: green\ncolorTo: gray\nsdk: gradio\napp_file: app.py\npinned: false\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - achievement:welltuned\n  - achievement:llama\n  - achievement:sharing\n---\n\n📺 **Demo video:** <a href=\"https://huggingface.co/spaces/build-small-hackathon/workbench/resolve/main/screencast.webm\">HF Space Recording</a> ·  \n\n📺 **Social Demo:**\nhttps://twitter.com/zX14_7/status/2066669462179242412\n\n# OpenBMB Local AI Workbench\n\nA Gradio workbench for the Build Small Hackathon: small local models, practical experimentation,\nand a clear path from local demo to Hugging Face Space.\n\n## What This Is\n\nThe project turns the PRD in `HF_PRD_v1.md` into a staged implementation:\n\n1. A working Gradio app shell.\n2. Config-driven OpenBMB model registry.\n3. Local-first inference path through real backends: Transformers, Ollama, llama.cpp,\n   LM Studio/OpenAI-compatible, SGLang, and vLLM.\n4. Field notes for collecting corrections.\n5. Extension points for training, GGUF export, Trackio traces, MCP tools, and agent workflows.\n\n## Hackathon Fit\n\n- **Track:** Backyard AI or Thousand Token Wood, depending on the final user story.\n- **Canvas:** Gradio app, deployable to Hugging Face Spaces.\n- **Small model rule:** target models stay at or below 32B parameters.\n- **Bonus quests:** local-first, field notes/report, possible llama.cpp and trace sharing.\n\n## Quick Start\n\nPython is not currently available on PATH in this workspace shell. 
Install Python first, then:\n\n```powershell\n.\\scripts\\verify_structure.ps1\n& \"$env:LOCALAPPDATA\\Microsoft\\WindowsApps\\python3.11.exe\" -m venv .venv\n.venv\\Scripts\\Activate.ps1\npython -m pip install -r requirements.txt\npython -m pip install -r requirements-dev.txt\n.\\scripts\\run_tests.ps1\npython app.py\n```\n\n## Browser E2E Screenshots\n\nAfter installing Node.js:\n\n```powershell\nnpm install\nnpm run e2e:install\nnpm run e2e\n```\n\nThe Playwright user-story tests run separate Workbench and Plant flows and save documentation\nscreenshots under `assets/e2e/workbench/` and `assets/e2e/plant/`. The Workbench screenshot now runs\nlocal GGUF chat through `llama-cpp-python`; the Plant screenshot runs OpenBMB MiniCPM-V on\n`assets/plant_sample.jpg` only when `RUN_REAL_MODEL_E2E=1` is set; otherwise the browser tests verify real-backend setup\nsurfaces without using mock responses. To record or edit the browser flow manually, run\n`npm run e2e:record`.\n\nGenerated screenshot sets:\n\n- [Workbench home](assets/e2e/workbench/01-workbench-home.png)\n- [Workbench backend status](assets/e2e/workbench/05-backend-status.png)\n- [Plant tool home](assets/e2e/plant/01-plant-home.png)\n- [Plant corrections export](assets/e2e/plant/03-corrections-export.png)\n\n## Template And Reference Apps\n\nThis repo is also a template for focused local-first Gradio apps. 
The first reference app is\nPlant Discovery under `plant/`.\n\n```powershell\n.venv\\Scripts\\python.exe -m plant.app --no-model --port 7861\n```\n\nUse the real OpenBMB VLM path after installing optional plant dependencies:\n\n```powershell\n.venv\\Scripts\\python.exe -m plant.app --model-mode openbmb --port 7861\n```\n\nThe detailed build guide is [docs/TEMPLATE_HOWTO.md](docs/TEMPLATE_HOWTO.md), and the Plant\nDiscovery checklist is [docs/PLANT_DISCOVERY_APP_PLAN.md](docs/PLANT_DISCOVERY_APP_PLAN.md).\nModel and adapter training steps are in\n[docs/PLANT_MODEL_AND_TRAINING_HOWTO.md](docs/PLANT_MODEL_AND_TRAINING_HOWTO.md).\n\nIf `python` is not recognized, install Python 3.11+ from python.org or the Windows Store,\nthen reopen the terminal so PATH is refreshed.\n\nOpen the local URL shown by Gradio, usually `http://127.0.0.1:7860`.\n\n## Project Structure\n\n```text\n.\n|-- app.py\n|-- AGENTS.md\n|-- README.md\n|-- requirements.txt\n|-- config/\n|   |-- models.yaml\n|   `-- training.yaml\n|-- core/\n|   |-- events.py\n|   `-- registry.py\n|-- datasets/\n|   `-- field_notes.py\n|-- models/\n|   |-- placeholder_service.py\n|   `-- model_catalog.py\n|-- ui/\n|   |-- chat_tab.py\n|   |-- dataset_tab.py\n|   |-- export_tab.py\n|   |-- notes_tab.py\n|   |-- traces_tab.py\n|   |-- agent_tab.py\n|   |-- status_tab.py\n|   |-- train_tab.py\n|   `-- vision_tab.py\n|-- data/\n|   `-- .gitkeep\n|-- exports/\n|   `-- .gitkeep\n|-- HF_PRD_v1.md\n`-- HF_PRD_ext.md\n```\n\n## Project Docs\n\nThe working docs live in [docs/README.md](docs/README.md).\n\n- [Task checklist](docs/TASKS.md)\n- [Implementation status](docs/IMPLEMENTATION_STATUS.md)\n- [PRD implementation matrix](docs/PRD_IMPLEMENTATION_MATRIX.md)\n- [Acceptance criteria](docs/ACCEPTANCE_CRITERIA.md)\n- [Roadmap](docs/ROADMAP.md)\n- [Template how-to](docs/TEMPLATE_HOWTO.md)\n- [Plant Discovery plan](docs/PLANT_DISCOVERY_APP_PLAN.md)\n- [Usage guide](docs/USAGE.md)\n- [Architecture guide](docs/ARCHITECTURE.md)\n- 
[Extension guide](docs/EXTENDING.md)\n- [Hackathon submission package](docs/HACKATHON_SUBMISSION.md)\n- [Test and quality policy](docs/ACCEPTANCE_CRITERIA.md)\n\n## Current Truth\n\nThe full PRD is not implemented yet."
  },
  {
    "repo_id": "build-small-hackathon/plant_identification_tool",
    "title": "Plant_Identification_Tool",
    "summary": "No summary available yet.",
    "url": "https://huggingface.co/spaces/build-small-hackathon/plant_identification_tool",
    "zone": "Experiment Lab",
    "track": "An Adventure in Thousand Token Wood",
    "tags": [
      "achievement",
      "sponsor",
      "plant_identification_tool",
      "summary",
      "available",
      "yet."
    ],
    "difficulty": "Experimental",
    "quest": "🧪 Experiment Lab: Try the demo and identify its core idea.",
    "likes": 1,
    "sdk": "gradio",
    "status": "unknown",
    "last_modified": "",
    "emoji": "🧪",
    "readme_text": "---\ntitle: Plant Identification Tool\ncolorFrom: green\ncolorTo: gray\nsdk: gradio\napp_file: plant_space_app.py\npinned: false\ntags:\n  - track:backyard\n  - sponsor:openbmb\n  - sponsor:openai\n  - achievement:welltuned\n  - achievement:llama\n  - achievement:sharing\n\n---\n\n📺 **Demo video:** <a href=\"https://huggingface.co/spaces/build-small-hackathon/plant_identification_tool/resolve/main/screencast.webm\">HF Space Recording</a> ·  \n\n📺 **Social Demo:**\nhttps://twitter.com/zX14_7/status/2066669462179242412 \n\n# Plant Identification Tool\n\nA Gradio reference app from the OpenBMB Local AI Workbench template: identify plants locally,\ncorrect mistakes, and export field notes as training data. Deployed on Hugging Face Spaces with\nzero-GPU support via `@spaces.GPU` decorators on the vision callbacks.\n\n## What This Is\n\nThe project turns the PRD in `HF_PRD_v1.md` into a staged implementation:\n\n1. A working Gradio app shell.\n2. Config-driven OpenBMB model registry.\n3. Local-first inference path through real backends: Transformers, Ollama, llama.cpp,\n   LM Studio/OpenAI-compatible, SGLang, and vLLM.\n4. Field notes for collecting corrections.\n5. Extension points for training, GGUF export, Trackio traces, MCP tools, and agent workflows.\n\n## Hackathon Fit\n\n- **Track:** Backyard AI or Thousand Token Wood, depending on the final user story.\n- **Canvas:** Gradio app, deployable to Hugging Face Spaces.\n- **Small model rule:** target models stay at or below 32B parameters.\n- **Bonus quests:** local-first, field notes/report, possible llama.cpp and trace sharing.\n\n## Quick Start\n\nPython is not currently available on PATH in this workspace shell. 
Install Python first, then:\n\n```powershell\n.\\scripts\\verify_structure.ps1\n& \"$env:LOCALAPPDATA\\Microsoft\\WindowsApps\\python3.11.exe\" -m venv .venv\n.venv\\Scripts\\Activate.ps1\npython -m pip install -r requirements.txt\npython -m pip install -r requirements-dev.txt\n.\\scripts\\run_tests.ps1\npython app.py\n```\n\n## Browser E2E Screenshots\n\nAfter installing Node.js:\n\n```powershell\nnpm install\nnpm run e2e:install\nnpm run e2e\n```\n\nThe Playwright user-story tests run separate Workbench and Plant flows and save documentation\nscreenshots under `assets/e2e/workbench/` and `assets/e2e/plant/`. The Workbench screenshot now runs\nlocal GGUF chat through `llama-cpp-python`; the Plant screenshot runs OpenBMB MiniCPM-V on\n`assets/plant_sample.jpg` only when `RUN_REAL_MODEL_E2E=1` is set; otherwise the browser tests verify real-backend setup\nsurfaces without using mock responses. To record or edit the browser flow manually, run\n`npm run e2e:record`.\n\nGenerated screenshot sets:\n\n- [Workbench home](assets/e2e/workbench/01-workbench-home.png)\n- [Workbench backend status](assets/e2e/workbench/05-backend-status.png)\n- [Plant tool home](assets/e2e/plant/01-plant-home.png)\n- [Plant corrections export](assets/e2e/plant/03-corrections-export.png)\n\n## Template And Reference Apps\n\nThis repo is also a template for focused local-first Gradio apps. 
The first reference app is\nPlant Discovery under `plant/`.\n\n```powershell\n.venv\\Scripts\\python.exe -m plant.app --no-model --port 7861\n```\n\nUse the real OpenBMB VLM path after installing optional plant dependencies:\n\n```powershell\n.venv\\Scripts\\python.exe -m plant.app --model-mode openbmb --port 7861\n```\n\nThe detailed build guide is [docs/TEMPLATE_HOWTO.md](docs/TEMPLATE_HOWTO.md), and the Plant\nDiscovery checklist is [docs/PLANT_DISCOVERY_APP_PLAN.md](docs/PLANT_DISCOVERY_APP_PLAN.md).\nModel and adapter training steps are in\n[docs/PLANT_MODEL_AND_TRAINING_HOWTO.md](docs/PLANT_MODEL_AND_TRAINING_HOWTO.md).\n\nIf `python` is not recognized, install Python 3.11+ from python.org or the Windows Store,\nthen reopen the terminal so PATH is refreshed.\n\nOpen the local URL shown by Gradio, usually `http://127.0.0.1:7860`.\n\n## Project Structure\n\n```text\n.\n|-- app.py\n|-- AGENTS.md\n|-- README.md\n|-- requirements.txt\n|-- config/\n|   |-- models.yaml\n|   `-- training.yaml\n|-- core/\n|   |-- events.py\n|   `-- registry.py\n|-- datasets/\n|   `-- field_notes.py\n|-- models/\n|   |-- placeholder_service.py\n|   `-- model_catalog.py\n|-- ui/\n|   |-- chat_tab.py\n|   |-- dataset_tab.py\n|   |-- export_tab.py\n|   |-- notes_tab.py\n|   |-- traces_tab.py\n|   |-- agent_tab.py\n|   |-- status_tab.py\n|   |-- train_tab.py\n|   `-- vision_tab.py\n|-- data/\n|   `-- .gitkeep\n|-- exports/\n|   `-- .gitkeep\n|-- HF_PRD_v1.md\n`-- HF_PRD_ext.md\n```\n\n## Project Docs\n\nThe working docs live in [docs/README.md](docs/README.md).\n\n- [Task checklist](docs/TASKS.md)\n- [Implementation status](docs/IMPLEMENTATION_STATUS.md)\n- [PRD implementation matrix](docs/PRD_IMPLEMENTATION_MATRIX.md)\n- [Acceptance criteria](docs/ACCEPTANCE_CRITERIA.md)\n- [Roadmap](docs/ROADMAP.md)\n- [Template how-to](docs/TEMPLATE_HOWTO.md)\n- [Plant Discovery plan](docs/PLANT_DISCOVERY_APP_PLAN.md)\n- [Usage guide](docs/USAGE.md)\n- [Architecture guide](docs/ARCHITECTURE.md)\n- 
[Extension guide](docs/EXTENDING.md)\n- [Hackathon submission package](docs/H"
  }
]