{"taxonomy_text": "- Off the Grid (Local-first): Runs entirely on local or open-weight models with no proprietary cloud inference APIs. Signals: local transformers/llama.cpp/vLLM model load, GGUF weights, no openai/anthropic/gemini/cohere API client.\n- Well-Tuned (Fine-tuned): Uses or publishes a fine-tuned or LoRA-adapted model rather than only stock checkpoints. Signals: LoRA/PEFT adapter, fine-tuned model repo, training script, words like fine-tune, adapter, SFT, distilled.\n- Off-Brand (Custom frontend): Ships a custom interface beyond default Gradio styling, with a memorable look or voice. Signals: custom CSS/HTML/JS, gr.HTML, gr.Blocks theme/css=, gr.Server, custom components, bespoke theming.\n- Llama Champion (llama.cpp path): Runs a model through the llama.cpp runtime. Signals: llama-cpp-python, from llama_cpp import Llama, GGUF file, llama.cpp, Llama( constructor.\n- Sharing is Caring (Shareable artifact): Produces an output people can save, post, or compare, or publishes an agent trace to the Hub. Signals: download/export button, gr.File/gr.DownloadButton, save PNG/PDF/JSON, push_to_hub of a trace or dataset.\n- Field Notes (Build notes): Documents the build itself with notes, a write-up, or a blog/report link. Signals: README has a substantial build write-up, devlog, lessons learned, or a blog/report/Notion link.\n- Backyard AI (Real problem for one person): Solves a concrete real-world problem for a specific, named person or persona. Signals: README frames a real user and task (caregiving, a relative, a job, a household chore), practical utility.\n- Thousand Token Wood (Delightful & creative): A delightful, playful, or artistic experience that would not exist without AI. Signals: story/game/art/whimsy framing, generative characters or worlds, playful tone, creative novelty.\n- OpenBMB (OpenBMB model): Uses an OpenBMB model such as the MiniCPM family. Signals: model repo openbmb/..., MiniCPM, MiniCPM-V, MiniCPM5, OpenCPM.\n- Nemotron (NVIDIA Nemotron): Uses an NVIDIA Nemotron model (Nemotron LLM, Parakeet, Nemotron-Speech, Canary). Signals: model repo nvidia/...nemotron..., Parakeet, nemotron-speech, Canary ASR.\n- Modal (Modal-powered): Uses Modal for training, inference, or background compute. Signals: import modal, modal.App, @app.function, Modal endpoint/volume, README cites Modal compute.\n- Tiny Titan (Small model (<=4B)): Runs on a genuinely small model of about four billion parameters or fewer. Signals: declared model is 0.5B/1B/1.5B/2B/3B/4B or labelled tiny/small/nano/mini (e.g. Qwen2.5-1.5B, MiniCPM5-1B, gemma-2b).\n- Best Agent (Agentic): An agentic build: tool use, function calling, planning, or an autonomous multi-step loop. Signals: tool/function calling, an agent/planner loop, multiple orchestrated tools, ReAct, multi-step reasoning over tools.", "count": 108, "projects": [{"id": "build-small-hackathon/dental-soap", "slug": "dental-soap", "title": "Dental SOAP", "sdk": "gradio", "models": ["Qwen/Qwen3-4B-Instruct-2507"], "tags": ["agents", "bilingual", "healthcare", "zero-gpu"], "app_file": "app.py", "structural_profile": "both", "readme": "# Dental SOAP A doctor could not clearly explain his own crown, root canal, bite, and TMJ story to his dentist, so he built a small-model visit-prep tool that turns patient chaos into a one-page dentist handoff. Dental SOAP is not an AI dentist. It is a patient education and visit-documentation aid. It organizes patient-reported history to bring to a licensed dentist. It does not diagnose, interpret imaging, prescribe medication, or choose a dental procedure. Fixed safety rules can advise urgent in-person care. ## Why This Exists (A Real Case) The builder's own case took three months and two specialties to untangle: an extraction with an immediate sinus repair, a crown that felt high from day one, a molar adjusted five times without relief, jaw/TMJ soreness — and finally an ENT confirming the sinus infection that tied it together. Referred pain does not respect specialty boundaries; the only thing that crossed them cleanly was a written handoff a clinician could scan in under a minute. He lived that workflow manually with a frontier cloud model between real appointments. Dental SOAP is the Build Small answer: the same narrow job, done by a 4B open model inside the Space, with deterministic rules guarding safety. The `Try Ahmed's case` example **is** that case, de-identified — and the `repeated bite adjustments without lasting relief` safety rule exists because it happened to him. **Try it in 10 seconds:** click **Try Ahmed's case** — the handoff renders instantly from a valid ...", "app": "from __future__ import annotations\nfrom pathlib import Path\nfrom typing import Any\nfrom examples import CHECK_OPTIONS, EXAMPLES, STEP2_CHECKS, STEP3_CHECKS, STEP4_CHECKS\nfrom interview_schema import ExtractedIntake, extracted_to_intake\nfrom pdf_export import build_pdf\nfrom render import (\nfrom safety_rules import evaluate_red_flags\nfrom pydantic import ValidationError\nfrom schema import (\nfrom transformers import AutoModelForCausalLM, AutoTokenizer as _AutoTokenizer\nfrom transformers import AutoModelForCausalLM, AutoTokenizer\n\nfrom __future__ import annotations\n\nimport dataclasses\nimport json\nimport os\nimport re\nimport sys\nimport threading\nfrom pathlib import Path\nfrom typing import Any\n\nimport gradio as gr\n\nclass AgentUnavailable(RuntimeError):\n \"\"\"The local model endpoint could not produce a usable response.\"\"\"\n\nimport interview as interview_mod\nfrom examples import CHECK_OPTIONS, EXAMPLES, STEP2_CHECKS, STEP3_CHECKS, STEP4_CHECKS\nfrom interview_schema import ExtractedIntake, extracted_to_intake\nfrom pdf_export import build_pdf\nfrom render import (\n footer_html,\n header_html,\n initial_safety_html,\n placeholder_handoff_html,\n plain_text_handoff,\n rail_html,\n render_handoff_html,\n render_safety_html,\n step_head,\n initial_agent_dashboard_html,\n render_agent_dashboard,\n)\nfrom safety_rules import evaluate_red_flags\nfrom pydantic import ValidationError\nfrom schema import (\n BLOCKED_QUESTION_TERMS,\n EvidenceSpan,\n HandoffOutput,\n ModelHandoffDraft,\n PatientProfile,\n StructuredIntake,\n model_text_is_safe,\n)\n\n\ntry:\n import spaces\nexcept Exception:\n class _SpacesFallback:\n @staticmethod\n def GPU(fn=None, /, *, duration: int = 120):\n # Support both @spaces.GPU (fn is callable) and @spaces.GPU(duration=N)\n # (fn is None, returns a decorator).\n def decorator(f): ..."}, {"id": "build-small-hackathon/pakistan-notice-helper", "slug": "pakistan-notice-helper", "title": "Pakistan Notice Helper", "sdk": "gradio", "models": [], "tags": ["backyard-ai", "build-small-hackathon", "gradio", "llama.cpp", "modal"], "app_file": "app.py", "structural_profile": "both", "readme": "# Pakistan Notice Helper Pakistan Notice Helper is a MiniCPM-V-powered safety assistant for confusing or suspicious Pakistani notices, bills, SMS messages, bank alerts, FBR-style messages, challans, and courier/customs messages. It accepts pasted text and screenshots, then returns: - **Risk label:** Looks normal, Verify first, Suspicious, or Likely scam - A simple English explanation - Red flags found - Safe next steps - A polite reply draft The interface is a custom mobile-first frontend served by [`gradio.Server`](https://www.gradio.app/main/guides/server-mode). Gradio provides queueing, API routes, and Hugging Face Spaces hosting without exposing a default Gradio UI. > **Pakistan Notice Helper does not provide official verification. It checks > common scam signals and gives safe next steps. Always verify through official > websites or helplines before making payments or sharing personal > information.** ## Build Small Hackathon This is a **Backyard AI** project built for the [Build Small Hackathon](https://huggingface.co/build-small-hackathon). It addresses a common local problem: people receive convincing payment notices, bank alerts, courier messages, challans, and government impersonation scams but may not know which details are unsafe. - **Space:** [build-small-hackathon/pakistan-notice-helper](https://huggingface.co/spaces/build-small-hackathon/pakistan-notice-helper) - **Source:** [kingabzpro/pakistan-notice-helper](https://github.com/kingabzpro/pakistan-notice-helpe ...", "app": "from __future__ import annotations\nfrom io import BytesIO\nfrom pathlib import Path\nfrom typing import Any\nfrom fastapi.responses import FileResponse\nfrom fastapi.staticfiles import StaticFiles\nfrom gradio import Server\nfrom PIL import Image, ImageOps, UnidentifiedImageError\nfrom traces.runtime import queue_trace, start_trace_worker, trace_status\nfrom huggingface_hub import hf_hub_download\nfrom llama_cpp import Llama\nfrom llama_cpp.llama_chat_format import MTMDChatHandler\nfrom llama_cpp import Llama, llama_cpp\n\n\"\"\"Pakistan Notice Helper: custom frontend with a queued Gradio backend.\"\"\"\n\nfrom __future__ import annotations\n\nimport argparse\nimport base64\nimport ctypes\nimport gc\nimport importlib.util\nimport json\nimport logging\nimport os\nimport re\nimport sys\nimport threading\nimport time\nfrom io import BytesIO\nfrom pathlib import Path\nfrom typing import Any\n\nimport spaces\nfrom fastapi.responses import FileResponse\nfrom fastapi.staticfiles import StaticFiles\nfrom gradio import Server\nfrom PIL import Image, ImageOps, UnidentifiedImageError\nfrom traces.runtime import queue_trace, start_trace_worker, trace_status\n\nROOT = Path(__file__).resolve().parent\nSTATIC_DIR = ROOT / \"static\"\nDISCLAIMER = (\n \"Pakistan Notice Helper does not provide official verification. It checks \"\n \"common scam signals and gives safe next steps. Always verify through \"\n \"official websites or helplines before making payments or sharing personal \"\n \"information.\"\n)\nRISK_LABELS = (\"Looks normal\", \"Verify first\", \"Suspicious\", \"Likely scam\", \"Inappropriate\")\nDEFAULT_MODEL_REPO = \"openbmb/MiniCPM-V-4.6-gguf\"\nDEFAULT_MODEL_FILE = \"MiniCPM-V-4_6-Q4_K_M.gguf\"\nDEFAULT_MMPROJ_FILE = \"mmproj-model-f16.gguf\"\nMAX_IMAGE_BYTES = 8 * 1024 * 1024\nMAX_IMAGE_PIXELS = 40_000_000\nDEFAULT_IMAGE_MAX_DIMENSION = 1536\nDEFAULT_IMAGE_JPEG_QUALITY = 90\nREQUIRED_FIELDS = {\n \"risk_label\",\n \"simple_explanation\",\n \" ..."}, {"id": "build-small-hackathon/Kintsugi-Garden", "slug": "Kintsugi-Garden", "title": "The Kintsugi Garden", "sdk": "gradio", "models": [], "tags": ["gradio", "region:us"], "app_file": "app.py", "structural_profile": "both", "readme": "
हेर — Marathi for “detective.”
A detective for your coding-agent sessions. Drop a Claude Code session export and Her reads the whole trace — so you can see what actually happened, and what to do better next time.
A whole town of tiny AI minds — alive, gossiping, and feuding on your laptop. Fully offline.
Big labs need a datacenter to run one mind.
Smol Town runs a whole town of them on a gaming GPU.