{"type": "lora_sft_manifest", "schema_version": 1, "generated_at": "2026-06-08T18:35:51+00:00", "app": "hackathon-advisor", "base_model": "openbmb/MiniCPM5-1B", "adapter_task": "hackathon_advisor_quest_classification", "format": "chat-jsonl", "record_kinds": ["quest_classification"], "source": "build_small_hackathon real projects + targeted augmentations", "example_count": 185, "empty_match_examples": 27, "variant_counts": {"natural": 108, "app_only": 16, "missing_app_file": 16, "noisy_metadata": 8, "contradiction": 6, "empty": 2, "remote_app_only": 15, "hard_negative": 14}, "quest_positive_counts": {"Off the Grid": 93, "Well-Tuned": 31, "Off-Brand": 64, "Llama Champion": 37, "Sharing is Caring": 19, "Field Notes": 15, "Backyard AI": 36, "Thousand Token Wood": 50, "OpenBMB": 29, "Nemotron": 19, "Modal": 16, "Tiny Titan": 61, "Best Agent": 15}, "quests": ["Off the Grid", "Well-Tuned", "Off-Brand", "Llama Champion", "Sharing is Caring", "Field Notes", "Backyard AI", "Thousand Token Wood", "OpenBMB", "Nemotron", "Modal", "Tiny Titan", "Best Agent"]} {"type": "lora_sft_example", "schema_version": 1, "base_model": "openbmb/MiniCPM5-1B", "adapter_task": "hackathon_advisor_quest_classification", "example_kind": "quest_classification", "project_id": "build-small-hackathon/dental-soap", "variant": "natural", "match_count": 6, "quests": ["Backyard AI", "Field Notes", "Off the Grid", "Off-Brand", "Sharing is Caring", "Tiny Titan"], "messages": [{"role": "system", "content": "You classify hackathon projects against fixed quest dimensions. Return exactly one strict JSON object and nothing else. The first character must be { and the last character must be }. Each match needs quest, confidence, evidence, and source (readme or app_file). Never emit markdown, prose, a top-level array, extra keys, or an unknown or rephrased quest name."}, {"role": "user", "content": "Classify this hackathon project against the quest dimensions below.\nRead the two evidence segments (README and APP_FILE) and judge each quest only from them.\n\nQuests (copy the id on the left verbatim):\n- Off the Grid: Runs the model on-device with no remote inference call: weights load locally and inference happens in-process, not over a hosted API. Signals: AWARD on a local in-process load: from_pretrained / pipeline / llama_cpp / diffusers / vLLM / ONNX, GGUF weights, @spaces.GPU. DISQUALIFY (do NOT award) on ANY remote inference call, even via huggingface_hub: InferenceClient, HF Inference API/Endpoints, gradio_client to a remote Space, replicate/together/openrouter/fal/groq, a *.modal.run or other HTTP inference endpoint, or openai/anthropic/gemini/cohere clients. A remote call disqualifies regardless of which model it names.\n- Well-Tuned: Uses or publishes a fine-tuned or LoRA-adapted model rather than only stock checkpoints. Signals: LoRA/PEFT adapter, fine-tuned model repo, training script, words like fine-tune, adapter, SFT, distilled.\n- Off-Brand: Ships a custom interface beyond default Gradio styling, with a memorable look or voice. Signals: custom CSS/HTML/JS, gr.HTML, gr.Blocks theme/css=, gr.Server, custom components, bespoke theming.\n- Llama Champion: Runs a model through the llama.cpp runtime. Signals: llama-cpp-python, from llama_cpp import Llama, GGUF file, llama.cpp, Llama( constructor.\n- Sharing is Caring: Produces an output people can save, post, or compare, or publishes an agent trace to the Hub. Signals: download/export button, gr.File/gr.DownloadButton, save PNG/PDF/JSON, push_to_hub of a trace or dataset.\n- Field Notes: Documents the build itself with notes, a write-up, or a blog/report link. Signals: README has a substantial build write-up, devlog, lessons learned, or a blog/report/Notion link.\n- Backyard AI: Solves a concrete real-world problem for a specific, named person or persona. Signals: README frames a real user and task (caregiving, a relative, a job, a household chore), practical utility.\n- Thousand Token Wood: A delightful, playful, or artistic experience that would not exist without AI. Signals: story/game/art/whimsy framing, generative characters or worlds, playful tone, creative novelty.\n- OpenBMB: Uses a model published by OpenBMB (the openbmb org), such as the MiniCPM family. Signals: The model id org prefix must be exactly openbmb/ (openbmb/MiniCPM*, OpenCPM). A model from any other org is NOT OpenBMB: openai/gpt-oss, Qwen/..., meta-llama/..., google/..., nvidia/..., microsoft/..., mistralai/... do NOT count just because a model id is present.\n- Nemotron: Uses an NVIDIA Nemotron model (Nemotron LLM, Parakeet, Nemotron-Speech, Canary). Signals: model repo nvidia/...nemotron..., Parakeet, nemotron-speech, Canary ASR.\n- Modal: Uses Modal for training, inference, or background compute. Signals: import modal, modal.App, @app.function, Modal endpoint/volume, README cites Modal compute.\n- Tiny Titan: Runs on a genuinely small model of about four billion parameters or fewer. Signals: AWARD when the model name says <=4B: 0.5B/1B/1.5B/2B/3B/4B or tiny/small/nano/mini (Qwen2.5-1.5B, MiniCPM5-1B, gemma-2b). Do NOT award for 7B/8B/12B/13B/20B/27B/35B+ models (e.g. gpt-oss-20b, Qwen2.5-7B); a version number like V-4.6 is not a parameter count.\n- Best Agent: An agentic build: tool use, function calling, planning, or an autonomous multi-step loop. Signals: tool/function calling, an agent/planner loop, multiple orchestrated tools, ReAct, multi-step reasoning over tools.\n\nRules:\n- Include a quest only when a segment gives clear, specific evidence.\n- quest must be one id from the list above, copied exactly. Never invent or rephrase a quest name.\n- confidence is a number between 0 and 1.\n- evidence is a 3-to-12 word quote or tight paraphrase taken from the segment you cite.\n- source is \"readme\" when the evidence is in the README segment, \"app_file\" when it is in the APP_FILE segment.\n- At most one match per quest. Sort matches by confidence, highest first.\n- If no quest has clear evidence, return an empty matches list.\n- Output exactly one JSON object: {\"matches\":[{\"quest\":\"...\",\"confidence\":0.0,\"evidence\":\"...\",\"source\":\"readme\"}]}.\n- No markdown, no code fences, no commentary, no extra keys.\n\nMETADATA: {\"title\": \"Dental SOAP\", \"sdk\": \"gradio\", \"declared_models\": [\"Qwen/Qwen3-4B-Instruct-2507\"], \"tags\": [\"agents\", \"bilingual\", \"healthcare\", \"zero-gpu\"]}\n\n[README]\n# Dental SOAP A doctor could not clearly explain his own crown, root canal, bite, and TMJ story to his dentist, so he built a small-model visit-prep tool that turns patient chaos into a one-page dentist handoff. Dental SOAP is not an AI dentist. It is a patient education and visit-documentation aid. It organizes patient-reported history to bring to a licensed dentist. It does not diagnose, interpret imaging, prescribe medication, or choose a dental procedure. Fixed safety rules can advise urgent in-person care. ## Why This Exists (A Real Case) The builder's own case took three months and two specialties to untangle: an extraction with an immediate sinus repair, a crown that felt high from day one, a molar adjusted five times without relief, jaw/TMJ soreness — and finally an ENT confirming the sinus infection that tied it together. Referred pain does not respect specialty boundaries; the only thing that crossed them cleanly was a written handoff a clinician could scan in under a minute. He lived that workflow manually with a frontier cloud model between real appointments. Dental SOAP is the Build Small answer: the same narrow job, done by a 4B open model inside the Space, with deterministic rules guarding safety. The `Try Ahmed's case` example **is** that case, de-identified — and the `repeated bite adjustments without lasting relief` safety rule exists because it happened to him. **Try it in 10 seconds:** click **Try Ahmed's case** — the handoff renders instantly from a valid ...\n\n[APP_FILE] app.py\nfrom __future__ import annotations\nfrom pathlib import Path\nfrom typing import Any\nfrom examples import CHECK_OPTIONS, EXAMPLES, STEP2_CHECKS, STEP3_CHECKS, STEP4_CHECKS\nfrom interview_schema import ExtractedIntake, extracted_to_intake\nfrom pdf_export import build_pdf\nfrom render import (\nfrom safety_rules import evaluate_red_flags\nfrom pydantic import ValidationError\nfrom schema import (\nfrom transformers import AutoModelForCausalLM, AutoTokenizer as _AutoTokenizer\nfrom transformers import AutoModelForCausalLM, AutoTokenizer\n\nfrom __future__ import annotations\n\nimport dataclasses\nimport json\nimport os\nimport re\nimport sys\nimport threading\nfrom pathlib import Path\nfrom typing import Any\n\nimport gradio as gr\n\nclass AgentUnavailable(RuntimeError):\n \"\"\"The local model endpoint could not produce a usable response.\"\"\"\n\nimport interview as interview_mod\nfrom examples import CHECK_OPTIONS, EXAMPLES, STEP2_CHECKS, STEP3_CHECKS, STEP4_CHECKS\nfrom interview_schema import ExtractedIntake, extracted_to_intake\nfrom pdf_export import build_pdf\nfrom render import (\n footer_html,\n header_html,\n initial_safety_html,\n placeholder_handoff_html,\n plain_text_handoff,\n rail_html,\n render_handoff_html,\n render_safety_html,\n step_head,\n initial_agent_dashboard_html,\n render_agent_dashboard,\n)\nfrom safety_rules import evaluate_red_flags\nfrom pydantic import ValidationError\nfrom schema import (\n BLOCKED_QUESTION_TERMS,\n EvidenceSpan,\n HandoffOutput,\n ModelHandoffDraft,\n PatientProfile,\n StructuredIntake,\n model_text_is_safe,\n)\n\n\ntry:\n import spaces\nexcept Exception:\n class _SpacesFallback:\n @staticmethod\n def GPU(fn=None, /, *, duration: int = 120):\n # Support both @spaces.GPU (fn is callable) and @spaces.GPU(duration=N)\n # (fn is None, returns a decorator).\n def decorator(f): ..."}, {"role": "assistant", "content": "{\"matches\":[{\"quest\":\"Tiny Titan\",\"confidence\":0.97,\"evidence\":\"4B open model inside the Space; Qwen3-4B-Instruct-2507\",\"source\":\"readme\"},{\"quest\":\"Backyard AI\",\"confidence\":0.96,\"evidence\":\"builder's own case took three months and two specialties to untangle\",\"source\":\"readme\"},{\"quest\":\"Off the Grid\",\"confidence\":0.9,\"evidence\":\"from transformers import AutoModelForCausalLM, AutoTokenizer; no cloud client\",\"source\":\"app_file\"},{\"quest\":\"Sharing is Caring\",\"confidence\":0.85,\"evidence\":\"from pdf_export import build_pdf\",\"source\":\"app_file\"},{\"quest\":\"Off-Brand\",\"confidence\":0.82,\"evidence\":\"header_html, footer_html, rail_html, render_handoff_html custom HTML\",\"source\":\"app_file\"},{\"quest\":\"Field Notes\",\"confidence\":0.78,\"evidence\":\"Why This Exists (A Real Case) write-up of the build motivation\",\"source\":\"readme\"}]}"}]} {"type": "lora_sft_example", "schema_version": 1, "base_model": "openbmb/MiniCPM5-1B", "adapter_task": "hackathon_advisor_quest_classification", "example_kind": "quest_classification", "project_id": "build-small-hackathon/pakistan-notice-helper", "variant": "natural", "match_count": 5, "quests": ["Backyard AI", "Llama Champion", "Off the Grid", "Off-Brand", "OpenBMB"], "messages": [{"role": "system", "content": "You classify hackathon projects against fixed quest dimensions. Return exactly one strict JSON object and nothing else. The first character must be { and the last character must be }. Each match needs quest, confidence, evidence, and source (readme or app_file). Never emit markdown, prose, a top-level array, extra keys, or an unknown or rephrased quest name."}, {"role": "user", "content": "Classify this hackathon project against the quest dimensions below.\nRead the two evidence segments (README and APP_FILE) and judge each quest only from them.\n\nQuests (copy the id on the left verbatim):\n- Off the Grid: Runs the model on-device with no remote inference call: weights load locally and inference happens in-process, not over a hosted API. Signals: AWARD on a local in-process load: from_pretrained / pipeline / llama_cpp / diffusers / vLLM / ONNX, GGUF weights, @spaces.GPU. DISQUALIFY (do NOT award) on ANY remote inference call, even via huggingface_hub: InferenceClient, HF Inference API/Endpoints, gradio_client to a remote Space, replicate/together/openrouter/fal/groq, a *.modal.run or other HTTP inference endpoint, or openai/anthropic/gemini/cohere clients. A remote call disqualifies regardless of which model it names.\n- Well-Tuned: Uses or publishes a fine-tuned or LoRA-adapted model rather than only stock checkpoints. Signals: LoRA/PEFT adapter, fine-tuned model repo, training script, words like fine-tune, adapter, SFT, distilled.\n- Off-Brand: Ships a custom interface beyond default Gradio styling, with a memorable look or voice. Signals: custom CSS/HTML/JS, gr.HTML, gr.Blocks theme/css=, gr.Server, custom components, bespoke theming.\n- Llama Champion: Runs a model through the llama.cpp runtime. Signals: llama-cpp-python, from llama_cpp import Llama, GGUF file, llama.cpp, Llama( constructor.\n- Sharing is Caring: Produces an output people can save, post, or compare, or publishes an agent trace to the Hub. Signals: download/export button, gr.File/gr.DownloadButton, save PNG/PDF/JSON, push_to_hub of a trace or dataset.\n- Field Notes: Documents the build itself with notes, a write-up, or a blog/report link. Signals: README has a substantial build write-up, devlog, lessons learned, or a blog/report/Notion link.\n- Backyard AI: Solves a concrete real-world problem for a specific, named person or persona. Signals: README frames a real user and task (caregiving, a relative, a job, a household chore), practical utility.\n- Thousand Token Wood: A delightful, playful, or artistic experience that would not exist without AI. Signals: story/game/art/whimsy framing, generative characters or worlds, playful tone, creative novelty.\n- OpenBMB: Uses a model published by OpenBMB (the openbmb org), such as the MiniCPM family. Signals: The model id org prefix must be exactly openbmb/ (openbmb/MiniCPM*, OpenCPM). A model from any other org is NOT OpenBMB: openai/gpt-oss, Qwen/..., meta-llama/..., google/..., nvidia/..., microsoft/..., mistralai/... do NOT count just because a model id is present.\n- Nemotron: Uses an NVIDIA Nemotron model (Nemotron LLM, Parakeet, Nemotron-Speech, Canary). Signals: model repo nvidia/...nemotron..., Parakeet, nemotron-speech, Canary ASR.\n- Modal: Uses Modal for training, inference, or background compute. Signals: import modal, modal.App, @app.function, Modal endpoint/volume, README cites Modal compute.\n- Tiny Titan: Runs on a genuinely small model of about four billion parameters or fewer. Signals: AWARD when the model name says <=4B: 0.5B/1B/1.5B/2B/3B/4B or tiny/small/nano/mini (Qwen2.5-1.5B, MiniCPM5-1B, gemma-2b). Do NOT award for 7B/8B/12B/13B/20B/27B/35B+ models (e.g. gpt-oss-20b, Qwen2.5-7B); a version number like V-4.6 is not a parameter count.\n- Best Agent: An agentic build: tool use, function calling, planning, or an autonomous multi-step loop. Signals: tool/function calling, an agent/planner loop, multiple orchestrated tools, ReAct, multi-step reasoning over tools.\n\nRules:\n- Include a quest only when a segment gives clear, specific evidence.\n- quest must be one id from the list above, copied exactly. Never invent or rephrase a quest name.\n- confidence is a number between 0 and 1.\n- evidence is a 3-to-12 word quote or tight paraphrase taken from the segment you cite.\n- source is \"readme\" when the evidence is in the README segment, \"app_file\" when it is in the APP_FILE segment.\n- At most one match per quest. Sort matches by confidence, highest first.\n- If no quest has clear evidence, return an empty matches list.\n- Output exactly one JSON object: {\"matches\":[{\"quest\":\"...\",\"confidence\":0.0,\"evidence\":\"...\",\"source\":\"readme\"}]}.\n- No markdown, no code fences, no commentary, no extra keys.\n\nMETADATA: {\"title\": \"Pakistan Notice Helper\", \"sdk\": \"gradio\", \"declared_models\": [], \"tags\": [\"backyard-ai\", \"build-small-hackathon\", \"gradio\", \"llama.cpp\", \"modal\"]}\n\n[README]\n# Pakistan Notice Helper Pakistan Notice Helper is a MiniCPM-V-powered safety assistant for confusing or suspicious Pakistani notices, bills, SMS messages, bank alerts, FBR-style messages, challans, and courier/customs messages. It accepts pasted text and screenshots, then returns: - **Risk label:** Looks normal, Verify first, Suspicious, or Likely scam - A simple English explanation - Red flags found - Safe next steps - A polite reply draft The interface is a custom mobile-first frontend served by [`gradio.Server`](https://www.gradio.app/main/guides/server-mode). Gradio provides queueing, API routes, and Hugging Face Spaces hosting without exposing a default Gradio UI. > **Pakistan Notice Helper does not provide official verification. It checks > common scam signals and gives safe next steps. Always verify through official > websites or helplines before making payments or sharing personal > information.** ## Build Small Hackathon This is a **Backyard AI** project built for the [Build Small Hackathon](https://huggingface.co/build-small-hackathon). It addresses a common local problem: people receive convincing payment notices, bank alerts, courier messages, challans, and government impersonation scams but may not know which details are unsafe. - **Space:** [build-small-hackathon/pakistan-notice-helper](https://huggingface.co/spaces/build-small-hackathon/pakistan-notice-helper) - **Source:** [kingabzpro/pakistan-notice-helper](https://github.com/kingabzpro/pakistan-notice-helpe ...\n\n[APP_FILE] app.py\nfrom __future__ import annotations\nfrom io import BytesIO\nfrom pathlib import Path\nfrom typing import Any\nfrom fastapi.responses import FileResponse\nfrom fastapi.staticfiles import StaticFiles\nfrom gradio import Server\nfrom PIL import Image, ImageOps, UnidentifiedImageError\nfrom traces.runtime import queue_trace, start_trace_worker, trace_status\nfrom huggingface_hub import hf_hub_download\nfrom llama_cpp import Llama\nfrom llama_cpp.llama_chat_format import MTMDChatHandler\nfrom llama_cpp import Llama, llama_cpp\n\n\"\"\"Pakistan Notice Helper: custom frontend with a queued Gradio backend.\"\"\"\n\nfrom __future__ import annotations\n\nimport argparse\nimport base64\nimport ctypes\nimport gc\nimport importlib.util\nimport json\nimport logging\nimport os\nimport re\nimport sys\nimport threading\nimport time\nfrom io import BytesIO\nfrom pathlib import Path\nfrom typing import Any\n\nimport spaces\nfrom fastapi.responses import FileResponse\nfrom fastapi.staticfiles import StaticFiles\nfrom gradio import Server\nfrom PIL import Image, ImageOps, UnidentifiedImageError\nfrom traces.runtime import queue_trace, start_trace_worker, trace_status\n\nROOT = Path(__file__).resolve().parent\nSTATIC_DIR = ROOT / \"static\"\nDISCLAIMER = (\n \"Pakistan Notice Helper does not provide official verification. It checks \"\n \"common scam signals and gives safe next steps. Always verify through \"\n \"official websites or helplines before making payments or sharing personal \"\n \"information.\"\n)\nRISK_LABELS = (\"Looks normal\", \"Verify first\", \"Suspicious\", \"Likely scam\", \"Inappropriate\")\nDEFAULT_MODEL_REPO = \"openbmb/MiniCPM-V-4.6-gguf\"\nDEFAULT_MODEL_FILE = \"MiniCPM-V-4_6-Q4_K_M.gguf\"\nDEFAULT_MMPROJ_FILE = \"mmproj-model-f16.gguf\"\nMAX_IMAGE_BYTES = 8 * 1024 * 1024\nMAX_IMAGE_PIXELS = 40_000_000\nDEFAULT_IMAGE_MAX_DIMENSION = 1536\nDEFAULT_IMAGE_JPEG_QUALITY = 90\nREQUIRED_FIELDS = {\n \"risk_label\",\n \"simple_explanation\",\n \" ..."}, {"role": "assistant", "content": "{\"matches\":[{\"quest\":\"Llama Champion\",\"confidence\":0.98,\"evidence\":\"from llama_cpp import Llama\",\"source\":\"app_file\"},{\"quest\":\"OpenBMB\",\"confidence\":0.98,\"evidence\":\"DEFAULT_MODEL_REPO = \\\"openbmb/MiniCPM-V-4.6-gguf\\\"\",\"source\":\"app_file\"},{\"quest\":\"Off-Brand\",\"confidence\":0.93,\"evidence\":\"custom mobile-first frontend served by gradio.Server\",\"source\":\"readme\"},{\"quest\":\"Off the Grid\",\"confidence\":0.92,\"evidence\":\"from llama_cpp import Llama; MiniCPM-V-4_6 GGUF weights\",\"source\":\"app_file\"},{\"quest\":\"Backyard AI\",\"confidence\":0.78,\"evidence\":\"Backyard AI project; people receive convincing payment notices\",\"source\":\"readme\"}]}"}]} {"type": "lora_sft_example", "schema_version": 1, "base_model": "openbmb/MiniCPM5-1B", "adapter_task": "hackathon_advisor_quest_classification", "example_kind": "quest_classification", "project_id": "build-small-hackathon/Kintsugi-Garden", "variant": "natural", "match_count": 3, "quests": ["Off the Grid", "Thousand Token Wood", "Tiny Titan"], "messages": [{"role": "system", "content": "You classify hackathon projects against fixed quest dimensions. Return exactly one strict JSON object and nothing else. The first character must be { and the last character must be }. Each match needs quest, confidence, evidence, and source (readme or app_file). Never emit markdown, prose, a top-level array, extra keys, or an unknown or rephrased quest name."}, {"role": "user", "content": "Classify this hackathon project against the quest dimensions below.\nRead the two evidence segments (README and APP_FILE) and judge each quest only from them.\n\nQuests (copy the id on the left verbatim):\n- Off the Grid: Runs the model on-device with no remote inference call: weights load locally and inference happens in-process, not over a hosted API. Signals: AWARD on a local in-process load: from_pretrained / pipeline / llama_cpp / diffusers / vLLM / ONNX, GGUF weights, @spaces.GPU. DISQUALIFY (do NOT award) on ANY remote inference call, even via huggingface_hub: InferenceClient, HF Inference API/Endpoints, gradio_client to a remote Space, replicate/together/openrouter/fal/groq, a *.modal.run or other HTTP inference endpoint, or openai/anthropic/gemini/cohere clients. A remote call disqualifies regardless of which model it names.\n- Well-Tuned: Uses or publishes a fine-tuned or LoRA-adapted model rather than only stock checkpoints. Signals: LoRA/PEFT adapter, fine-tuned model repo, training script, words like fine-tune, adapter, SFT, distilled.\n- Off-Brand: Ships a custom interface beyond default Gradio styling, with a memorable look or voice. Signals: custom CSS/HTML/JS, gr.HTML, gr.Blocks theme/css=, gr.Server, custom components, bespoke theming.\n- Llama Champion: Runs a model through the llama.cpp runtime. Signals: llama-cpp-python, from llama_cpp import Llama, GGUF file, llama.cpp, Llama( constructor.\n- Sharing is Caring: Produces an output people can save, post, or compare, or publishes an agent trace to the Hub. Signals: download/export button, gr.File/gr.DownloadButton, save PNG/PDF/JSON, push_to_hub of a trace or dataset.\n- Field Notes: Documents the build itself with notes, a write-up, or a blog/report link. Signals: README has a substantial build write-up, devlog, lessons learned, or a blog/report/Notion link.\n- Backyard AI: Solves a concrete real-world problem for a specific, named person or persona. Signals: README frames a real user and task (caregiving, a relative, a job, a household chore), practical utility.\n- Thousand Token Wood: A delightful, playful, or artistic experience that would not exist without AI. Signals: story/game/art/whimsy framing, generative characters or worlds, playful tone, creative novelty.\n- OpenBMB: Uses a model published by OpenBMB (the openbmb org), such as the MiniCPM family. Signals: The model id org prefix must be exactly openbmb/ (openbmb/MiniCPM*, OpenCPM). A model from any other org is NOT OpenBMB: openai/gpt-oss, Qwen/..., meta-llama/..., google/..., nvidia/..., microsoft/..., mistralai/... do NOT count just because a model id is present.\n- Nemotron: Uses an NVIDIA Nemotron model (Nemotron LLM, Parakeet, Nemotron-Speech, Canary). Signals: model repo nvidia/...nemotron..., Parakeet, nemotron-speech, Canary ASR.\n- Modal: Uses Modal for training, inference, or background compute. Signals: import modal, modal.App, @app.function, Modal endpoint/volume, README cites Modal compute.\n- Tiny Titan: Runs on a genuinely small model of about four billion parameters or fewer. Signals: AWARD when the model name says <=4B: 0.5B/1B/1.5B/2B/3B/4B or tiny/small/nano/mini (Qwen2.5-1.5B, MiniCPM5-1B, gemma-2b). Do NOT award for 7B/8B/12B/13B/20B/27B/35B+ models (e.g. gpt-oss-20b, Qwen2.5-7B); a version number like V-4.6 is not a parameter count.\n- Best Agent: An agentic build: tool use, function calling, planning, or an autonomous multi-step loop. Signals: tool/function calling, an agent/planner loop, multiple orchestrated tools, ReAct, multi-step reasoning over tools.\n\nRules:\n- Include a quest only when a segment gives clear, specific evidence.\n- quest must be one id from the list above, copied exactly. Never invent or rephrase a quest name.\n- confidence is a number between 0 and 1.\n- evidence is a 3-to-12 word quote or tight paraphrase taken from the segment you cite.\n- source is \"readme\" when the evidence is in the README segment, \"app_file\" when it is in the APP_FILE segment.\n- At most one match per quest. Sort matches by confidence, highest first.\n- If no quest has clear evidence, return an empty matches list.\n- Output exactly one JSON object: {\"matches\":[{\"quest\":\"...\",\"confidence\":0.0,\"evidence\":\"...\",\"source\":\"readme\"}]}.\n- No markdown, no code fences, no commentary, no extra keys.\n\nMETADATA: {\"title\": \"The Kintsugi Garden\", \"sdk\": \"gradio\", \"declared_models\": [], \"tags\": [\"gradio\", \"region:us\"]}\n\n[README]\n
हेर — Marathi for “detective.”
A detective for your coding-agent sessions. Drop a Claude Code session export and Her reads the whole trace — so you can see what actually happened, and what to do better next time.
A whole town of tiny AI minds — alive, gossiping, and feuding on your laptop. Fully offline.
Big labs need a datacenter to run one mind.
Smol Town runs a whole town of them on a gaming GPU.