# Kaiju Coder 7 Public Testing Quickstart Kaiju Coder 7 is the public model name. The OpenAI-compatible model id is: ```text kaiju-coder-7 ``` Use this guide for serious public testing. It avoids internal checkpoint names and keeps the current limitations clear. ## Pick A Test Path ### Path 1: OpenCode Against An Existing Endpoint Use this if you already have Kaiju Coder 7 served at an OpenAI-compatible `/v1` endpoint. ```bash git clone https://huggingface.co/RMDWLLC/kaiju-coder-7-opencode cd kaiju-coder-7-opencode python3 scripts/install_kaiju_opencode_profile.py --base-url http://127.0.0.1:18181/v1 ``` Then run OpenCode inside the project you want to edit: ```bash opencode ``` The installer sets `kaiju/kaiju-coder-7` as the OpenCode model and `kaiju-coder-7` as the default agent. You can still select `kaiju/kaiju-coder-7` manually from OpenCode's model picker if you switch away. For a bounded smoke test: ```bash mkdir -p /tmp/kaiju-public-smoke opencode run --dir /tmp/kaiju-public-smoke \ "Create hello.txt with exactly: Kaiju Coder 7 is ready" ``` Or run the packaged verifier, which checks the installer, live model endpoint, OpenCode binary, actual file creation, and wrong-directory behavior: ```bash python3 scripts/run_kaiju_public_opencode_smoke.py ``` The helper installer adds: - the `kaiju` OpenAI-compatible provider - `model: kaiju/kaiju-coder-7` and `default_agent: kaiju-coder-7` - the lean `kaiju-coder-7` OpenCode agent - Kaiju as the default primary agent, so selecting Kaiju Coder 7 uses the hidden fast artifact path without requiring `/kaiju` - the `kaiju-coder-7-run` router command for fast websites, owner packs, and Desktop artifact folders - the `kaiju_artifact` OpenCode custom tool and `/kaiju` command for routing large artifact prompts through the fast local router - a scoped no-autocontinue plugin that prevents false completion loops after compaction or output limits For a fast website or owner-pack artifact without waiting on raw OpenCode multi-file streaming, run: ```bash kaiju-coder-7-run \ --no-planner \ --kind website \ --out-dir "$HOME/Desktop/Kaiju-Coder-7-Test" \ --prompt "Build a premium one-page website for Harborline Bookkeeping with pricing, FAQ, and a cleanup-call CTA." ``` OpenCode should use this same command internally for large website, business-pack, and Desktop-output requests after the helper is installed. Inside OpenCode, `/kaiju` is optional for large generated artifacts. The command is prompt-backed, but it points the Kaiju agent at the `kaiju_artifact` custom tool instead of making the model hand-write every file. ### Path 2: Full Local Weights Use this if the full `RMDWLLC/kaiju-coder-7` Hugging Face repo has been uploaded and you have suitable local GPU hardware. ```bash hf download RMDWLLC/kaiju-coder-7 --local-dir ./kaiju-coder-7 ``` Serve the downloaded folder with an OpenAI-compatible local server. Configure the server to expose: ```text model id: kaiju-coder-7 base URL: http://127.0.0.1:18084/v1 context: 16384 ``` For the fastest OpenCode behavior, run the bundled fast proxy in a separate terminal and point OpenCode at the proxy: ```bash KAIJU_OPENAI_BASE_URL=http://127.0.0.1:18084/v1 \ python3 scripts/kaiju_opencode_fast_proxy.py --host 127.0.0.1 --port 18181 ``` Then install the OpenCode helper with: ```bash git clone https://huggingface.co/RMDWLLC/kaiju-coder-7-opencode cd kaiju-coder-7-opencode python3 scripts/install_kaiju_opencode_profile.py --base-url http://127.0.0.1:18181/v1 ``` ### Path 3: Runtime-Quantized Local Candidate Use this only if you are comfortable with advanced serving setups. The current working quantized option is a runtime bitsandbytes recipe. A Q8_0 GGUF artifact has been converted, but it is still a candidate until runtime smoke passes. ```bash git clone https://huggingface.co/RMDWLLC/kaiju-coder-7-quantized-runtime cd kaiju-coder-7-quantized-runtime ``` Read `README.md` in that repo before serving. This path can reduce model memory at runtime, but it still depends on access to the full Kaiju Coder 7 weights. ## Recommended Test Prompt Run this from an empty project folder: ```text Build a launch-ready local service business website and operating pack. Include index.html, a Stripe checkout safety plan, a CSV parser with tests, a simple CRM schema, a weekly money report, and a safety/provenance note. Write the files, not just advice. ``` Expected result: - files are written in the requested project folder - `index.html` is complete HTML - business docs start with Markdown H1 headings - code includes a test or smoke-check command where practical - no fake API keys, OAuth tokens, payment secrets, or private customer data ## Current Recommended Defaults - Public model id: `kaiju-coder-7` - OpenCode context: `16384` - Output cap for public testing: `2500` - Fast OpenCode path: vLLM bitsandbytes runtime behind the Kaiju fast proxy - Current reliable product path: model plus deterministic business-owner harness/router plus verifier - Raw multi-file OpenCode generation: still too slow for broad paid claims; use `kaiju-coder-7-run` for fast public website and owner-pack tests while broader raw-model latency gates continue - Paid API: not public until launch preflight passes and the Stripe live-mode switch is deliberately completed ## What Not To Claim Yet Do not claim: - that raw model weights alone reliably build every business-owner artifact - that a paid hosted API is generally available - that persisted quantized weights exist - that 32k context is the current live default Do claim: - Kaiju Coder 7 has a working local/OpenCode release candidate - the current tested OpenCode default is 16k context - the helper package includes a lean agent and compaction loop guard - the helper package includes the `kaiju-coder-7-run` router command for fast artifact generation - the fast proxy keeps OpenCode tool calls intact while forcing bounded, non-thinking generation - the paid API scaffold has tests and a launch preflight, but is not yet public - the packaged public smoke verifies a fresh OpenCode one-file write before public claims are refreshed - a GGUF Q8_0 candidate exists, but is not public quantized-weights release evidence until runtime smoke passes ## Remaining Caveats Before Broader Claims - Hugging Face public release repos are uploaded and public under `RMDWLLC`. - The GGUF Q8_0 candidate still needs a runtime smoke before public quantized-weights upload. - Raw multi-file OpenCode generation is still not the public speed story; use the deterministic router/harness for websites and business-owner packs. - Public paid API launch has approval and preflight evidence, but real customer charging still needs a deliberate Stripe live-mode switch and controlled live payment verification. - Do not claim 32k context as the live default until it is freshly restarted and re-confirmed.