kaiju-coder-7-adapter / COMPLETION_AUDIT.md
restokes92's picture
Upload Kaiju Coder 7 adapter release package
97fc8fc verified
# Kaiju Business-Owner Release Completion Audit
Date: 2026-06-03
This audit maps the active goal to current evidence. It is intentionally
conservative: the product-path harness is release-candidate ready for local
testing, the fresh v1.8 Qwen 3.6 LoRA adapter exists, and a merged full-model
artifact serves locally on Gojira-B. Dynamic SGLang LoRA serving is not counted
as release evidence because the corrected LoRA selector crashes on this
adapter. The public Hugging Face repos are uploaded and public; the remaining
release caveats are raw-agent latency, GGUF runtime smoke, 32k live-default
proof, and real Stripe live-mode charging.
## Requirement Status
| Requirement | Current evidence | Status |
|---|---|---|
| Continue from `RichardEchols/kaiju-coder`, not a restart | Branch `codex/kaiju-business-owner-rc` is based on `3d57eae92ad523519473f0ff3eca6661a9736de3`, matching `origin/main`. | Passed |
| GitHub and local source inventory for Kaiju, Kiyomi, RMDW, Makoto, Mezzal, and wiki sources | `release/SOURCE_INVENTORY.md` and `release/source-inventory.json` generated from GitHub metadata, `git ls-remote` SHAs, and the requested local `/Users/richardecholsai7/Documents/RMDW-Wiki` snapshot marked non-authoritative/selective-reference-only. | Passed |
| Legally reusable, provenance-preserving dataset update | `datasets/candidates/v1.7-rmdw-business-owner-suite.jsonl` adds reviewed RMDW-owned examples with `source_repos`, `source_paths`, and `provenance_notes`. | Passed |
| Dataset validation | `python3 scripts/validate_training_data.py --min-examples 350` passes with `1,689` reviewed examples across `14` files. | Passed |
| v1.7 business-owner SFT build | `python3 scripts/build_v17_business_owner_sft_dataset.py` writes `1,881` rows and `192` controlled business-owner repeats. | Passed |
| Hard evals for business-owner workflows | `evals/tasks/router-hard-harness.jsonl` includes `business_suite` prompts; latest local RC smoke run produced `23/23` static pass. | Passed |
| Local Kaiju product path runs | `python3 scripts/run_kaiju_business_owner_rc_smoke.py` validates data, builds SFT, smokes the local API harness, runs router hard eval, and runs static checks. | Passed |
| Complete Kiyomi 7.7.7 AI-company artifact generation | `business_suite` route writes a 19-file pack including launch kit, content engine, connector checklist, intake CRM, reporting, automations, operator handbook, leads, sales, ROI dashboard, and Workshop artifact. | Passed |
| Secret/private-data guardrails | Dataset validation scans common secret patterns; verifier checks `no_hardcoded_secrets`; source inventory excludes credentials, tokens, private client data, and raw logs. | Passed |
| Release artifacts | `release/MODEL_CARD_DRAFT.md`, `release/HF_ADAPTER_MODEL_CARD.md`, `release/DATA_PROVENANCE_DRAFT.md`, `release/EVAL_SCOREBOARD.md`, `release/LOCAL_TEST_INSTRUCTIONS.md`, `release/HUGGINGFACE_RELEASE_DRAFT.md`, `release/FINAL_RELEASE_REPORT.md`, `release/UPSTREAM_LICENSE_CHECK.md`, and this audit. | Passed |
| Fresh Qwen 3.6 v1.7 fine-tune | After clearing old ComfyUI/Ollama workloads from Gojira B, training finished with `metrics.json`, train runtime `1663.7101s`, train loss `1.7260706673065822`, and an adapter directory. | Passed |
| Local inference against new v1.7 checkpoint | SGLang served `kaiju_v17_business_owner` over Tailscale at `http://100.109.109.14:18083/v1` with `context=4096` and `mem_fraction=0.90`; website and proposal smoke tasks returned non-empty outputs. | Passed |
| Stronger Qwen 3.6 v1.8 fine-tune | Gojira B was cleared of ComfyUI/SGLang/Ollama GPU conflicts; v1.8 finished with `metrics.json`, train runtime `11666.7564s`, train loss `0.9281658741335074`, and an adapter directory. | Passed |
| v1.8 adapter merged into full model | `scripts/run-gojira-b-qwen36-lora-merge.sh` merged `/workspace/kaiju-coder/runs/qwen36-27b-lora-v1.8-business-owner/adapter` into `/workspace/kaiju-coder/models/Kaiju-Coder-Qwen3.6-27B-v1.8-merged`; remote artifact is `51G` with `14` safetensor shards and preserved base config/processor sidecars. | Passed |
| Local inference against v1.8 merged checkpoint | Current fast path serves `kaiju-coder-7` through vLLM bitsandbytes on Gojira-B at `http://100.109.109.14:18084/v1`, exposed locally through `http://127.0.0.1:18181/v1`; current live endpoint reports max model len `16384`. Prior SGLang benchmarks proved 12k/16k/24k/32k startup and smoke evidence, with 32k treated as the high-context target rather than the currently parked runtime. | Passed |
| v1.8 merged business-owner eval | Probe returned `1,155` visible chars in `60.17s`; proposal rerun scored `1/1`, `4.0/4.0`, `4,014` chars in `212.72s`; Jah credits backend scored `4.0/4.0`, `9,718` chars in `566.36s`. | Passed with latency caveat |
| OpenCode local run path | Local OpenCode provider/agent is installed for `kaiju/kaiju-coder-7` with 16k context and the scoped no-autocontinue plugin at `/Users/richardecholsai7/.config/opencode/kaiju-no-autocontinue.mjs`. Packaged public verifier `python3 scripts/run_kaiju_public_opencode_smoke.py --base-url http://127.0.0.1:18181/v1 --timeout 900` passed `4/4` in `runs/public-opencode-smoke/20260603T235002Z/summary.md`, including wrong-directory leakage checks; loop-guard smoke wrote `loopguard.txt` with exactly `Kaiju Coder 7 loop guard installed`; latest harnessed customer-readiness pack `runs/opencode-customer-readiness/20260603T185835Z/summary.md` passed `4/4` with `28/28` required files, including release provenance and safety review. | Passed for harnessed/product path |
| Runtime-quantized local path | vLLM bitsandbytes runtime quantization passed identity/code/business-doc smokes at 8k/16k, reported about `17.8 GiB` model memory, and passed OpenCode one-file smoke with exact content `Kaiju Coder 7 quantized runtime ok`. Persisted quantized weights are still pending. | Runtime recipe passed; persisted weights pending |
| Paid API gateway scaffold | `cd gateway/cloudflare-worker && npm run check` passes `16/16` Worker tests covering bearer auth, inactive keys, insufficient credits, debit/refund, rate limit before debit, model `kaiju-coder-7` enforcement, streaming/thinking/token caps, secret-content rejection without logging, signed Stripe Checkout top-up idempotency, origin-only R2 artifact upload, and account-scoped artifact download. `python3 scripts/check_paid_api_readiness.py --mode scaffold` passes `17` checks. `python3 scripts/check_paid_api_readiness.py --mode launch` passes `27/27` checks after live Cloudflare bindings, Worker-to-Gojira proof, Stripe test-mode webhook evidence, staging latency, and rollback proof. Real customer charging still requires a deliberate Stripe live-mode switch and controlled live payment verification. | Scaffold and launch preflight passed; live-mode charging pending |
| Dynamic SGLang LoRA selector | Adapter-name-only serving can be base-equivalent; corrected selector `qwen36-27b:kaiju_v18_business_owner` crashes with `LoRA buffer shape torch.Size([8192, 16]) does not match weight shape torch.Size([14336, 16])`. | Not release path |
| Hugging Face helper repo upload readiness | Adapter, OpenCode helper, and runtime-quantized recipe staging folders build under `/tmp/kaiju-coder-7-hf-staging`; public repos `RMDWLLC/kaiju-coder-7-adapter`, `RMDWLLC/kaiju-coder-7-opencode`, and `RMDWLLC/kaiju-coder-7-quantized-runtime` are uploaded and public. `python3 scripts/check_hf_uploaded_release.py --namespace RMDWLLC --apply --require-public` verifies public downloads and helper package content. | Uploaded and public |
| Hugging Face merged model upload readiness | `RMDWLLC/kaiju-coder-7` is uploaded and public with the merged `53.8G` model package and `14` safetensors shards recorded in `release/HF_UPLOAD_EVIDENCE.md`. Public downloads are verified; the previous private-storage blocker was resolved by switching the repos public. | Uploaded and public |
| Consolidated release readiness check | `python3 scripts/check_kaiju_public_release_readiness.py --mode local`, `--mode hf-release`, and `--mode public` pass against the current fast proxy and public HF evidence. The checker validates staged files, public naming hygiene, secret-looking raw values, checksums, final report, HF bundle checksum, uploaded evidence, and human signoff. | Local, HF, and public modes passed |
## Commands With Current Passing Evidence
```bash
python3 -m unittest discover -s tests -p 'test_*.py'
python3 scripts/run_kaiju_business_owner_rc_smoke.py
python3 scripts/run_kaiju_opencode_customer_pack.py --mode harnessed
python3 scripts/install_kaiju_opencode_profile.py
mkdir -p /tmp/kaiju-opencode-fresh-public-smoke
opencode run -m kaiju/kaiju-coder-7 --agent kaiju-coder-7 --dir /tmp/kaiju-opencode-fresh-public-smoke --dangerously-skip-permissions 'Create hello.txt with exactly: Kaiju Coder 7 fresh public smoke ok'
opencode run -m kaiju/kaiju-coder-7 --agent kaiju-coder-7 --dir /tmp/kaiju-opencode-loopguard-smoke --dangerously-skip-permissions 'Create loopguard.txt with exactly: Kaiju Coder 7 loop guard installed'
python3 scripts/check_paid_api_readiness.py --mode scaffold
python3 -m py_compile scripts/run_kaiju_api_harness_smoke.py scripts/run_kaiju_business_owner_rc_smoke.py scripts/build_v17_business_owner_sft_dataset.py kaiju_harness/business_suite.py kaiju_harness/router.py kaiju_harness/verification.py
git diff --check
bash scripts/upload_hf_merged_model_from_gojira_b.sh
KAIJU_MERGED_METADATA_APPLY=1 bash scripts/prepare_hf_merged_model_metadata.sh
python3 scripts/check_hf_staging_integrity.py
python3 scripts/check_human_release_review.py --mode local
python3 scripts/generate_kaiju_final_report.py
python3 scripts/create_hf_release_bundle.py
python3 scripts/check_kaiju_public_release_readiness.py --mode local
```
## Remaining Blocker
The fresh v1.8 adapter, merged full-model artifact, and direct merged-model inference path are proven. The current completed local release candidate is:
```text
Kaiju Coder 7 merged model + deterministic business-owner harness + verifier + source-backed v1.7/v1.8 dataset/release package
```
That must be described honestly until external release review confirms:
- GGUF Q8_0 runtime smoke before public quantized-weight claims
- raw website latency/SLA positioning or explicit harness-first website positioning
- broader base Qwen and GLM comparison results before superiority claims
- 32k context freshly restarted and re-confirmed before making it the live
default
- Stripe live-mode products/webhook secret and a controlled live payment before
selling real paid API access