Kaiju Business-Owner Release Completion Audit

Date: 2026-06-03

This audit maps the active goal to current evidence. It is intentionally conservative: the product-path harness is release-candidate ready for local testing, the fresh v1.8 Qwen 3.6 LoRA adapter exists, and a merged full-model artifact serves locally on Gojira-B. Dynamic SGLang LoRA serving is not counted as release evidence because the corrected LoRA selector crashes on this adapter. The public Hugging Face repos are uploaded and public; the remaining release caveats are raw-agent latency, GGUF runtime smoke, 32k live-default proof, and real Stripe live-mode charging.

Requirement Status

| Requirement | Current evidence | Status |
| --- | --- | --- |
| Continue from RichardEchols/kaiju-coder, not a restart | Branch `codex/kaiju-business-owner-rc` is based on `3d57eae92ad523519473f0ff3eca6661a9736de3`, matching `origin/main`. | Passed |
| GitHub and local source inventory for Kaiju, Kiyomi, RMDW, Makoto, Mezzal, and wiki sources | `release/SOURCE_INVENTORY.md` and `release/source-inventory.json` generated from GitHub metadata, `git ls-remote` SHAs, and the requested local `/Users/richardecholsai7/Documents/RMDW-Wiki` snapshot marked non-authoritative/selective-reference-only. | Passed |
| Legally reusable, provenance-preserving dataset update | `datasets/candidates/v1.7-rmdw-business-owner-suite.jsonl` adds reviewed RMDW-owned examples with `source_repos`, `source_paths`, and `provenance_notes`. | Passed |
| Dataset validation | `python3 scripts/validate_training_data.py --min-examples 350` passes with 1,689 reviewed examples across 14 files. | Passed |
| v1.7 business-owner SFT build | `python3 scripts/build_v17_business_owner_sft_dataset.py` writes 1,881 rows and 192 controlled business-owner repeats. | Passed |
| Hard evals for business-owner workflows | `evals/tasks/router-hard-harness.jsonl` includes `business_suite` prompts; latest local RC smoke run produced 23/23 static pass. | Passed |
| Local Kaiju product path runs | `python3 scripts/run_kaiju_business_owner_rc_smoke.py` validates data, builds SFT, smokes the local API harness, runs router hard eval, and runs static checks. | Passed |
| Complete Kiyomi 7.7.7 AI-company artifact generation | `business_suite` route writes a 19-file pack including launch kit, content engine, connector checklist, intake CRM, reporting, automations, operator handbook, leads, sales, ROI dashboard, and Workshop artifact. | Passed |
| Secret/private-data guardrails | Dataset validation scans common secret patterns; verifier checks `no_hardcoded_secrets`; source inventory excludes credentials, tokens, private client data, and raw logs. | Passed |
| Release artifacts | `release/MODEL_CARD_DRAFT.md`, `release/HF_ADAPTER_MODEL_CARD.md`, `release/DATA_PROVENANCE_DRAFT.md`, `release/EVAL_SCOREBOARD.md`, `release/LOCAL_TEST_INSTRUCTIONS.md`, `release/HUGGINGFACE_RELEASE_DRAFT.md`, `release/FINAL_RELEASE_REPORT.md`, `release/UPSTREAM_LICENSE_CHECK.md`, and this audit. | Passed |
| Fresh Qwen 3.6 v1.7 fine-tune | After clearing old ComfyUI/Ollama workloads from Gojira B, training finished with `metrics.json`, train runtime 1663.7101s, train loss 1.7260706673065822, and an adapter directory. | Passed |
| Local inference against new v1.7 checkpoint | SGLang served `kaiju_v17_business_owner` over Tailscale at `http://100.109.109.14:18083/v1` with `context=4096` and `mem_fraction=0.90`; website and proposal smoke tasks returned non-empty outputs. | Passed |
| Stronger Qwen 3.6 v1.8 fine-tune | Gojira B was cleared of ComfyUI/SGLang/Ollama GPU conflicts; v1.8 finished with `metrics.json`, train runtime 11666.7564s, train loss 0.9281658741335074, and an adapter directory. | Passed |
| v1.8 adapter merged into full model | `scripts/run-gojira-b-qwen36-lora-merge.sh` merged `/workspace/kaiju-coder/runs/qwen36-27b-lora-v1.8-business-owner/adapter` into `/workspace/kaiju-coder/models/Kaiju-Coder-Qwen3.6-27B-v1.8-merged`; remote artifact is 51G with 14 safetensor shards and preserved base config/processor sidecars. | Passed |
| Local inference against v1.8 merged checkpoint | Current fast path serves `kaiju-coder-7` through vLLM bitsandbytes on Gojira-B at `http://100.109.109.14:18084/v1`, exposed locally through `http://127.0.0.1:18181/v1`; current live endpoint reports max model len 16384. Prior SGLang benchmarks proved 12k/16k/24k/32k startup and smoke evidence, with 32k treated as the high-context target rather than the currently parked runtime. | Passed |
| v1.8 merged business-owner eval | Probe returned 1,155 visible chars in 60.17s; proposal rerun scored 1/1, 4.0/4.0, 4,014 chars in 212.72s; Jah credits backend scored 4.0/4.0, 9,718 chars in 566.36s. | Passed with latency caveat |
| OpenCode local run path | Local OpenCode provider/agent is installed for `kaiju/kaiju-coder-7` with 16k context and the scoped no-autocontinue plugin at `/Users/richardecholsai7/.config/opencode/kaiju-no-autocontinue.mjs`. Packaged public verifier `python3 scripts/run_kaiju_public_opencode_smoke.py --base-url http://127.0.0.1:18181/v1 --timeout 900` passed 4/4 in `runs/public-opencode-smoke/20260603T235002Z/summary.md`, including wrong-directory leakage checks; loop-guard smoke wrote `loopguard.txt` with exactly `Kaiju Coder 7 loop guard installed`; latest harnessed customer-readiness pack `runs/opencode-customer-readiness/20260603T185835Z/summary.md` passed 4/4 with 28/28 required files, including release provenance and safety review. | Passed for harnessed/product path |
| Runtime-quantized local path | vLLM bitsandbytes runtime quantization passed identity/code/business-doc smokes at 8k/16k, reported about 17.8 GiB model memory, and passed OpenCode one-file smoke with exact content `Kaiju Coder 7 quantized runtime ok`. Persisted quantized weights are still pending. | Runtime recipe passed; persisted weights pending |
| Paid API gateway scaffold | `cd gateway/cloudflare-worker && npm run check` passes 16/16 Worker tests covering bearer auth, inactive keys, insufficient credits, debit/refund, rate limit before debit, model `kaiju-coder-7` enforcement, streaming/thinking/token caps, secret-content rejection without logging, signed Stripe Checkout top-up idempotency, origin-only R2 artifact upload, and account-scoped artifact download. `python3 scripts/check_paid_api_readiness.py --mode scaffold` passes 17 checks. `python3 scripts/check_paid_api_readiness.py --mode launch` passes 27/27 checks after live Cloudflare bindings, Worker-to-Gojira proof, Stripe test-mode webhook evidence, staging latency, and rollback proof. Real customer charging still requires a deliberate Stripe live-mode switch and controlled live payment verification. | Scaffold and launch preflight passed; live-mode charging pending |
| Dynamic SGLang LoRA selector | Adapter-name-only serving can be base-equivalent; corrected selector `qwen36-27b:kaiju_v18_business_owner` crashes with `LoRA buffer shape torch.Size([8192, 16]) does not match weight shape torch.Size([14336, 16])`. | Not release path |
| Hugging Face helper repo upload readiness | Adapter, OpenCode helper, and runtime-quantized recipe staging folders build under `/tmp/kaiju-coder-7-hf-staging`; public repos `RMDWLLC/kaiju-coder-7-adapter`, `RMDWLLC/kaiju-coder-7-opencode`, and `RMDWLLC/kaiju-coder-7-quantized-runtime` are uploaded and public. `python3 scripts/check_hf_uploaded_release.py --namespace RMDWLLC --apply --require-public` verifies public downloads and helper package content. | Uploaded and public |
| Hugging Face merged model upload readiness | `RMDWLLC/kaiju-coder-7` is uploaded and public with the merged 53.8G model package and 14 safetensors shards recorded in `release/HF_UPLOAD_EVIDENCE.md`. Public downloads are verified; the previous private-storage blocker was resolved by switching the repos public. | Uploaded and public |
| Consolidated release readiness check | `python3 scripts/check_kaiju_public_release_readiness.py --mode local`, `--mode hf-release`, and `--mode public` pass against the current fast proxy and public HF evidence. The checker validates staged files, public naming hygiene, secret-looking raw values, checksums, final report, HF bundle checksum, uploaded evidence, and human signoff. | Local, HF, and public modes passed |
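
The latency caveat on the v1.8 merged business-owner eval can be made concrete by converting the reported character counts and wall-clock times into throughput. The sketch below is only arithmetic over the three figures in that row; the names `EVAL_RUNS`, `chars_per_second`, and `summarize` are illustrative and not part of the repository, and the result covers visible output only.

```python
# Visible output size (characters) and wall-clock time (seconds), copied from
# the "v1.8 merged business-owner eval" row of the requirement table.
EVAL_RUNS = {
    "probe": (1155, 60.17),
    "proposal rerun": (4014, 212.72),
    "Jah credits backend": (9718, 566.36),
}


def chars_per_second(chars: int, seconds: float) -> float:
    """Average visible-character throughput for one run."""
    return chars / seconds


def summarize(runs: dict[str, tuple[int, float]]) -> dict[str, float]:
    """Map each run name to its throughput, rounded to one decimal place."""
    return {
        name: round(chars_per_second(chars, seconds), 1)
        for name, (chars, seconds) in runs.items()
    }


for name, rate in summarize(EVAL_RUNS).items():
    print(f"{name}: {rate} chars/s")
```

On these figures all three runs land between roughly 17 and 19 visible characters per second, which suggests the long wall-clock times track output length rather than a per-task slowdown.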

Commands With Current Passing Evidence

python3 -m unittest discover -s tests -p 'test_*.py'
python3 scripts/run_kaiju_business_owner_rc_smoke.py
python3 scripts/run_kaiju_opencode_customer_pack.py --mode harnessed
python3 scripts/install_kaiju_opencode_profile.py
mkdir -p /tmp/kaiju-opencode-fresh-public-smoke
opencode run -m kaiju/kaiju-coder-7 --agent kaiju-coder-7 --dir /tmp/kaiju-opencode-fresh-public-smoke --dangerously-skip-permissions 'Create hello.txt with exactly: Kaiju Coder 7 fresh public smoke ok'
opencode run -m kaiju/kaiju-coder-7 --agent kaiju-coder-7 --dir /tmp/kaiju-opencode-loopguard-smoke --dangerously-skip-permissions 'Create loopguard.txt with exactly: Kaiju Coder 7 loop guard installed'
python3 scripts/check_paid_api_readiness.py --mode scaffold
python3 -m py_compile scripts/run_kaiju_api_harness_smoke.py scripts/run_kaiju_business_owner_rc_smoke.py scripts/build_v17_business_owner_sft_dataset.py kaiju_harness/business_suite.py kaiju_harness/router.py kaiju_harness/verification.py
git diff --check
bash scripts/upload_hf_merged_model_from_gojira_b.sh
KAIJU_MERGED_METADATA_APPLY=1 bash scripts/prepare_hf_merged_model_metadata.sh
python3 scripts/check_hf_staging_integrity.py
python3 scripts/check_human_release_review.py --mode local
python3 scripts/generate_kaiju_final_report.py
python3 scripts/create_hf_release_bundle.py
python3 scripts/check_kaiju_public_release_readiness.py --mode local
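
The commands above could be re-run in one pass with a small wrapper that records a pass/fail result per command. This is a minimal sketch, not a script that ships in the repository: `run_checks` and `report` are hypothetical helpers, only a subset of the listed commands is included, and commands that need environment-variable prefixes or a running endpoint are left out.

```python
import shlex
import subprocess

# A subset of the commands listed above; extend as needed.
COMMANDS = [
    "python3 -m unittest discover -s tests -p 'test_*.py'",
    "python3 scripts/run_kaiju_business_owner_rc_smoke.py",
    "python3 scripts/check_paid_api_readiness.py --mode scaffold",
    "git diff --check",
]


def run_checks(commands, cwd=None):
    """Run each command without a shell and return {command: passed}."""
    results = {}
    for command in commands:
        try:
            completed = subprocess.run(shlex.split(command), cwd=cwd)
            results[command] = completed.returncode == 0
        except FileNotFoundError:
            # The executable itself is not on PATH; count that as a failure.
            results[command] = False
    return results


def report(results):
    """Format one PASS/FAIL line per command, in execution order."""
    return "\n".join(
        f"{'PASS' if ok else 'FAIL'}  {command}" for command, ok in results.items()
    )
```

Calling `print(report(run_checks(COMMANDS)))` from the repository root would print one line per command. A zero exit code is treated as the only passing signal, so a command that prints warnings but exits 0 still counts as passed.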

Remaining Blockers

The fresh v1.8 adapter, merged full-model artifact, and direct merged-model inference path are proven. The current completed local release candidate is:

Kaiju Coder 7 merged model + deterministic business-owner harness + verifier + source-backed v1.7/v1.8 dataset/release package

Describe the release candidate in exactly those terms until external release review confirms each of the following:

  • GGUF Q8_0 runtime smoke before public quantized-weight claims
  • raw website latency/SLA positioning or explicit harness-first website positioning
  • broader base Qwen and GLM comparison results before superiority claims
  • 32k context freshly restarted and re-confirmed before making it the live default
  • Stripe live-mode products/webhook secret and a controlled live payment before selling real paid API access
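
The 32k item in the list above can be checked mechanically once the runtime is restarted. The sketch below assumes the endpoint's `/v1/models` response carries a `max_model_len` field for each model, as vLLM's OpenAI-compatible server does; other runtimes may not expose it. The helper names and the sample payload are illustrative, and the sample mirrors the 16384 limit this audit reports for the current live endpoint.

```python
import json
import urllib.request

TARGET_CONTEXT = 32768  # the "32k" target from the checklist


def reported_context(models_payload: dict, model_id: str):
    """Return max_model_len for model_id from a /v1/models payload, or None."""
    for entry in models_payload.get("data", []):
        if entry.get("id") == model_id:
            return entry.get("max_model_len")
    return None


def meets_target(models_payload: dict, model_id: str, target: int = TARGET_CONTEXT) -> bool:
    """True only when the model is listed and reports at least `target` tokens."""
    length = reported_context(models_payload, model_id)
    return length is not None and length >= target


def fetch_models(base_url: str, timeout: float = 10.0) -> dict:
    """GET {base_url}/models and decode the JSON body (not called here)."""
    url = f"{base_url.rstrip('/')}/models"
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)


# Sample payload shaped like the currently reported 16384 limit (assumed shape).
sample = {"object": "list", "data": [{"id": "kaiju-coder-7", "max_model_len": 16384}]}
print(meets_target(sample, "kaiju-coder-7"))  # False
```

Against a live server, `meets_target(fetch_models("http://127.0.0.1:18181/v1"), "kaiju-coder-7")` would need to return `True` before 32k is treated as the live default. A missing field is deliberately treated as a failure rather than a pass.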